Categorization of broadcast audio objects in complex auditory scenes

Woodcock, JS ORCID:, Davies, WJ ORCID:, Cox, TJ ORCID: and Melchior, F 2016, 'Categorization of broadcast audio objects in complex auditory scenes' , Journal of the Audio Engineering Society, 64 (6) , pp. 380-394.

PDF - Published Version
Available under License Creative Commons Attribution 4.0.

Download (404kB) | Preview
[img] PDF - Accepted Version
Restricted to Repository staff only

Download (511kB) | Request a copy


This paper presents a series of experiments to determine a categorization framework for broadcast audio objects. Object-based audio is becoming an evermore important paradigm for the representation of complex sound scenes. However, there is a lack of knowledge regarding object level perception and cognitive processing of complex broadcast audio scenes. As categorization is a fundamental strategy in reducing cognitive load, knowledge of the categories utilized by listeners in the perception of complex scenes will be beneficial to the development of perceptually based representations and rendering strategies for object-based audio. In this study, expert and non-expert listeners took part in a free card sorting task using audio objects from a variety of different types of programme material. Hierarchical agglomerative clustering suggests that there are 7 general categories, which relate to sounds indicating actions and movement, continuous and transient background sound, clear speech, non-diegetic music and effects, vocalisations, and prominent attention grabbing transient sounds. A three dimensional perceptual space calculated via multidimensional scaling suggests that these categories vary along dimensions related to the semantic content of the objects, the temporal extent of the objects, and whether the object indicates the presence of people.

Item Type: Article
Schools: Schools > School of Computing, Science and Engineering > Salford Innovation Research Centre
Journal or Publication Title: Journal of the Audio Engineering Society
Publisher: Audio Engineering Society (AES)
ISSN: 1549-4950
Related URLs:
Funders: Engineering and Physical Sciences Research Council (EPSRC)
Depositing User: USIR Admin
Date Deposited: 29 Feb 2016 10:57
Last Modified: 15 Feb 2022 20:22

Actions (login required)

Edit record (repository staff only) Edit record (repository staff only)