The effect of situation-specific non-speech acoustic cues on the intelligibility of speech in noise

Ward, L, Shirley, BG ORCID: https://orcid.org/0000-0001-9634-4489, Tang, Y and Davies, WJ ORCID: https://orcid.org/0000-0002-5835-7489 2017, The effect of situation-specific non-speech acoustic cues on the intelligibility of speech in noise, in: INTERSPEECH 2017, 18th Annual Conference of the International Speech Communication Association, August 20-24, 2017, Stockholm, Sweden.

Abstract

In everyday life, speech is often accompanied by a situation-specific acoustic cue: for example, a hungry bark as you ask ‘Has anyone fed the dog?’. This paper investigates the effect such cues have on speech intelligibility in noise and evaluates their interaction with the established effect of situation-specific semantic cues. This work is motivated by the introduction of new object-based broadcast formats, which have the potential to optimise intelligibility by controlling the level of individual broadcast audio elements at the point of service. Results of this study show that situation-specific acoustic cues alone can improve word recognition in multi-talker babble by 69.5%, a similar amount to semantic cues. The combination of semantic and acoustic cues provides a further improvement: 106.0% compared with no cues, and 18.7% compared with semantic cues only. Interestingly, whilst increasing subjective intelligibility of the target word, the presence of acoustic cues degraded the objective intelligibility of the speech-based semantic cues by 47.0% (equivalent to reducing the speech level by 4.5 dB). This paper discusses the interactions between the two types of cues and the implications that these results have for assessing and improving the intelligibility of broadcast speech.
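
The percentages in the abstract can be cross-checked against one another. The sketch below is illustrative only: it assumes that every figure is a relative improvement in the same word recognition score, which the abstract does not state explicitly, and the function and constant names are hypothetical. Under that assumption, the gain from semantic cues alone is implied by the two figures reported for the combined condition.

```python
# Reported relative improvements from the abstract, in percent.
ACOUSTIC_VS_NONE = 69.5   # acoustic cues only, relative to no cues
BOTH_VS_NONE = 106.0      # both cue types, relative to no cues
BOTH_VS_SEMANTIC = 18.7   # both cue types, relative to semantic cues only


def implied_semantic_gain(both_vs_none: float, both_vs_semantic: float) -> float:
    """Relative gain (percent) of semantic cues over no cues.

    Assumption: all percentages are relative changes in one score, so
    score_both = score_none * (1 + both_vs_none / 100) and
    score_both = score_semantic * (1 + both_vs_semantic / 100).
    """
    both_ratio = 1.0 + both_vs_none / 100.0
    step_ratio = 1.0 + both_vs_semantic / 100.0
    return (both_ratio / step_ratio - 1.0) * 100.0


semantic_vs_none = implied_semantic_gain(BOTH_VS_NONE, BOTH_VS_SEMANTIC)
print(f"Implied semantic-only gain: {semantic_vs_none:.1f}%")
print(f"Reported acoustic-only gain: {ACOUSTIC_VS_NONE:.1f}%")
```

If the assumption holds, the implied semantic-only gain comes out a few percentage points above the acoustic-only figure, which agrees with the statement that the two cue types help by a similar amount. The derived value is a consistency check, not a result reported by the authors.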

Item Type: Conference or Workshop Item (Paper)
Schools: Schools > School of Computing, Science and Engineering
Journal or Publication Title: INTERSPEECH 2017, 18th Annual Conference of the International Speech Communication Association
Funders: General Sir John Monash Foundation
Depositing User: Y Tang
Date Deposited: 07 Jun 2017 12:48
Last Modified: 15 Feb 2022 22:06
URI: https://usir.salford.ac.uk/id/eprint/42533
