Intelligibility prediction for speech mixed with white Gaussian noise at low signal-to-noise ratios

Graetzer, SN ORCID: https://orcid.org/0000-0003-1446-5637 and Hopkins, C 2021, 'Intelligibility prediction for speech mixed with white Gaussian noise at low signal-to-noise ratios' , The Journal of the Acoustical Society of America (JASA), 149 (2) , pp. 1346-1362.

[img]
Preview
PDF - Published Version
Available under License Creative Commons Attribution 4.0.

Download (4MB) | Preview

Abstract

The effect of additive white Gaussian noise and high-pass filtering on speech intelligibility at signal-to-noise ratios (SNRs) from -26 to 0 dB was evaluated using British English talkers and normal hearing listeners. SNRs below -10 dB were considered as they are relevant to speech security applications. Eight objective metrics were assessed: Short-Time Objective Intelligibility (STOI), a proposed variant termed STOI+, Extended Short-Time Objective Intelligibility (ESTOI), Normalised Covariance Metric (NCM), Normalised Sub-band Envelope Correlation metric (NSEC), two metrics derived from the Coherence Speech Intelligibility Index (CSII), and an envelope-based regression method Speech Transmission Index (STI). For speech and noise mixtures associated with intelligibility scores ranging from 0% to 98%, STOI+ performed at least as well as other metrics, and under some conditions better than STOI, ESTOI, STI, NSEC, CSIIMid and CSIIHigh. Both STOI+ and NCM were associated with relatively low prediction error and bias for intelligibility prediction at SNRs from -26 to 0 dB. STI performed least well in terms of correlation with intelligibility scores, prediction error, bias and reliability. Logistic regression modelling demonstrated that high-pass filtering, which increases the proportion of high to low frequency energy, was detrimental to intelligibility for SNRs between -5 and -17 dB inclusive.

Item Type: Article
Schools: Schools > School of Computing, Science and Engineering
Journal or Publication Title: The Journal of the Acoustical Society of America (JASA)
Publisher: Acoustical Society of America
ISSN: 0001-4966
Related URLs:
Depositing User: Dr Simone Graetzer
Date Deposited: 05 Mar 2021 09:11
Last Modified: 16 Feb 2022 06:51
URI: https://usir.salford.ac.uk/id/eprint/59778

Actions (login required)

Edit record (repository staff only) Edit record (repository staff only)

Downloads

Downloads per month over past year