Li, FF ORCID: https://orcid.org/0000-0001-9053-963X and Cox, TJ
ORCID: https://orcid.org/0000-0002-4075-7564
2007,
'A neural network model for speech intelligibility quantification'
, Applied Soft Computing, 7 (1)
, pp. 145-155.
Abstract
A neural network based model is developed to quantify speech intelligibility by blind-estimating speech transmission index, an objective rating index for speech intelligibility of transmission channels, from transmitted speech signals without resort to knowledge of original speech signals. It consists of a Hilbert transform processor for speech envelope detection, a Welch average periodogram algorithm for envelope spectrum estimation, a principal components analysis (PCA) network for speech feature extraction and a multi-layer back-propagation network for non-linear mapping and case generalisation. The developed model circumvents the use of artificial test signals by exploiting naturally occurring speech signals as probe stimuli, reduces measurement channels from two to one and hence facilitates in situ assessment of speech intelligibility. From a cognitive science viewpoint, the proposed method might be viewed as a successful paradigm of mimicking human perception of speech intelligibility using a hybrid model built around artificial neural networks.
Item Type: | Article |
---|---|
Themes: | Subjects / Themes > Q Science > QC Physics > QC221-246 Acoustics - Sound Subjects outside of the University Themes |
Schools: | Schools > School of the Built Environment Schools > School of Computing, Science and Engineering > Salford Innovation Research Centre |
Journal or Publication Title: | Applied Soft Computing |
Publisher: | Elsevier |
Refereed: | Yes |
ISSN: | 1568-4946 |
Depositing User: | H Kenna |
Date Deposited: | 11 Sep 2007 13:18 |
Last Modified: | 27 Aug 2021 19:48 |
URI: | https://usir.salford.ac.uk/id/eprint/435 |
Actions (login required)
![]() |
Edit record (repository staff only) |