Kendrick, P, Jackson, IR, Li, FF, Fazenda, BM and Cox, TJ 2015, 'Perceived audio quality of sounds degraded by non-linear distortions and single-ended assessment using HASQI' , Journal of the Audio Engineering Society, 63 (9) , pp. 698-712.
- Published Version
Available under License Creative Commons Attribution.
Download (3MB) | Preview
For field recordings and user generated content recorded on phones, tablets, and other mobile devices nonlinear distortions caused by clipping and limiting at pre-amplification stages, and dynamic range control (DRC) are common causes of poor audio quality. A single-ended method to detect these distortions and predict perceived degradation in speech, music, and soundscapes has been developed. This was done by training an ensemble of decision trees. During training, both clean and distorted audio was available and so the perceived quality could be gauged using HASQI (Hearing Aid Sound Quality Index). The new single-ended method can correctly predict HASQI from distorted samples to an accuracy of ±0.19 (95% confidence interval) using a quality range between 0.0 and 1.0. The method also has potential for estimating HASQI when other types of degradations are present. Subsequent perceptual tests validated the method for music and soundscapes. For the average mean opinion score for perceived audio quality on a scale from 0 to 1, the single ended method could estimate it within ±0.33.
|Themes:||Media, Digital Technology and the Creative Economy|
|Schools:||Schools > School of Computing, Science and Engineering > Salford Innovation Research Centre (SIRC)|
|Journal or Publication Title:||Journal of the Audio Engineering Society|
|Publisher:||Audio Engineering Society|
|Funders:||Engineering and Physical Sciences Research Council (EPSRC)|
|Depositing User:||P Kendrick|
|Date Deposited:||19 Oct 2015 11:34|
|Last Modified:||22 Dec 2015 13:43|
Actions (login required)
|Edit record (repository staff only)|