Li, FF ORCID: https://orcid.org/0000-0001-9053-963X
2014,
'Soft-computing audio classification as a pre-processor for automated content descriptor generation'
, International Journal of Computer and Communication Engineering, 3 (2)
, pp. 101-104.
![]() |
PDF
- Published Version
Restricted to Repository staff only Download (1MB) | Request a copy |
Abstract
Soundtracks of multimedia files are information rich sources, from which much content-related information and metadata can be extracted. There exist many individual algorithms for the recognition and analysis of speech, music or event sounds, allowing for information embedded in audio format files to be retrieved or represented in a semantic fashion. However, soundtracks are typically a mixture these three different types of signals, and sometimes overlapped. Segmentation and classification therefore become essential pre-processors for audio based information retrieval and metadata generation. This paper stresses the importance of a universal audio indexing and segmentation pre-processor, proposes a high-level architecture for such a system, and presents signal processing algorithms based on soft-computing and two important but neglected feature spaces to improve the accuracy of classification.
Item Type: | Article |
---|---|
Schools: | Schools > School of Computing, Science and Engineering > Salford Innovation Research Centre |
Journal or Publication Title: | International Journal of Computer and Communication Engineering |
ISSN: | 2010-3743 |
Related URLs: | |
Funders: | University of Salford |
Depositing User: | Dr Francis F. Li |
Date Deposited: | 09 May 2016 08:41 |
Last Modified: | 15 Feb 2022 20:45 |
URI: | https://usir.salford.ac.uk/id/eprint/38904 |
Actions (login required)
![]() |
Edit record (repository staff only) |