Soft-computing audio classification as a pre-processor for automated content descriptor generation

Li, FF ORCID: https://orcid.org/0000-0001-9053-963X 2014, 'Soft-computing audio classification as a pre-processor for automated content descriptor generation' , International Journal of Computer and Communication Engineering, 3 (2) , pp. 101-104.

[img] PDF - Published Version
Restricted to Repository staff only

Download (1MB) | Request a copy

Abstract

Soundtracks of multimedia files are information rich sources, from which much content-related information and metadata can be extracted. There exist many individual algorithms for the recognition and analysis of speech, music or event sounds, allowing for information embedded in audio format files to be retrieved or represented in a semantic fashion. However, soundtracks are typically a mixture these three different types of signals, and sometimes overlapped. Segmentation and classification therefore become essential pre-processors for audio based information retrieval and metadata generation. This paper stresses the importance of a universal audio indexing and segmentation pre-processor, proposes a high-level architecture for such a system, and presents signal processing algorithms based on soft-computing and two important but neglected feature spaces to improve the accuracy of classification.

Item Type: Article
Schools: Schools > School of Computing, Science and Engineering > Salford Innovation Research Centre
Journal or Publication Title: International Journal of Computer and Communication Engineering
ISSN: 2010-3743
Related URLs:
Funders: University of Salford
Depositing User: Dr Francis F. Li
Date Deposited: 09 May 2016 08:41
Last Modified: 15 Feb 2022 20:45
URI: https://usir.salford.ac.uk/id/eprint/38904

Actions (login required)

Edit record (repository staff only) Edit record (repository staff only)

Downloads

Downloads per month over past year