General sound classification and similarity in MPEG-7

Casey, Michael A.. 2001. General sound classification and similarity in MPEG-7. Organised Sound, 6(2), ISSN 1355-7718 [Article]

No full text available

Abstract or Description

We introduce a system for generalised sound classification and similarity using a machine-learning framework. Applications of the system include automatic classification of environmental sounds, musical instruments, music genre and human speakers. In addition to classification, the system may also be used for computing similarity metrics between a target sound and other sounds in a database. We discuss the use of hidden Markov models for representing the temporal evolution of audio spectra and present results of testing the system on classification and retrieval tasks. The system has been incorporated into the MPEG-7 international standard for multimedia content description and is therefore publicly available in the form of a set of standardised interfaces and software reference tools for developers and researchers.

Item Type:

Article

Identification Number (DOI):

https://doi.org/10.1017/S1355771801002126

Departments, Centres and Research Units:

Computing

Dates:

DateEvent
2001Published

Item ID:

15151

Date Deposited:

01 Dec 2015 14:14

Last Modified:

20 Jun 2017 09:43

Peer Reviewed:

Yes, this version has been peer-reviewed.

URI:

https://research.gold.ac.uk/id/eprint/15151

Edit Record Edit Record (login required)