Acoustic Lexemes for Organizing Internet Audio

Casey, Michael A.. 2005. Acoustic Lexemes for Organizing Internet Audio. Contemporary Music Review, 24(6), pp. 489-508. ISSN 0749-4467 [Article]

No full text available

Abstract or Description

In this article, a method is proposed for automatic fine-scale audio description that draws inspiration from ontological sound description methods such as Shaeffer's Objets Sonores and Smalley's Spectromorphology. The goal is complete automation of audio description at the level of sound objects for indexing and retrieving sound segments within Internet audio documents. To automatically segment audio documents into acoustic lexemes, a hidden Markov model is employed. It is demonstrated that the symbol stream of cluster labels, generated by the Viterbi algorithm, constitutes a detailed description of audio as a sequence of spectral archetypes. The ASCII base-64 encoding scheme maps cluster indices to one-character symbols that are segmented into 8-gram sequences for indexing in a relational database. To illustrate the methods, the essential components of an audio search engine are described: the automatic cataloguer, the retrieval engine and the query language. The results of experiments that test the accuracy and the retrieval efficiency of six new similarity-matching algorithms for audio using acoustic lexemes are presented. The article concludes with examples of audio matching using the structured query language (SQL) for creating new musical sequences from large extant audio collections.

Item Type:

Article

Identification Number (DOI):

https://doi.org/10.1080/07494460500296169

Departments, Centres and Research Units:

Computing

Dates:

DateEvent
October 2005Published

Item ID:

15150

Date Deposited:

01 Dec 2015 14:02

Last Modified:

20 Jun 2017 09:43

Peer Reviewed:

Yes, this version has been peer-reviewed.

URI:

https://research.gold.ac.uk/id/eprint/15150

Edit Record Edit Record (login required)