Statistical Methods in Music Corpus Studies: Application, Use Cases, and Best Practice Examples

Müllensiefen, Daniel and Frieler, Klaus. 2022. Statistical Methods in Music Corpus Studies: Application, Use Cases, and Best Practice Examples. In: Daniel Shanahan; John Ashley Burgoyne and Ian Quinn, eds. The Oxford Handbook of Music and Corpus Studies. Oxford: Oxford University Press. ISBN 9780190945442 [Book Section]

No full text available
[img] Text
MullensiefenFrieler_CorpusStatistics_2022.pdf - Published Version
Permissions: Administrator Access Only until 18 August 2024.

Download (1MB)

Abstract or Description

In this chapter, the authors explain that there are two common goals in musical corpus analysis. The first is the description and comparison of musical corpora, the second is to establish relationships between musical structures and extra-musical data, which can refer to metadata of a particular musical piece (genre, style, and period labels, composer and performer attributions, etc.) or to listeners’ perceptions and evaluations. The authors give a brief overview of basic and advanced statistical methods that have been employed in music corpus studies. The chapter covers descriptive statistics and visualizations, feature selection and aggregation using principal component analysis. In addition, random forests and linear regression methods for use in the context of corpus studies are briefly explained, as well as supervised and unsupervised classification techniques. Each topic and method is introduced with a conceptual explanation, suggestions for its application, and usage scenarios from the research literature.

Item Type:

Book Section

Identification Number (DOI):

Additional Information:

Reproduced by permission of Oxford University Press

Departments, Centres and Research Units:



18 August 2022Published Online

Item ID:


Date Deposited:

13 Sep 2022 13:34

Last Modified:

13 Sep 2022 13:34


View statistics for this item...

Edit Record Edit Record (login required)