Modeling Timbre Similarity of Short Music Clips

被引:3
|
作者
Siedenburg, Kai [1 ]
Mullensiefen, Daniel [2 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Oldenburg, Germany
[2] Goldsmiths Univ London, Dept Psychol, London, England
来源
FRONTIERS IN PSYCHOLOGY | 2017年 / 8卷
关键词
short audio clips; music similarity; timbre; audio features; genre; LEAST-SQUARES REGRESSION; SOUNDS; DISSIMILARITY; DESCRIPTORS; RECOGNITION; EXCERPTS;
D O I
10.3389/fpsyg.2017.00639
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
There is evidence from a number of recent studies that most listeners are able to extract information related to song identity, emotion, or genre from music excerpts with durations in the range of tenths of seconds. Because of these very short durations, timbre as a multifaceted auditory attribute appears as a plausible candidate for the type of features that listeners make use of when processing short music excerpts. However, the importance of timbre in listening tasks that involve short excerpts has not yet been demonstrated empirically. Hence, the goal of this study was to develop a method that allows to explore to what degree similarity judgments of shortmusic clips can bemodeled with low-level acoustic features related to timbre. We utilized the similarity data from two large samples of participants: Sample I was obtained via an online survey, used 16 clips of 400 ms length, and contained responses of 137,339 participants. Sample II was collected in a lab environment, used 16 clips of 800 ms length, and contained responses from 648 participants. Our model used two sets of audio features which included commonly used timbre descriptors and the well-known Mel-frequency cepstral coefficients as well as their temporal derivates. In order to predict pairwise similarities, the resulting distances between clips in terms of their audio features were used as predictor variables with partial least-squares regression. We found that a sparse selection of three to seven features from both descriptor sets-mainly encoding the coarse shape of the spectrum as well as spectrotemporal variability-best predicted similarities across the two sets of sounds. Notably, the inclusion of non-acoustic predictors of musical genre and record release date allowed much better generalization performance and explained up to 50% of shared variance (R-2) between observations and model predictions. Overall, the results of this study empirically demonstrate that both acoustic features related to timbre as well as higher level categorical features such as musical genre play a major role in the perception of short music clips.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Analysis of sound features for music timbre recognition
    Zhang, Xin
    Ras, Zbigniew W.
    MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 3 - +
  • [22] The concept of 'Klangfarbe' (Music, timbre, tone quality)
    Jost, P
    MUSIKTHEORIE, 2001, 16 (02): : 181 - 183
  • [23] Effects of timbre and tempo change on memory for music
    Halpern, Andrea R.
    Muellensiefen, Daniel
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2008, 61 (09): : 1371 - 1384
  • [24] Timeshards: repetition, timbre, and identity in dance music
    Jowers, P
    TIME & SOCIETY, 1999, 8 (02) : 381 - 396
  • [25] Instrumental Ensemble in Russian Music - Timbre Innovations
    Samoylova, Nailya
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ART STUDIES: SCIENCE, EXPERIENCE, EDUCATION (ICASSEE 2017), 2017, 171
  • [26] SOME ASPECTS OF TIMBRE IN THE MUSIC OF DALLAPICCOLA,LUIGI
    MICHEL, P
    REVUE DE MUSICOLOGIE, 1992, 78 (02) : 307 - 330
  • [27] Rhythm and Timbre Analysis for Carnatic Music Processing
    Heshi, Rushiraj
    Suma, S. M.
    Koolagudi, Shashidhar G.
    Bhandari, Smriti
    Rao, K. S.
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS (ICACNI 2015), VOL 1, 2016, 43 : 603 - 609
  • [28] A Comparison of Approaches to Timbre Descriptors in Music Information Retrieval and Music Psychology
    Siedenburg, Kai
    Fujinaga, Ichiro
    McAdams, Stephen
    JOURNAL OF NEW MUSIC RESEARCH, 2016, 45 (01) : 27 - 41
  • [29] Timbre similarity:: Convergence of neural, behavioral, and computational approaches
    Toiviainen, P
    Tervaniemi, M
    Louhivuori, J
    Saher, M
    Huotilainen, M
    Näätänen, R
    MUSIC PERCEPTION, 1998, 16 (02): : 223 - 241
  • [30] Inter genre similarity modeling for automatic music genre classification
    Bagci, Ulas
    Erzin, Engin
    2006 IEEE 14TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1 AND 2, 2006, : 639 - +