Modeling Timbre Similarity of Short Music Clips

被引:3
|
作者
Siedenburg, Kai [1 ]
Mullensiefen, Daniel [2 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Oldenburg, Germany
[2] Goldsmiths Univ London, Dept Psychol, London, England
来源
FRONTIERS IN PSYCHOLOGY | 2017年 / 8卷
关键词
short audio clips; music similarity; timbre; audio features; genre; LEAST-SQUARES REGRESSION; SOUNDS; DISSIMILARITY; DESCRIPTORS; RECOGNITION; EXCERPTS;
D O I
10.3389/fpsyg.2017.00639
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
There is evidence from a number of recent studies that most listeners are able to extract information related to song identity, emotion, or genre from music excerpts with durations in the range of tenths of seconds. Because of these very short durations, timbre as a multifaceted auditory attribute appears as a plausible candidate for the type of features that listeners make use of when processing short music excerpts. However, the importance of timbre in listening tasks that involve short excerpts has not yet been demonstrated empirically. Hence, the goal of this study was to develop a method that allows to explore to what degree similarity judgments of shortmusic clips can bemodeled with low-level acoustic features related to timbre. We utilized the similarity data from two large samples of participants: Sample I was obtained via an online survey, used 16 clips of 400 ms length, and contained responses of 137,339 participants. Sample II was collected in a lab environment, used 16 clips of 800 ms length, and contained responses from 648 participants. Our model used two sets of audio features which included commonly used timbre descriptors and the well-known Mel-frequency cepstral coefficients as well as their temporal derivates. In order to predict pairwise similarities, the resulting distances between clips in terms of their audio features were used as predictor variables with partial least-squares regression. We found that a sparse selection of three to seven features from both descriptor sets-mainly encoding the coarse shape of the spectrum as well as spectrotemporal variability-best predicted similarities across the two sets of sounds. Notably, the inclusion of non-acoustic predictors of musical genre and record release date allowed much better generalization performance and explained up to 50% of shared variance (R-2) between observations and model predictions. Overall, the results of this study empirically demonstrate that both acoustic features related to timbre as well as higher level categorical features such as musical genre play a major role in the perception of short music clips.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] VOCAL TIMBRE ANALYSIS USING LATENT DIRICHLET ALLOCATION AND CROSS-GENDER VOCAL TIMBRE SIMILARITY
    Nakano, Tomoyasu
    Yoshii, Kazuyoshi
    Goto, Masataka
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [32] SINGING VOICE TIMBRE CLASSIFICATION OF CHINESE POPULAR MUSIC
    Sha, Cheng-Ya
    Yang, Yi-Hsuan
    Lin, Yu-Ching
    Chen, Homer H.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 734 - 738
  • [33] Music Genre Classification Using Polyphonic Timbre Models
    de Leon, Franz A.
    Martinez, Kirk
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 415 - 420
  • [34] Natural Highs: Timbre and Chills in Electronic Dance Music
    Auricchio, Nino
    POPULAR MUSIC STUDIES TODAY, 2017, : 11 - 23
  • [35] Dark timbre: the aesthetics of tone colour in goth music
    Van Elferen, Isabella
    POPULAR MUSIC, 2018, 37 (01) : 22 - 39
  • [36] TIMBRE FEATURES OF MODERN MUSIC AND EMOTION TRAINING MODEL
    Moos, Aleksey
    Moos, Evgeny
    SGEM 2015, BOOK 4: ARTS, PERFORMING ARTS, ARCHITECTURE AND DESIGN, 2015, : 147 - 152
  • [37] Note Recognition of Polyphonic Music Based on Timbre Model
    Shi, Lixin
    Zhang, Junxing
    Li, Min
    2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 1, PROCEEDINGS, 2009, : 174 - 177
  • [38] INSTRUMENT TIMBRE ENHANCES PERCEPTUAL SEGREGATION IN ORCHESTRAL MUSIC
    Fischer, Manda
    Soden, Kit
    Thoret, Etienne
    Montrey, Marcel
    McAdams, Stephen
    MUSIC PERCEPTION, 2021, 38 (05): : 473 - 498
  • [39] Instrument classification in polyphonic music based on timbre analysis
    Zhang, T
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS II, 2001, 4519 : 136 - 147
  • [40] Timbre Music and the Birth of New Cult-Sound
    Ruttkay, Juraj
    AMTA '09: PROCEEDINGS OF THE 10TH WSEAS INTERNATIONAL CONFERENCE ON ACOUSTICS AND MUSIC: THEORY AND APPLICATIONS, 2009, : 65 - 67