Emotion in the singing voice—a deeperlook at acoustic features in the light ofautomatic classification

被引:0
|
作者
Florian Eyben
Gláucia L Salomão
Johan Sundberg
Klaus R Scherer
Björn W Schuller
机构
[1] MISP Group,Department of Speech Music Hearing, School of Computer Science and Communication
[2] Technische Universität München,Department of Computing
[3] KTH (Royal Institute of Technology),Department of Linguistics
[4] Université De Genève,Chair of Complex and Intelligent Systems
[5] Imperial College London,undefined
[6] Stockholm University,undefined
[7] University College of Music Education,undefined
[8] University of Passau,undefined
[9] audEERING UG (limited),undefined
关键词
Emotion recognition; Singing voice; Acoustic features; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
We investigate the automatic recognition of emotions in the singing voice and study the worth and role of a variety of relevant acoustic parameters. The data set contains phrases and vocalises sung by eight renowned professional opera singers in ten different emotions and a neutral state. The states are mapped to ternary arousal and valence labels. We propose a small set of relevant acoustic features basing on our previous findings on the same data and compare it with a large-scale state-of-the-art feature set for paralinguistics recognition, the baseline feature set of the Interspeech 2013 Computational Paralinguistics ChallengE (ComParE). A feature importance analysis with respect to classification accuracy and correlation of features with the targets is provided in the paper. Results show that the classification performance with both feature sets is similar for arousal, while the ComParE set is superior for valence. Intra singer feature ranking criteria further improve the classification accuracy in a leave-one-singer-out cross validation significantly.
引用
收藏
相关论文
共 50 条
  • [31] Anatomical features of singing voice. An analysis method
    DeLeon, J
    Cuyas, JM
    Ramos, A
    Rivero, JF
    [J]. SYDNEY '97 - XVI WORLD CONGRESS OF OTORHINOLARYNGOLOGY HEAD AND NECK SURGERY, TOMES 1 AND 2, 1996, : 1657 - 1660
  • [32] SINGING VOICE TIMBRE CLASSIFICATION OF CHINESE POPULAR MUSIC
    Sha, Cheng-Ya
    Yang, Yi-Hsuan
    Lin, Yu-Ching
    Chen, Homer H.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 734 - 738
  • [33] Analysis and classification of phonation types in speech and singing voice
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    Yegnanarayana, B.
    [J]. SPEECH COMMUNICATION, 2020, 118 : 33 - 47
  • [34] A SCHEME FOR SYSTEMATIC CLASSIFICATION OF THE SINGING VOICE IN INDIAN MUSIC
    SATHYANARAYANA, R
    [J]. JOURNAL OF THE INDIAN MUSICOLOGICAL SOCIETY, 1991, 22 : 18 - 31
  • [35] Listening evaluation and classification of female singing voice categories
    Fric, Marek
    Pavlechova, Angelika
    [J]. LOGOPEDICS PHONIATRICS VOCOLOGY, 2020, 45 (03) : 97 - 109
  • [36] The Impact of Tonsillectomy on the Adult Singing Voice: Acoustic and Aerodynamic Measures
    Burckardt, Elizabeth S.
    Hillman, Robert E.
    Murton, Olivia
    Mehta, Daryush
    Van Stan, Jarrad
    Burns, James A.
    [J]. JOURNAL OF VOICE, 2023, 37 (01) : 101 - 104
  • [37] The Perception of Emotion in the Singing Voice The Understanding of Music Mood for Music Organisation
    Parada-Cabaleiro, Emilia
    Baird, Alice
    Batliner, Anton
    Cummins, Nicholas
    Hantke, Simone
    Schuller, Bjoern W.
    [J]. PROCEEDINGS OF DLFM 2017: THE 4TH INTERNATIONAL WORKSHOP ON DIGITAL LIBRARIES FOR MUSICOLOGY, 2017, : 29 - 36
  • [38] Improving Automatic Singing Skill Evaluation with Timbral Features, Attention, and Singing Voice Separation
    Ju, Yaolong
    Xu, Chunyang
    Guo, Yichen
    Li, Jinhu
    Lui, Simon
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 612 - 617
  • [39] Spectral Features for Emotion Classification
    Koolagudi, Shashidhar G.
    Nandy, Sourav
    Rao, K. Sreenivasa
    [J]. 2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 1292 - 1296
  • [40] Combining acoustic features and medical data in deep learning networks for voice pathology classification
    Miliaresi, Ioanna
    Poutos, Kyriakos
    Pikrakis, Aggelos
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 1190 - 1194