Emotion in the singing voice—a deeperlook at acoustic features in the light ofautomatic classification

被引:0
|
作者
Florian Eyben
Gláucia L Salomão
Johan Sundberg
Klaus R Scherer
Björn W Schuller
机构
[1] MISP Group,Department of Speech Music Hearing, School of Computer Science and Communication
[2] Technische Universität München,Department of Computing
[3] KTH (Royal Institute of Technology),Department of Linguistics
[4] Université De Genève,Chair of Complex and Intelligent Systems
[5] Imperial College London,undefined
[6] Stockholm University,undefined
[7] University College of Music Education,undefined
[8] University of Passau,undefined
[9] audEERING UG (limited),undefined
关键词
Emotion recognition; Singing voice; Acoustic features; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
We investigate the automatic recognition of emotions in the singing voice and study the worth and role of a variety of relevant acoustic parameters. The data set contains phrases and vocalises sung by eight renowned professional opera singers in ten different emotions and a neutral state. The states are mapped to ternary arousal and valence labels. We propose a small set of relevant acoustic features basing on our previous findings on the same data and compare it with a large-scale state-of-the-art feature set for paralinguistics recognition, the baseline feature set of the Interspeech 2013 Computational Paralinguistics ChallengE (ComParE). A feature importance analysis with respect to classification accuracy and correlation of features with the targets is provided in the paper. Results show that the classification performance with both feature sets is similar for arousal, while the ComParE set is superior for valence. Intra singer feature ranking criteria further improve the classification accuracy in a leave-one-singer-out cross validation significantly.
引用
收藏
相关论文
共 50 条
  • [1] Emotion in the singing voice-a deeper look at acoustic features in the light of automatic classification
    Eyben, Florian
    Salomao, Glaucia L.
    Sundberg, Johan
    Scherer, Klaus R.
    Schuller, Bjoern W.
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015,
  • [2] Comparing the acoustic expression of emotion in the speaking and the singing voice
    Scherer, Klaus R.
    Sundberg, Johan
    Tamarit, Lucas
    Salomao, Glaucia L.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2015, 29 (01): : 218 - 235
  • [3] The expression of emotion in the singing voice: Acoustic patterns in vocal performance
    Scherer, Klaus R.
    Sundberg, Johan
    Fantini, Bernardino
    Trznadel, Stephanie
    Eyben, Florian
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (04): : 1805 - 1815
  • [4] Perceptual (but not acoustic) features predict singing voice preferences
    Bruder, Camila
    Poeppel, David
    Larrouy-Maestri, Pauline
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [5] Ranking Speech Features for Their Usage in Singing Emotion Classification
    Zaporowski, Szymon
    Kostek, Bozena
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2020), 2020, 12117 : 225 - 234
  • [6] Speech Emotion Classification using Acoustic Features
    Chen, Shizhe
    Jin, Qin
    Li, Xirong
    Yang, Gang
    Xu, Jieping
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 579 - 583
  • [7] An Investigation of Acoustic Features for Singing Voice Conversion based on Perceptual Age
    Kobayashi, Kazuhiro
    Doi, Hironori
    Toda, Tomoki
    Nakano, Tomoyasu
    Goto, Masataka
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1056 - 1060
  • [8] ICA-FX features for classification of singing voice and instrumental sound
    Leung, TW
    Ngo, CW
    Lau, RWH
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 367 - 370
  • [9] Acoustic analysis of the singing and speaking voice in singing students
    Lundy, DS
    Roy, S
    Casiano, RR
    Xue, JW
    Evans, J
    [J]. JOURNAL OF VOICE, 2000, 14 (04) : 490 - 493
  • [10] An extraction method of acoustic features for music emotion classification
    Qin, Jiwei
    Xu, Liang
    Wang, Jinsheng
    Guo, Fei
    [J]. Sensors and Transducers, 2014, 175 (07): : 83 - 87