CLASSIFIER FUSION FOR EMOTION RECOGNITION FROM SPEECH

Cited by: 14
Authors
Scherer, Stefan [1]
Schwenker, Friedhelm [1]
Palm, Guenther [1]
Affiliations
[1] Univ Ulm, Inst Neural Informat Proc, Ulm, Germany
Keywords
Modulation spectrum features; RASTA-PLP; Zwicker loudness; Human-computer interaction
DOI
10.1007/978-0-387-76485-6_5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work investigates the performance of an automatic emotion recognizer that uses biologically motivated features: perceived loudness features proposed by Zwicker, robust RASTA-PLP features, and novel long-term modulation spectrum-based features. Single classifiers using only one feature type and multi-classifier systems utilizing all three types are examined using two classifier fusion techniques. All experiments use the standard Berlin Database of Emotional Speech, which comprises recordings of seven different emotions, to evaluate the proposed multi-classifier system. The performance is compared with earlier work as well as with human recognition performance. The results reveal that simple fusion techniques can improve performance significantly, outperforming other classifiers used in earlier work. The generalization ability of the proposed system is further investigated in a leave-one-speaker-out experiment, which reveals a strong ability to recognize emotions expressed by unknown speakers. Moreover, similarities were found between earlier speech analysis and the automatic emotion recognition results.
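The abstract does not say which fusion techniques were used, so the following is only a minimal sketch of decision-level classifier fusion in general. It assumes each single-feature classifier outputs a class-probability vector, and it combines them with a mean rule or a product rule. Both rules, the function names, and the example numbers are illustrative assumptions and are not taken from the paper.

```python
from math import prod


def fuse(posteriors, rule="mean"):
    """Combine per-classifier class-probability vectors into one vector.

    posteriors: list of equal-length lists, one per classifier.
    rule: "mean" or "product" (assumed rules, not the paper's stated method).
    """
    n_classes = len(posteriors[0])
    if rule == "mean":
        fused = [sum(p[c] for p in posteriors) / len(posteriors)
                 for c in range(n_classes)]
    elif rule == "product":
        fused = [prod(p[c] for p in posteriors) for c in range(n_classes)]
    else:
        raise ValueError(f"unknown fusion rule: {rule}")
    total = sum(fused)
    if total == 0:
        # All fused scores vanished; fall back to a uniform distribution.
        return [1.0 / n_classes] * n_classes
    return [v / total for v in fused]


def predict(posteriors, rule="mean"):
    """Return the index of the class with the highest fused score."""
    fused = fuse(posteriors, rule)
    return max(range(len(fused)), key=fused.__getitem__)


# Hypothetical outputs of three single-feature classifiers over three classes.
loudness_clf = [0.5, 0.3, 0.2]
rasta_plp_clf = [0.2, 0.5, 0.3]
modulation_clf = [0.1, 0.6, 0.3]
outputs = [loudness_clf, rasta_plp_clf, modulation_clf]

mean_choice = predict(outputs, rule="mean")
product_choice = predict(outputs, rule="product")
```

In this made-up example the loudness classifier alone would pick class 0, while both fusion rules pick class 1 because the other two classifiers agree on it.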
Pages: 95-117
Page count: 23