Prosodic, spectral and voice quality feature selection using a long-term stopping criterion for audio-based emotion recognition

被引:14
|
作者
Kaechele, Markus [1 ]
Zharkov, Dimitrij [1 ]
Meudt, Sascha [1 ]
Schwenker, Friedhelm [1 ]
机构
[1] Univ Ulm, Inst Neural Informat Proc, D-89069 Ulm, Germany
关键词
CLASSIFIER SYSTEMS;
D O I
10.1109/ICPR.2014.148
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition from speech is an important field of research in human-machine-interfaces, and has begun to influence everyday life by employment in different areas such as call centers or wearable companions in the form of smartphones. In the proposed classification architecture, different spectral, prosodic and the relatively novel voice quality features are extracted from the speech signals. These features are then used to represent long-term information of the speech, leading to utterance-wise suprasegmental features. The most promising of these features are selected using a forward-selection/backward-elimination algorithm with a novel long-term termination criterion for the selection. The overall system has been evaluated using recordings from the public Berlin emotion database. Utilizing the resulted features, a recognition rate of 88,97% has been achieved which surpasses the performance of humans on this database and is comparable to the state of the art performance on this dataset.
引用
收藏
页码:803 / 808
页数:6
相关论文
共 50 条
  • [31] Audio-visual emotion recognition using FCBF feature selection method and particle swarm optimization for fuzzy ARTMAP neural networks
    Gharavian, Davood
    Bejani, Mehdi
    Sheikhan, Mansour
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (02) : 2331 - 2352
  • [32] Feature Selection for Facial Emotion Recognition Using Cosine Similarity-Based Harmony Search Algorithm
    Saha, Soumyajit
    Ghosh, Manosij
    Ghosh, Soulib
    Sen, Shibaprasad
    Singh, Pawan Kumar
    Geem, Zong Woo
    Sarkar, Ram
    APPLIED SCIENCES-BASEL, 2020, 10 (08):
  • [33] Feature selection for facial emotion recognition using late hill-climbing based memetic algorithm
    Manosij Ghosh
    Tuhin Kundu
    Dipayan Ghosh
    Ram Sarkar
    Multimedia Tools and Applications, 2019, 78 : 25753 - 25779
  • [34] Feature selection for facial emotion recognition using late hill-climbing based memetic algorithm
    Ghosh, Manosij
    Kundu, Tuhin
    Ghosh, Dipayan
    Sarkar, Ram
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 25753 - 25779
  • [35] Enhancing BCI-Based Emotion Recognition Using an Improved Particle Swarm Optimization for Feature Selection
    Li, Zina
    Qiu, Lina
    Li, Ruixin
    He, Zhipeng
    Xiao, Jun
    Liang, Yan
    Wang, Fei
    Pan, Jiahui
    SENSORS, 2020, 20 (11)
  • [36] Evolutionary computation algorithms for feature selection of EEG-based emotion recognition using mobile sensors
    Nakisa, Bahareh
    Rastgoo, Mohammad Naim
    Tjondronegoro, Dian
    Chandran, Vinod
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 93 : 143 - 155
  • [37] Long-term relevance feedback and feature selection for adaptive content based image suggestion
    Boutemedjet, Sabri
    Ziou, Djemel
    PATTERN RECOGNITION, 2010, 43 (12) : 3925 - 3937
  • [38] Recognition of Negative Emotion Using Long Short-Term Memory with Bio-Signal Feature Compression
    Lee, JeeEun
    Yoo, Sun K.
    SENSORS, 2020, 20 (02)
  • [39] Long-term voice outcome after thyroidectomy using energy based devices
    Park, Min Woo
    Baek, Seung-Kuk
    Park, Euy-Hyun
    Jung, Kwang-Yoon
    AURIS NASUS LARYNX, 2018, 45 (03) : 527 - 532
  • [40] A comparison using different speech parameters in the automatic emotion recognition using feature subset selection based on evolutionary algorithms
    Alvarez, Aitor
    Cearreta, Idoia
    Lopez, Juan Miguel
    Arruti, Andoni
    Lazkano, Elena
    Sierra, Basilio
    Garay, Nestor
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2007, 4629 : 423 - 430