Prosodic, spectral and voice quality feature selection using a long-term stopping criterion for audio-based emotion recognition

被引:14
|
作者
Kaechele, Markus [1 ]
Zharkov, Dimitrij [1 ]
Meudt, Sascha [1 ]
Schwenker, Friedhelm [1 ]
机构
[1] Univ Ulm, Inst Neural Informat Proc, D-89069 Ulm, Germany
关键词
CLASSIFIER SYSTEMS;
D O I
10.1109/ICPR.2014.148
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition from speech is an important field of research in human-machine-interfaces, and has begun to influence everyday life by employment in different areas such as call centers or wearable companions in the form of smartphones. In the proposed classification architecture, different spectral, prosodic and the relatively novel voice quality features are extracted from the speech signals. These features are then used to represent long-term information of the speech, leading to utterance-wise suprasegmental features. The most promising of these features are selected using a forward-selection/backward-elimination algorithm with a novel long-term termination criterion for the selection. The overall system has been evaluated using recordings from the public Berlin emotion database. Utilizing the resulted features, a recognition rate of 88,97% has been achieved which surpasses the performance of humans on this database and is comparable to the state of the art performance on this dataset.
引用
收藏
页码:803 / 808
页数:6
相关论文
共 50 条
  • [41] Long-Time Speech Emotion Recognition Using Feature Compensation and Accentuation-Based Fusion
    Jiu Sun
    Jinxin Zhu
    Jun Shao
    Circuits, Systems, and Signal Processing, 2024, 43 : 916 - 940
  • [42] Long-Time Speech Emotion Recognition Using Feature Compensation and Accentuation-Based Fusion
    Sun, Jiu
    Zhu, Jinxin
    Shao, Jun
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (02) : 916 - 940
  • [43] Emotion-net: Automatic emotion recognition system using optimal feature selection-based hidden markov CNN model
    Krishna, B. Hari
    Victor, J. Sharon Rose
    Rao, Goda Srinivasa
    Babu, Ch. Raja Kishore
    Raju, K. Srujan
    Basha, T. S. Ghouse
    Reddy, V. Bharath Simha
    AIN SHAMS ENGINEERING JOURNAL, 2024, 15 (12)
  • [44] Visual Place Recognition in Long-term and Large-scale Environment based on CNN Feature
    Zhu, Jianliang
    Ai, Yunfeng
    Tian, Bin
    Cao, Dongpu
    Scherer, Sebastian
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1679 - 1685
  • [45] Audio-Visual Group-based Emotion Recognition using Local and Global Feature Aggregation based Multi-Task Learning
    Li, Sunan
    Lian, Hailun
    Lu, Cheng
    Zhao, Yan
    Tang, Chuangao
    Zong, Yuan
    Zheng, Wenming
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 741 - 745
  • [46] Integrated Long-Term Stock Selection Models Based on Feature Selection and Machine Learning Algorithms for China Stock Market
    Yuan, Xianghui
    Yuan, Jin
    Jiang, Tianzhao
    Ain, Qurat Ul
    IEEE ACCESS, 2020, 8 : 22672 - 22685
  • [47] Emotion analysis and recognition in 3D space using classifier-dependent feature selection in response to tactile enhanced audio–visual content using EEG
    Raheel A.
    Computers in Biology and Medicine, 2024, 179
  • [48] Optimal feature selection based speech emotion recognition using two-stream deep convolutional neural network
    Mustaqeem
    Kwon, Soonil
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (09) : 5116 - 5135
  • [49] Explainable feature selection and deep learning based emotion recognition in virtual reality using eye tracker and physiological data
    Alharbi, Hadeel
    FRONTIERS IN MEDICINE, 2024, 11
  • [50] Improving EEG signal-based emotion recognition using a hybrid GWO-XGBoost feature selection method
    Asemi, Hanie
    Farajzadeh, Nacer
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99