Speech Emotion Recognition Based on a Recurrent Neural Network Classification Model

Authors
Fonnegra, Ruben D. [1]
Diaz, Gloria M. [1]
Affiliations
[1] Inst Tecnol Metropolitano, Medellin, Colombia
Keywords
Speech emotion recognition; Audio signals; Deep learning
DOI
10.1007/978-3-319-76270-8_59
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Affective computing remains one of the most active areas of study for developing better human-machine interaction. Speech emotion recognition in particular is widely used because it is feasible to implement. In this paper, we investigate the discriminative capabilities of recurrent neural networks for human emotion analysis from low-level acoustic descriptors extracted from speech signals. The proposed approach starts by extracting 1580 features from the audio signal using the well-known openSMILE toolbox. These features are then used as input to a recurrent Long Short-Term Memory (LSTM) neural network, which is trained to decide the emotional content of the evaluated utterance. Performance was evaluated in two experiments: gender-independent and gender-dependent classification. Experimental results show that the proposed approach achieves 92% emotion recognition accuracy in the gender-independent experiment, which outperforms previous works using the same experimental data. In the gender-dependent experiment, accuracy was 94.3% and 84.4% for men and women, respectively.
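The pipeline described in the abstract (acoustic feature vectors fed to an LSTM that outputs an emotion class) can be illustrated with a minimal, untrained forward pass. This is only a sketch under stated assumptions, not the authors' implementation: the abstract gives the feature count (1580) but not the network size, the emotion labels, or how the features are arranged in time. The class `TinyLSTMClassifier`, the helper `predict_emotion`, the label list `EMOTIONS`, and the idea of passing one feature vector per time step are all hypothetical choices made for illustration.

```python
import math
import random

N_FEATURES = 1580  # feature count stated in the abstract
# Hypothetical label set; the abstract does not list the emotion classes.
EMOTIONS = ["anger", "happiness", "sadness", "neutral"]


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def init_matrix(rows, cols, rng, scale=0.01):
    # Small random weights; a real model would learn these by training.
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class TinyLSTMClassifier:
    """Single-layer LSTM followed by a softmax layer (untrained sketch)."""

    def __init__(self, n_features, n_hidden, n_classes, seed=0):
        rng = random.Random(seed)
        self.n_hidden = n_hidden
        # One weight matrix per gate, applied to the concatenation [x_t ; h_{t-1}].
        self.gates = {
            name: init_matrix(n_hidden, n_features + n_hidden, rng)
            for name in ("i", "f", "o", "g")
        }
        self.bias = {name: [0.0] * n_hidden for name in ("i", "f", "o", "g")}
        self.out = init_matrix(n_classes, n_hidden, rng, scale=0.1)

    def forward(self, sequence):
        h = [0.0] * self.n_hidden  # hidden state
        c = [0.0] * self.n_hidden  # cell state
        for x in sequence:
            z = list(x) + h
            pre = {
                name: [a + b for a, b in zip(matvec(self.gates[name], z), self.bias[name])]
                for name in ("i", "f", "o", "g")
            }
            i = [sigmoid(v) for v in pre["i"]]    # input gate
            f = [sigmoid(v) for v in pre["f"]]    # forget gate
            o = [sigmoid(v) for v in pre["o"]]    # output gate
            g = [math.tanh(v) for v in pre["g"]]  # candidate cell update
            c = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, g)]
            h = [ov * math.tanh(cv) for ov, cv in zip(o, c)]
        # Softmax over class scores computed from the last hidden state.
        logits = matvec(self.out, h)
        top = max(logits)
        exps = [math.exp(v - top) for v in logits]
        total = sum(exps)
        return [e / total for e in exps]


def predict_emotion(model, sequence):
    # Returns the most probable label and the full probability vector.
    probs = model.forward(sequence)
    return EMOTIONS[probs.index(max(probs))], probs
```

For example, `TinyLSTMClassifier(N_FEATURES, 8, len(EMOTIONS))` would accept a list of 1580-dimensional vectors. Because the weights are random, the predicted label is meaningless until the model is trained; the sketch only shows how data flows from features to a class probability.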
Pages: 882-892
Page count: 11