Speech Emotion Recognition Based on a Recurrent Neural Network Classification Model

被引:0
|
作者
Fonnegra, Ruben D. [1 ]
Diaz, Gloria M. [1 ]
机构
[1] Inst Tecnol Metropolitano, Medellin, Colombia
关键词
Speech emotion recognition; Audio signals; Deep learning;
D O I
10.1007/978-3-319-76270-8_59
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Affective computing is still one of the most active areas of study for developing best human-machine interactions. Specifically, speech emotion recognition is widely used, due to its implementation feasibility. In this paper, we investigate the discriminative capabilities of recurrent neural networks in human emotion analysis from low-level acoustic descriptors extracted from speech signals. The proposed approach starts extracting 1580 features from the audio signal using the well-known OpenSmile toolbox. These features are then used as input to a recurrent Long Short-Term Memory (LSTM) neural network, which is trained for deciding the emotion content of the evaluated utterance. Performance evaluation was conducted by two experiments: a gender independent and a gender-dependent classification. Experimental results show that the proposed approach achieves 92% emotion recognition accuracy in the gender independent experiment, which outperforms previous works using the same experimental data. In the gender-dependent experiment, accuracy was 94.3% and 84.4% for men and women, respectively.
引用
收藏
页码:882 / 892
页数:11
相关论文
共 50 条
  • [1] Artificial Neural Network based Emotion Classification and Recognition from Speech
    Iqbal, Mudasser
    Raza, Syed Ali
    Abid, Muhammad
    Majeed, Furqan
    Hussain, Ans Ali
    [J]. International Journal of Advanced Computer Science and Applications, 2020, 11 (12): : 434 - 444
  • [2] Artificial Neural Network based Emotion Classification and Recognition from Speech
    Iqbal, Mudasser
    Raza, Syed Ali
    Abid, Muhammad
    Majeed, Furgan
    Hussain, Ans Ali
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (12) : 434 - 444
  • [3] A novel convolutional neural network with gated recurrent unit for automated speech emotion recognition and classification
    Prakash, P. Ravi
    Anuradha, D.
    Iqbal, Javid
    Galety, Mohammad Gouse
    Singh, Ruby
    Neelakandan, S.
    [J]. JOURNAL OF CONTROL AND DECISION, 2023, 10 (01) : 54 - 63
  • [4] Constructing Speech Emotion Recognition Model Based on Convolutional Neural Network
    Kuo, Jong-Yih
    Chen, Zhao-Ming
    Lin, Hui-Chi
    [J]. 2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE WORKSHOPS (APSECW 2021), 2021, : 52 - 56
  • [5] Speech emotion recognition based on improved masking EMD and convolutional recurrent neural network
    Sun, Congshan
    Li, Haifeng
    Ma, Lin
    [J]. FRONTIERS IN PSYCHOLOGY, 2023, 13
  • [6] Speech Emotion Recognition Based on Deep Neural Network
    Zhu, Zijiang
    Hu, Yi
    Li, Junshan
    Li, Jianjun
    Wang, Junhua
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 154 - 154
  • [7] Parallelized Convolutional Recurrent Neural Network With Spectral Features for Speech Emotion Recognition
    Jiang, Pengxu
    Fu, Hongliang
    Tao, Huawei
    Lei, Peizhi
    Zhao, Li
    [J]. IEEE ACCESS, 2019, 7 : 90368 - 90377
  • [8] Relative Speech Emotion Recognition Based Artificial Neural Network
    Fu, Liqin
    Mao, Xia
    Chen, Lijiang
    [J]. PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1111 - 1115
  • [9] Speech Emotion Recognition based on Interactive Convolutional Neural Network
    Cheng, Huihui
    Tang, Xiaoyu
    [J]. 2020 IEEE 3RD INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP 2020), 2020, : 163 - 167
  • [10] Multimodal speech emotion recognition and classification using convolutional neural network techniques
    A. Christy
    S. Vaithyasubramanian
    A. Jesudoss
    M. D. Anto Praveena
    [J]. International Journal of Speech Technology, 2020, 23 : 381 - 388