Speech Emotion Recognition based on Multi-Level Residual Convolutional Neural Networks

被引:0
|
作者
Zheng, Kai [1 ]
Xia, ZhiGuang [1 ]
Zhang, Yi [1 ]
Xu, Xuan [1 ]
Fu, Yaqin [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Engn Res Ctr Informat Accessibil & Serv Robots, Chongqing 400065, Peoples R China
关键词
CNN; residual network; speech emotion recognition; CLASSIFICATION;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Speech emotion recognition, using the convolutional neural networks (CNN) model, is challenging due to the problem of features loss and the decrease of recognition accuracy. To address this issue, a Multi-level residual CNN model is proposed is this paper. In this model, the speech signals are converted into spectrogram, then the multi-level residual identity maps are introduced to compensate the missing features in the CNN during the convolution process, so as to improve the recognition accuracy of speech emotion. The research results show that the Multi-level residual CNN can achieve 74.36% recognition accuracy on the EMO-DB dataset, which has better performance than traditional deep CNN method.
引用
收藏
页码:559 / 565
页数:7
相关论文
共 50 条
  • [1] Facial Emotion Recognition Using an Ensemble of Multi-Level Convolutional Neural Networks
    Hai-Duong Nguyen
    Yeom, Soonja
    Lee, Guee-Sang
    Yang, Hyung-Jeong
    Na, In-Seop
    Kim, Soo-Hyung
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (11)
  • [2] Improvement on Speech Emotion Recognition Based on Deep Convolutional Neural Networks
    Niu, Yafeng
    Zou, Dongsheng
    Niu, Yadong
    He, Zhongshi
    Tan, Hua
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON COMPUTING AND ARTIFICIAL INTELLIGENCE (ICCAI 2018), 2018, : 13 - 18
  • [3] Continuous speech emotion recognition with convolutional neural networks
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Matsiola, Maria
    Kotsakis, Rigas
    Dimoulas, Charalampos
    Kalliris, George
    [J]. AES: Journal of the Audio Engineering Society, 2020, 68 (1-2): : 14 - 24
  • [4] Continuous Speech Emotion Recognition with Convolutional Neural Networks
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Matsiola, Maria
    Kotsakis, Rigas
    Dimoulas, Charalampos
    Kalliris, George
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2020, 68 (1-2): : 14 - 24
  • [5] Speech emotion recognition with deep convolutional neural networks
    Issa, Dias
    Demirci, M. Fatih
    Yazici, Adnan
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 59
  • [6] An Ensemble Model for Multi-Level Speech Emotion Recognition
    Zheng, Chunjun
    Wang, Chunli
    Jia, Ning
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [7] Multi-Level Wavelet Convolutional Neural Networks
    Liu, Pengju
    Zhang, Hongzhi
    Lian, Wei
    Zuo, Wangmeng
    [J]. IEEE ACCESS, 2019, 7 : 74973 - 74985
  • [8] An Experimental Study of Speech Emotion Recognition Based on Deep Convolutional Neural Networks
    Zheng, W. Q.
    Yu, J. S.
    Zou, Y. X.
    [J]. 2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 827 - 831
  • [9] IMPROVING CONVOLUTIONAL RECURRENT NEURAL NETWORKS FOR SPEECH EMOTION RECOGNITION
    Meyer, Patrick
    Xu, Ziyi
    Fingscheidt, Tim
    [J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 365 - 372
  • [10] Gender Differentiated Convolutional Neural Networks for Speech Emotion Recognition
    Mishra, Puneet
    Sharma, Ruchir
    [J]. 2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 142 - 148