Research on speech emotion recognition algorithm for unbalanced data set

Cited by: 0
Authors
Liang Z. [1]
Li X. [1]
Song W. [1]
Affiliations
[1] Electronic Information Engineering, Changchun University of Science and Technology, Jilin Province
Keywords
CRNN; focal loss; spectrograms; speech emotion recognition
DOI
10.3233/JIFS-191129
Abstract
Most emotional speech corpora suffer from inconsistent sample lengths and imbalanced sample categories. To address these problems, this paper proposes a variable-length-input CRNN deep learning model based on focal loss for recognizing anger, happiness, neutrality, and sadness in the IEMOCAP emotional corpus. First, a variable-length strategy is introduced so that the spectrograms of the padded speech samples can be fed into a CNN. Second, a masking matrix and the convolution layers preserve and output only the valid part of the input sequence. Third, this valid output is passed to a BiGRU network for learning. Finally, focal loss is used for network training to control and adjust the contribution of the various samples to the total loss. Simulations show that, compared with traditional speech emotion recognition models, the method effectively improves the accuracy and performance of emotion recognition. © 2020 - IOS Press and the authors. All rights reserved.
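The abstract does not state the loss formula or its hyperparameters. As a minimal sketch, assuming the standard multi-class focal loss FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t), the down-weighting of easy samples could look like the following. The function name `focal_loss`, the default `gamma=2.0`, and the toy probabilities are illustrative assumptions, not values taken from the paper.

```python
import numpy as np

def focal_loss(probs, labels, gamma=2.0, alpha=None):
    """Mean multi-class focal loss (illustrative sketch, not the paper's code).

    probs:  (N, C) array of predicted class probabilities.
    labels: (N,) array of integer class indices.
    gamma:  focusing parameter; gamma = 0 reduces to plain cross-entropy.
    alpha:  optional (C,) array of per-class weights.
    """
    probs = np.asarray(probs, dtype=np.float64)
    labels = np.asarray(labels, dtype=np.int64)
    # Probability the model assigns to the true class of each sample.
    p_t = probs[np.arange(len(labels)), labels]
    p_t = np.clip(p_t, 1e-12, 1.0)  # avoid log(0)
    # The (1 - p_t)^gamma factor shrinks the loss of well-classified samples.
    loss = -((1.0 - p_t) ** gamma) * np.log(p_t)
    if alpha is not None:
        loss = np.asarray(alpha, dtype=np.float64)[labels] * loss
    return loss.mean()

# Hypothetical four-class example (e.g. anger, happiness, neutrality, sadness).
probs = np.array([
    [0.90, 0.05, 0.03, 0.02],  # easy sample: true class 0 predicted confidently
    [0.30, 0.40, 0.20, 0.10],  # hard sample: true class 3 gets low probability
])
labels = np.array([0, 3])

cross_entropy = focal_loss(probs, labels, gamma=0.0)
focal = focal_loss(probs, labels, gamma=2.0)
print(cross_entropy, focal)
```

With `gamma > 0`, the confidently classified sample contributes almost nothing to the mean, so training is driven mainly by the hard sample, which is typically where minority classes end up.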
Pages: 2791-2796
Page count: 5