A Comprehensive Review of Speech Emotion Recognition Systems

被引:86
|
作者
Wani, Taiba Majid [1 ]
Gunawan, Teddy Surya [1 ,3 ]
Qadri, Syed Asif Ahmad [1 ]
Kartiwi, Mira [2 ]
Ambikairajah, Eliathamby [3 ]
机构
[1] Int Islamic Univ Malaysia, Dept Elect & Comp Engn, Kuala Lumpur 53100, Malaysia
[2] Int Islamic Univ Malaysia, Dept Informat Syst, Kuala Lumpur 53100, Malaysia
[3] Univ New South Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Speech recognition; Databases; Feature extraction; Emotion recognition; Speech processing; Task analysis; Neural networks; Speech emotion recognition; database; preprocessing; feature extraction; classifier; CHINESE SPEECH; DATABASE; SVM; FEATURES; CLASSIFICATION; COMBINATION; MODEL;
D O I
10.1109/ACCESS.2021.3068045
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
During the last decade, Speech Emotion Recognition (SER) has emerged as an integral component within Human-computer Interaction (HCI) and other high-end speech processing systems. Generally, an SER system targets the speaker's existence of varied emotions by extracting and classifying the prominent features from a preprocessed speech signal. However, the way humans and machines recognize and correlate emotional aspects of speech signals are quite contrasting quantitatively and qualitatively, which present enormous difficulties in blending knowledge from interdisciplinary fields, particularly speech emotion recognition, applied psychology, and human-computer interface. The paper carefully identifies and synthesizes recent relevant literature related to the SER systems' varied design components/methodologies, thereby providing readers with a state-of-the-art understanding of the hot research topic. Furthermore, while scrutinizing the current state of understanding on SER systems, the research gap's prominence has been sketched out for consideration and analysis by other related researchers, institutions, and regulatory bodies.
引用
收藏
页码:47795 / 47814
页数:20
相关论文
共 50 条
  • [41] Autoencoder With Emotion Embedding for Speech Emotion Recognition
    Zhang, Chenghao
    Xue, Lei
    [J]. IEEE ACCESS, 2021, 9 : 51231 - 51241
  • [42] Speech emotion recognition based on emotion perception
    Liu, Gang
    Cai, Shifang
    Wang, Ce
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [43] Autoencoder with emotion embedding for speech emotion recognition
    Zhang, Chenghao
    Xue, Lei
    [J]. IEEE Access, 2021, 9 : 51231 - 51241
  • [44] English speech emotion recognition method based on speech recognition
    Liu, Man
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (2) : 391 - 398
  • [45] English speech emotion recognition method based on speech recognition
    Man Liu
    [J]. International Journal of Speech Technology, 2022, 25 : 391 - 398
  • [46] Emotion Recognition in Arabic Speech
    Klaylat, Samira
    Hamandi, Lama
    Osman, Ziad
    Zantout, Rached
    [J]. 2017 SENSORS NETWORKS SMART AND EMERGING TECHNOLOGIES (SENSET), 2017,
  • [47] Persian Speech Emotion Recognition
    Savargiv, Mohammad
    Bastanfard, Azam
    [J]. 2015 7TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2015,
  • [48] Multiroom Speech Emotion Recognition
    Shalev, Erez
    Cohen, Israel
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 135 - 139
  • [49] Emotion recognition in Arabic speech
    Samira Klaylat
    Ziad Osman
    Lama Hamandi
    Rached Zantout
    [J]. Analog Integrated Circuits and Signal Processing, 2018, 96 : 337 - 351
  • [50] Windowing for Speech Emotion Recognition
    Puterka, Boris
    Kacur, Juraj
    Pavlovicova, Jarmila
    [J]. 2019 61ST INTERNATIONAL SYMPOSIUM ELMAR, 2019, : 147 - 150