A Review on Speech Emotion Recognition Using Deep Learning and Attention Mechanism

被引:92
|
作者
Lieskovska, Eva [1 ]
Jakubec, Maros [1 ]
Jarina, Roman [1 ]
Chmulik, Michal [1 ]
机构
[1] Univ Zilina, Fac Elect Engn & Informat Technol, Univ 8215-1, Zilina 01026, Slovakia
关键词
speech emotion recognition; deep learning; attention mechanism; recurrent neural network; long short-term memory; DATA AUGMENTATION; NEURAL-NETWORKS; FEATURES; AUDIO; CLASSIFIERS; PARAMETERS; DOMINANCE; DATABASES; AROUSAL; MODEL;
D O I
10.3390/electronics10101163
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emotions are an integral part of human interactions and are significant factors in determining user satisfaction or customer opinion. speech emotion recognition (SER) modules also play an important role in the development of human-computer interaction (HCI) applications. A tremendous number of SER systems have been developed over the last decades. Attention-based deep neural networks (DNNs) have been shown as suitable tools for mining information that is unevenly time distributed in multimedia content. The attention mechanism has been recently incorporated in DNN architectures to emphasise also emotional salient information. This paper provides a review of the recent development in SER and also examines the impact of various attention mechanisms on SER performance. Overall comparison of the system accuracies is performed on a widely used IEMOCAP benchmark database.
引用
收藏
页数:29
相关论文
共 50 条
  • [31] A Deep Learning Approach for Speech Emotion Recognition Optimization Using Meta-Learning
    Ottoni, Lara Toledo Cordeiro
    Ottoni, Andre Luiz Carvalho
    Cerqueira, Jes de Jesus Fiais
    ELECTRONICS, 2023, 12 (23)
  • [32] Ensemble deep learning with HuBERT for speech emotion recognition
    Yang, Janghoon
    2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 153 - 154
  • [33] Enhancing speech emotion recognition: a deep learning approach with self-attention and acoustic features
    Aghajani, Khadijeh
    Zohrevandi, Mahbanou
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (05):
  • [34] SPEECH EMOTION RECOGNITION-A DEEP LEARNING APPROACH
    Asiya, U. A.
    Kiran, V. K.
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 867 - 871
  • [35] Survey of Deep Representation Learning for Speech Emotion Recognition
    Latif, Siddique
    Rana, Rajib
    Khalifa, Sara
    Jurdak, Raja
    Qadir, Junaid
    Schuller, Bjorn
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1634 - 1654
  • [36] Evaluating deep learning architectures for Speech Emotion Recognition
    Fayek, Haytham M.
    Lech, Margaret
    Cavedon, Lawrence
    NEURAL NETWORKS, 2017, 92 : 60 - 68
  • [37] Lightweight Deep Learning Framework for Speech Emotion Recognition
    Akinpelu, Samson
    Viriri, Serestina
    Adegun, Adekanmi
    IEEE ACCESS, 2023, 11 : 77086 - 77098
  • [38] Deep Multimodal Emotion Recognition on Human Speech: A Review
    Koromilas, Panagiotis
    Giannakopoulos, Theodoros
    APPLIED SCIENCES-BASEL, 2021, 11 (17):
  • [39] Speech Emotion Recognition Using Gammatone Cepstral Coefficients and Deep Learning Features
    Sharan, Roneel, V
    2023 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLIED NETWORK TECHNOLOGIES, ICMLANT, 2023, : 139 - 142
  • [40] Speech emotion recognition using feature fusion: a hybrid approach to deep learning
    Khan, Waleed Akram
    ul Qudous, Hamad
    Farhan, Asma Ahmad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 75557 - 75584