Deep Learning Framework for Speech Emotion Classification: A Survey of the State-of-the-Art

被引:0
|
作者
Akinpelu, Samson [1 ]
Viriri, Serestina [1 ]
机构
[1] University of KwaZulu-Natal, School of Mathematics, Statistics and Computer Science, Durban,4041, South Africa
关键词
Adversarial machine learning - Contrastive Learning - Convolutional neural networks - Deep learning - Emotion Recognition - Image enhancement - Speech enhancement - Speech recognition;
D O I
10.1109/ACCESS.2024.3474553
中图分类号
学科分类号
摘要
The intricate landscape of speech emotion classification poses a captivating yet challenging realm due to emotions being fundamental to human communication. In recent years, deep learning frameworks have emerged as powerful tools, shedding light on the elusive domain of emotion recognition, revolutionizing human-computer interactions, and enhancing the emotional intelligence of artificial intelligence (AI). This survey embarks on an exploratory journey into the forefront of deep learning approaches dedicated to speech emotion classification. Deep learning has become the standard approach due to the scarcity of extensive speech corpora and the need for high accuracy at low computational cost. The reason lies in its potency to extract important emotional features from large or medium-sized spectrogram images. Deep learning has been applied to speech emotion classification by many researchers, leading to significant improvements in performance and accuracy. Modern deep learning methods designed for human auditory speech emotion classification are carefully examined in this work. A thorough examination of various deep learning framework designs used in emotion classification is provided, illuminating unique characteristics that capture essential features from speech signals for accurate emotion prediction. The research critically analyzes selected deep models using well-established emotion corpora, highlighting their effectiveness. This research analyses typical performance evaluation metrics used to evaluate speech emotion classification models. With this review, we hope to offer a comprehensive overview of the state-of-the-art, potential directions for further investigation, and developing approaches that further the field of speech emotion classification with deep learning frameworks. © 2013 IEEE.
引用
收藏
页码:152152 / 152182
相关论文
共 50 条
  • [31] Enhancing multimodal disaster tweet classification using state-of-the-art deep learning networks
    Divakaran Adwaith
    Ashok Kumar Abishake
    Siva Venkatesh Raghul
    Elango Sivasankar
    Multimedia Tools and Applications, 2022, 81 : 18483 - 18501
  • [32] Enhancing multimodal disaster tweet classification using state-of-the-art deep learning networks
    Adwaith, Divakaran
    Abishake, Ashok Kumar
    Raghul, Siva Venkatesh
    Sivasankar, Elango
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (13) : 18483 - 18501
  • [33] DL4SciVis: A State-of-the-Art Survey on Deep Learning for Scientific Visualization
    Wang, Chaoli
    Han, Jun
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (08) : 3714 - 3733
  • [34] State-of-the-Art Analysis of Deep Learning-Based Monaural Speech Source Separation Techniques
    Soni, Swati
    Yadav, Ram Narayan
    Gupta, Lalita
    IEEE ACCESS, 2023, 11 : 4242 - 4269
  • [35] A survey on facial emotion recognition techniques: A state-of-the-art literature review
    Canal, Felipe Zago
    Mueller, Tobias Rossi
    Matias, Jhennifer Cristine
    Scotton, Gustavo Gino
    de Sa, Antonio Reis
    Pozzebon, Eliane
    Sobieranski, Antonio Carlos
    INFORMATION SCIENCES, 2022, 582 : 593 - 617
  • [36] State-of-the-art Survey on Personalized Learning Path Recommendation
    Yun, Yue
    Dai, Huan
    Zhang, Yu-Pei
    Shang, Xue-Qun
    Li, Zhan-Huai
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (12): : 4590 - 4615
  • [37] COMPUTER SPEECH - STATE-OF-THE-ART
    LAGARDE, PM
    SOUTH AFRICAN JOURNAL OF SCIENCE, 1987, 83 (03) : 125 - 127
  • [38] Mobile learning: a state-of-the-art review survey and analysis
    Sarrab, Mohamed
    Elbasir, Mahmoud
    INTERNATIONAL JOURNAL OF INNOVATION AND LEARNING, 2016, 20 (04) : 347 - 383
  • [39] Graph Learning for Combinatorial Optimization: A Survey of State-of-the-Art
    Yun Peng
    Byron Choi
    Jianliang Xu
    Data Science and Engineering, 2021, 6 : 119 - 141
  • [40] THE STATE-OF-THE-ART IN SPEECH RECOGNITION
    BISIANI, R
    TRENDS IN NEUROSCIENCES, 1985, 8 (01) : 9 - 11