Deep Learning Framework for Speech Emotion Classification: A Survey of the State-of-the-Art

Authors
Akinpelu, Samson [1]
Viriri, Serestina [1]
Affiliations
[1] University of KwaZulu-Natal, School of Mathematics, Statistics and Computer Science, Durban, 4041, South Africa
Keywords
Adversarial machine learning - Contrastive Learning - Convolutional neural networks - Deep learning - Emotion Recognition - Image enhancement - Speech enhancement - Speech recognition
DOI
10.1109/ACCESS.2024.3474553
Abstract
The intricate landscape of speech emotion classification poses a captivating yet challenging realm due to emotions being fundamental to human communication. In recent years, deep learning frameworks have emerged as powerful tools, shedding light on the elusive domain of emotion recognition, revolutionizing human-computer interactions, and enhancing the emotional intelligence of artificial intelligence (AI). This survey embarks on an exploratory journey into the forefront of deep learning approaches dedicated to speech emotion classification. Deep learning has become the standard approach due to the scarcity of extensive speech corpora and the need for high accuracy at low computational cost. The reason lies in its potency to extract important emotional features from large or medium-sized spectrogram images. Deep learning has been applied to speech emotion classification by many researchers, leading to significant improvements in performance and accuracy. Modern deep learning methods designed for human auditory speech emotion classification are carefully examined in this work. A thorough examination of various deep learning framework designs used in emotion classification is provided, illuminating unique characteristics that capture essential features from speech signals for accurate emotion prediction. The research critically analyzes selected deep models using well-established emotion corpora, highlighting their effectiveness. This research analyses typical performance evaluation metrics used to evaluate speech emotion classification models. With this review, we hope to offer a comprehensive overview of the state-of-the-art, potential directions for further investigation, and developing approaches that further the field of speech emotion classification with deep learning frameworks. © 2013 IEEE.
Pages: 152152 - 152182
Related papers
50 in total
  • [41] Graph Learning for Combinatorial Optimization: A Survey of State-of-the-Art
    Peng, Yun
    Choi, Byron
    Xu, Jianliang
    DATA SCIENCE AND ENGINEERING, 2021, 6 (02) : 119 - 141
  • [42] Review of State-of-the-Art in Deep Learning Artificial Intelligence
    Shakirov V.V.
    Solovyeva K.P.
    Dunin-Barkowski W.L.
    Optical Memory and Neural Networks, 2018, 27 (2) : 65 - 80
  • [43] State-of-the-Art Deep Learning in Cardiovascular Image Analysis
    Litjens, Geert
    Ciompi, Francesco
    Wolterink, Jelmer M.
    de Vos, Bob D.
    Leiner, Tim
    Teuwen, Jonas
    Isgum, Ivana
    JACC-CARDIOVASCULAR IMAGING, 2019, 12 (08) : 1549 - 1565
  • [44] Benchmarking State-of-the-Art Deep Learning Software Tools
    Shi, Shaohuai
    Wang, Qiang
    Xu, Pengfei
    Chu, Xiaowen
    2016 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2016, : 99 - 104
  • [45] State-of-the-art review on deep learning in medical imaging
    Biswas, Mainak
    Kuppili, Venkatanareshbabu
    Saba, Luca
    Edla, Damodar Reddy
    Suri, Harman S.
    Cuadrado-Godia, Elisa
    Laird, John R.
    Marinhoe, Rui Tato
    Sanches, Joao M.
    Nicolaides, Andrew
    Suri, Jasjit S.
    FRONTIERS IN BIOSCIENCE-LANDMARK, 2019, 24 : 392 - 426
  • [46] Deep learning and the electrocardiogram: review of the current state-of-the-art
    Somani, Sulaiman
    Russak, Adam J.
    Richter, Felix
    Zhao, Shan
    Vaid, Akhil
    Chaudhry, Fayzan
    De Freitas, Jessica K.
    Naik, Nidhi
    Miotto, Riccardo
    Nadkarni, Girish N.
    Narula, Jagat
    Argulian, Edgar
    Glicksberg, Benjamin S.
    EUROPACE, 2021, 23 (08): : 1179 - 1191
  • [47] Chart classification: a survey and benchmarking of different state-of-the-art methods
    Jennil Thiyam
    Sanasam Ranbir Singh
    Prabin Kumar Bora
    International Journal on Document Analysis and Recognition (IJDAR), 2024, 27 : 19 - 44
  • [48] Chart classification: a survey and benchmarking of different state-of-the-art methods
    Thiyam, Jennil
    Singh, Sanasam Ranbir
    Bora, Prabin Kumar
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024, 27 (01) : 19 - 44
  • [49] Automatic Speech Recognition System for Tonal Languages: State-of-the-Art Survey
    Kaur, Jaspreet
    Singh, Amitoj
    Kadyan, Virender
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2021, 28 (03) : 1039 - 1068
  • [50] Deep learning techniques for skin lesion analysis and melanoma cancer detection: a survey of state-of-the-art
    Adegun, Adekanmi
    Viriri, Serestina
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (02) : 811 - 841