Deep Learning Framework for Speech Emotion Classification: A Survey of the State-of-the-Art

被引:0
|
作者
Akinpelu, Samson [1 ]
Viriri, Serestina [1 ]
机构
[1] University of KwaZulu-Natal, School of Mathematics, Statistics and Computer Science, Durban,4041, South Africa
关键词
Adversarial machine learning - Contrastive Learning - Convolutional neural networks - Deep learning - Emotion Recognition - Image enhancement - Speech enhancement - Speech recognition;
D O I
10.1109/ACCESS.2024.3474553
中图分类号
学科分类号
摘要
The intricate landscape of speech emotion classification poses a captivating yet challenging realm due to emotions being fundamental to human communication. In recent years, deep learning frameworks have emerged as powerful tools, shedding light on the elusive domain of emotion recognition, revolutionizing human-computer interactions, and enhancing the emotional intelligence of artificial intelligence (AI). This survey embarks on an exploratory journey into the forefront of deep learning approaches dedicated to speech emotion classification. Deep learning has become the standard approach due to the scarcity of extensive speech corpora and the need for high accuracy at low computational cost. The reason lies in its potency to extract important emotional features from large or medium-sized spectrogram images. Deep learning has been applied to speech emotion classification by many researchers, leading to significant improvements in performance and accuracy. Modern deep learning methods designed for human auditory speech emotion classification are carefully examined in this work. A thorough examination of various deep learning framework designs used in emotion classification is provided, illuminating unique characteristics that capture essential features from speech signals for accurate emotion prediction. The research critically analyzes selected deep models using well-established emotion corpora, highlighting their effectiveness. This research analyses typical performance evaluation metrics used to evaluate speech emotion classification models. With this review, we hope to offer a comprehensive overview of the state-of-the-art, potential directions for further investigation, and developing approaches that further the field of speech emotion classification with deep learning frameworks. © 2013 IEEE.
引用
收藏
页码:152152 / 152182
相关论文
共 50 条
  • [1] Deep learning approach for facial age classification: a survey of the state-of-the-art
    Agbo-Ajala, Olatunbosun
    Viriri, Serestina
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) : 179 - 213
  • [2] Deep learning approach for facial age classification: a survey of the state-of-the-art
    Olatunbosun Agbo-Ajala
    Serestina Viriri
    Artificial Intelligence Review, 2021, 54 : 179 - 213
  • [3] A State-of-the-Art Survey on Deep Learning Theory and Architectures
    Alom, Md Zahangir
    Taha, Tarek M.
    Yakopcic, Chris
    Westberg, Stefan
    Sidike, Paheding
    Nasrin, Mst Shamima
    Hasan, Mahmudul
    Van Essen, Brian C.
    Awwal, Abdul A. S.
    Asari, Vijayan K.
    ELECTRONICS, 2019, 8 (03)
  • [4] The Fusion of Deep Learning and Fuzzy Systems: A State-of-the-Art Survey
    Zheng, Yuanhang
    Xu, Zeshui
    Wang, Xinxin
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (08) : 2783 - 2799
  • [5] Deep learning techniques for rating prediction: a survey of the state-of-the-art
    Zahid Younas Khan
    Zhendong Niu
    Sulis Sandiwarno
    Rukundo Prince
    Artificial Intelligence Review, 2021, 54 : 95 - 135
  • [6] State-of-the-Art survey of deep learning based sketch retrieval
    Ji Ziheng
    2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING (ICAICE 2020), 2020, : 6 - 14
  • [7] State-of-the-art Survey on Fuzz Testing for Deep Learning System
    Dai H.-P.
    Sun C.-A.
    Jin H.
    Xiao M.-J.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (11): : 5008 - 5028
  • [8] Deep Learning for Brain Tumor Segmentation: A Survey of State-of-the-Art
    Magadza, Tirivangani
    Viriri, Serestina
    JOURNAL OF IMAGING, 2021, 7 (02)
  • [9] DEEP LEARNING IN NATURAL LANGUAGE PROCESSING: A STATE-OF-THE-ART SURVEY
    Chai, Junyi
    Li, Anming
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2019, : 535 - 540
  • [10] Deep Learning for Edge Computing Applications: A State-of-the-Art Survey
    Wang, Fangxin
    Zhang, Miao
    Wang, Xiangxiang
    Ma, Xiaoqiang
    Liu, Jiangchuan
    IEEE ACCESS, 2020, 8 : 58322 - 58336