Speech Emotion Classification Using Deep Learning

被引:0
|
作者
Mishra, Siba Prasad [1 ]
Warule, Pankaj [1 ]
Deb, Suman [1 ]
机构
[1] Sardar Vallabhbhai Natl Inst Technol, Surat, India
关键词
Speech emotion recognition; Deep neural network signal; Convolutional neural network; Long short-term memory network; RECOGNITION;
D O I
10.1007/978-981-97-1549-7_2
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Over the last several years, the fields of affective computing and human-computer interaction have shown a great deal of interest in the topic of speech emotion recognition studies. In the study of speech emotion classification, numerous strategies, many of which are well-established methodologies, have been utilized to extract emotions from speech signals. These strategies include speech analysis and classification methods. Deep learning techniques have recently been proposed as a potential alternative to more traditional methods for the classification of the emotions conveyed in speech. In this study, we used MFCC features for emotion classification and compared the performance of DNN, DCNN, and CNN-LSTM models on RAVDESS, EMODB, and TESS datasets. In the proposed models, DNN achieved an average accuracy of 76.38% on RAVDESS, 82.83% on EMODB, and 100% on TESS. The overall accuracy of DCNN was 74.65% when it was tested on RAVDESS, 77% when it was tested on EMODB, and 100% when it was tested on TESS. On RAVDESS, CNN-LSTM achieved an average accuracy of 70.27%, while on EMODB it achieved 71.8%, and on TESS it achieved 99%.
引用
收藏
页码:19 / 31
页数:13
相关论文
共 50 条
  • [1] Speech Based Multiple Emotion Classification Model Using Deep Learning
    Patneedi, Shakti Swaroop
    Kumari, Nandini
    [J]. ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 : 648 - 659
  • [2] Classification of Human Emotion from Speech Data Using Deep Learning
    Kanjanawattana, Sarunya
    Jarat, Atsadayoot
    Praneetpholkrang, Panchalee
    Bhakdisongkhram, Gun
    Weeragulpiriya, Suchada
    [J]. 2022 IEEE THE 5TH INTERNATIONAL CONFERENCE ON BIG DATA AND ARTIFICIAL INTELLIGENCE (BDAI 2022), 2022, : 1 - 5
  • [3] Speech Emotion Recognition Using Deep Learning
    Alagusundari, N.
    Anuradha, R.
    [J]. ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 313 - 325
  • [4] Speech Emotion Recognition Using Deep Learning
    Ahmed, Waqar
    Riaz, Sana
    Iftikhar, Khunsa
    Konur, Savas
    [J]. ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
  • [5] Emotion Classification of Songs Using Deep Learning
    Mate, Nikita
    Akre, Durva
    Patil, Gaurav
    Sakarkar, Gopal
    Basuki, Thomas Anung
    [J]. 2022 INTERNATIONAL CONFERENCE ON GREEN ENERGY, COMPUTING AND SUSTAINABLE TECHNOLOGY (GECOST), 2022, : 303 - 308
  • [6] A Robust Deep Transfer Learning Model for Accurate Speech Emotion Classification
    Akinpelu, Samson
    Viriri, Serestina
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 : 419 - 430
  • [7] Improved Speech Emotion Classification Using Deep Neural Network
    Mariwan Hama Saeed
    [J]. Circuits, Systems, and Signal Processing, 2023, 42 : 7357 - 7376
  • [8] Improved Speech Emotion Classification Using Deep Neural Network
    Saeed, Mariwan Hama
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (12) : 7357 - 7376
  • [9] Speech Emotion Recognition Using Deep Neural Networks, Transfer Learning, and Ensemble Classification Techniques
    Mihalache, Serban
    Burileanu, Dragos
    [J]. ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2023, 26 (3-4): : 375 - 387
  • [10] Robust Feature Selection-Based Speech Emotion Classification Using Deep Transfer Learning
    Akinpelu, Samson
    Viriri, Serestina
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (16):