Speech-based emotion recognition using a hybrid RNN-CNN network

Cited by: 0
Authors
Ning, Jingtao [1 ]
Zhang, Wenchuan [1 ]
Affiliations
[1] Lanzhou Petrochem Univ Vocat Technol, Coll Informat Engn, Lanzhou 730060, Gansu, Peoples R China
Keywords
Speech emotion recognition; Deep learning; Recurrent neural network; Convolutional neural network; Wide kernel; Classification; DEEP;
DOI
10.1007/s11760-024-03574-7
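Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract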
Speech emotion recognition (SER) is among the most active areas of modern research on speech signal analysis; it aims to estimate and classify the wide range of emotions that speakers express. This paper develops a deep learning (DL)-based model for detecting emotion variation in speech that overcomes several weaknesses of existing data-driven approaches. A new DL architecture, referred to as RNN-CNN, is proposed and applied to the SER task, operating directly on raw speech signals. Specifically, the challenge was to incorporate an initial convolution layer with a wide kernel effectively, as an efficient way to mitigate the problems caused by noise in raw speech signals. The proposed RNN-CNN model is evaluated on three databases: RML, RAVDESS, and SAVEE. The reported accuracy rates improve on those of previous works evaluated on the same datasets. This assessment supports the robust performance and applicability of the proposed model across diverse speech databases and underlines its potential for further speech-based emotion recognition.
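The abstract gives no layer sizes, so the following is only an illustrative sketch of why a wide first-layer kernel can help with noisy raw signals, not the authors' implementation. It assumes a fixed uniform averaging kernel (a real network would learn its kernel weights), a synthetic sine wave standing in for speech, and Gaussian noise; the kernel widths 3 and 64, the stride of 8, and all function names are hypothetical choices made for the example.

```python
import math
import random


def conv1d(signal, kernel, stride=1):
    """'Valid' 1-D sliding dot product, as computed by a convolution layer."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(0, len(signal) - k + 1, stride)
    ]


def rms(values):
    """Root-mean-square of a list of numbers."""
    return math.sqrt(sum(v * v for v in values) / len(values))


# Synthetic stand-in for a raw speech waveform (assumption: a slow sine wave).
random.seed(0)
n = 2000
clean = [math.sin(2 * math.pi * 5 * t / n) for t in range(n)]
noisy = [c + random.gauss(0.0, 0.5) for c in clean]

# Assumed kernels: uniform weights, narrow (3 taps) versus wide (64 taps).
narrow_kernel = [1 / 3] * 3
wide_kernel = [1 / 64] * 64


def residual_noise(kernel):
    """RMS of the noise that survives the layer: filtered noisy minus filtered clean."""
    filtered_noisy = conv1d(noisy, kernel)
    filtered_clean = conv1d(clean, kernel)
    return rms([a - b for a, b in zip(filtered_noisy, filtered_clean)])


err_narrow = residual_noise(narrow_kernel)
err_wide = residual_noise(wide_kernel)
print(f"residual noise, narrow kernel: {err_narrow:.3f}")
print(f"residual noise, wide kernel:   {err_wide:.3f}")

# A stride larger than 1 also shortens the sequence handed to later layers.
strided = conv1d(noisy, wide_kernel, stride=8)
print(f"{len(noisy)} input samples -> {len(strided)} output steps")
```

Under these assumptions the wide kernel averages over many more samples per output step, so less of the uncorrelated noise survives, and the strided output gives any subsequent recurrent layer a much shorter sequence to process.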
Pages: 10
Related papers
50 in total
  • [21] CochleaSpecNet: An Attention-Based Dual Branch Hybrid CNN-GRU Network for Speech Emotion Recognition Using Cochleagram and Spectrogram
    Namey, Atkia Anika
    Akter, Khadija
    Hossain, Md. Azad
    Dewan, M. Ali Akber
    IEEE ACCESS, 2024, 12 : 190760 - 190774
  • [22] Time-Continuous Emotion Recognition Using Spectrogram Based CNN-RNN Modelling
    Fedotov, Dmitrii
    Kim, Bobae
    Karpov, Alexey
    Minker, Wolfgang
    SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 93 - 102
  • [23] SPEECH-BASED EMOTION CLASSIFICATION USING MULTICLASS SVM WITH HYBRID KERNEL AND THRESHOLDING FUSION
    Yang, N.
    Muraleedharan, R.
    Kohl, J.
    Demirkol, I.
    Heinzelman, W.
    Sturge-Apple, M.
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 455 - 460
  • [24] Robust Multi-Scenario Speech-Based Emotion Recognition System
    Fangfang Zhu-Zhou
    Gil-Pita, Roberto
    Garcia-Gomez, Joaquin
    Rosa-Zurera, Manuel
    SENSORS, 2022, 22 (06)
  • [25] Speech-based Emotion Recognition: Application of Collective Decision Making Concepts
    Brester, Christina
    Semenkin, Eugene
    Sidorov, Maxim
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (ICCSAI 2014), 2015, : 216 - 220
  • [26] Contemporary Stochastic Feature Selection Algorithms for Speech-based Emotion Recognition
    Sidorov, Maxim
    Brester, Christina
    Schmitt, Alexander
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2699 - 2703
  • [27] Hybrid Facial Emotion Recognition Using CNN-Based Features
    Shahzad, H. M.
    Bhatti, Sohail Masood
    Jaffar, Arfan
    Akram, Sheeraz
    Alhajlah, Mousa
    Mahmood, Awais
    APPLIED SCIENCES-BASEL, 2023, 13 (09)
  • [28] Simulation of English speech emotion recognition based on transfer learning and CNN neural network
    Chen, Xuehua
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (02) : 2349 - 2360
  • [29] Hybrid Time Distributed CNN-transformer for Speech Emotion Recognition
    Slimi, Anwer
    Nicolas, Henri
    Zrigui, Mounir
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES (ICSOFT), 2022, : 602 - 611
  • [30] Survey on Machine Learning in Speech Emotion Recognition and Vision Systems Using a Recurrent Neural Network (RNN)
    Satya Prakash Yadav
    Subiya Zaidi
    Annu Mishra
    Vibhash Yadav
    Archives of Computational Methods in Engineering, 2022, 29 : 1753 - 1770