Speech emotion recognition and classification using hybrid deep CNN and BiLSTM model

被引:0
|
作者
Swami Mishra
Nehal Bhatnagar
Prakasam P
Sureshkumar T. R
机构
[1] Vellore Institute of Technology,School of Electronics Engineering
来源
关键词
Speech emotion recognition; Deep convolutional neural networks; LSTM; MFSC; Ensemble learning;
D O I
暂无
中图分类号
学科分类号
摘要
Accurate emotion detection from speech utterances has been a challenging and active research affair recently. Speech emotion recognition (SER) systems play an essential role in Human-machine interaction, virtual reality, emergency services, and many other real-time systems. It is an open-ended problem as subjects from different regions and lingual backgrounds convey emotions altogether differently. The conventional approach used low-level periodic features from audio samples like energy, pitch, etc., for classification but was not efficient enough to detect emotions accurately and not generalized. With the recent advancements in computer vision and neural networks extracting high-level features and more accurate recognition can be achieved. This study proposes an ensemble deep CNN + Bi-LSTM-based framework for speech emotion recognition and classification of seven different emotions. The paralinguistic log Mel-frequency spectral coefficients (MFSC) is used as a feature to train the proposed architecture. The proposed Hybrid model is validated with TESS and SAVEE datasets. Experimental results have indicated a classification accuracy of 96.36%. The proposed model is compared with existing models, proving the superiority of the proposed hybrid deep CNN and Bi-LSTM model.
引用
收藏
页码:37603 / 37620
页数:17
相关论文
共 50 条
  • [21] Dynamic Music emotion recognition based on CNN-BiLSTM
    Du, Pengfei
    Li, Xiaoyong
    Gao, Yali
    [J]. PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 1372 - 1376
  • [22] Learning Salient Features for Speech Emotion Recognition Using CNN
    Liu, Jiamu
    Han, Wenjing
    Ruan, Huabin
    Chen, Xiaomin
    Jiang, Dongmei
    Li, Haifeng
    [J]. 2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018,
  • [23] Comparative Analysis of Windows for Speech Emotion Recognition Using CNN
    Teixeira, Felipe L.
    Soares, Salviano Pinto
    Abreu, J. L. Pio
    Oliveira, Paulo M.
    Teixeira, Joao P.
    [J]. OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, PT I, OL2A 2023, 2024, 1981 : 233 - 248
  • [24] Speech Emotion Recognition using XGBoost and CNN BLSTM with Attention
    He, Jingru
    Ren, Liyong
    [J]. 2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 154 - 159
  • [25] Research on EEG emotion recognition based on CNN+BiLSTM+self-attention model
    LI Xueqing
    LI Penghai
    FANG Zhendong
    CHENG Longlong
    WANG Zhiyong
    WANG Weijie
    [J]. Optoelectronics Letters, 2023, 19 (08) : 506 - 512
  • [26] Music Emotion Recognition Fusion on CNN-BiLSTM and Self-Attention Model
    Zhong, Zhipeng
    Wang, Hailong
    Su, Guibin
    Liu, Lin
    Pei, Dongmei
    [J]. Computer Engineering and Applications, 2024, 59 (03) : 94 - 103
  • [27] Pattern recognition and features selection for speech emotion recognition model using deep learning
    Jermsittiparsert, Kittisak
    Abdurrahman, Abdurrahman
    Siriattakul, Parinya
    Sundeeva, Ludmila A.
    Hashim, Wahidah
    Rahim, Robbi
    Maseleno, Andino
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (04) : 799 - 806
  • [28] Pattern recognition and features selection for speech emotion recognition model using deep learning
    Kittisak Jermsittiparsert
    Abdurrahman Abdurrahman
    Parinya Siriattakul
    Ludmila A. Sundeeva
    Wahidah Hashim
    Robbi Rahim
    Andino Maseleno
    [J]. International Journal of Speech Technology, 2020, 23 : 799 - 806
  • [29] Recognition of emotions in speech using deep CNN and RESNET
    Lakshmi, Kanchi Lohitha
    Muthulakshmi, P.
    Nithya, A. Alice
    Jeyavathana, R. Beaulah
    Usharani, R.
    Das, Nishi S.
    Devi, G. Naga Rama
    [J]. SOFT COMPUTING, 2023,
  • [30] ECG signal classification based on deep CNN and BiLSTM
    Cheng, Jinyong
    Zou, Qingxu
    Zhao, Yunxiang
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)