A novel convolutional neural network with gated recurrent unit for automated speech emotion recognition and classification

被引:11
|
作者
Prakash, P. Ravi [1 ]
Anuradha, D. [2 ]
Iqbal, Javid [3 ]
Galety, Mohammad Gouse [4 ]
Singh, Ruby [5 ]
Neelakandan, S. [6 ]
机构
[1] Prasad V Potluri Siddhartha Inst Technol, Dept IT, Vijayawada, India
[2] Panimalar Engn Coll, Dept Comp Sci & Business Syst, Chennai, Tamil Nadu, India
[3] UCSI Univ, Inst Comp Sci & Digital Technol ICSDI, Kuala Lumpur, Malaysia
[4] Catholic Univ Erbil, Coll IT & CS, Dept Informat Technol, Erbil, Iraq
[5] SRM Inst Sci & Technol, Dept CSE, Ghaziabad, Uttar Pradesh, India
[6] RMK Engn Coll, Dept CSE, Sriperumbudur, India
关键词
Emotion recognition; speech recognition; deep learning; classification model; Berlin emotion dataset; DOMAIN-ADVERSARIAL; FEATURES; MODELS;
D O I
10.1080/23307706.2022.2085198
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automated Speech Emotion Recognition (SER) becomes more popular and has increased applicability. SER concentrates on the automatic identification of the emotional state of a human being using speech signals. It mainly depends upon the in-depth analysis of the speech signal, extracts features containing emotional details from the speech signal, and utilises pattern recognition techniques for emotional state identification. The major problem in automatic SER is to extract discriminate, powerful, and emotional salient features from the acoustical content of speech signals. The proposed model aims to detect and classify three emotional states of speech such as happy, neutral, and sad. The presented model makes use of Convolution neural network - Gated Recurrent unit (CNN-GRU) based feature extraction technique which derives a set of feature vectors. A comprehensive simulation takes place using the Berlin German Database and SJTU Chinese Database which comprises numerous audio files under a collection of different emotion labels.
引用
收藏
页码:54 / 63
页数:10
相关论文
共 50 条
  • [41] Constructing Speech Emotion Recognition Model Based on Convolutional Neural Network
    Kuo, Jong-Yih
    Chen, Zhao-Ming
    Lin, Hui-Chi
    [J]. 2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE WORKSHOPS (APSECW 2021), 2021, : 52 - 56
  • [42] Speech Emotion Recognition through Hybrid Features and Convolutional Neural Network
    Alluhaidan, Ala Saleh
    Saidani, Oumaima
    Jahangir, Rashid
    Nauman, Muhammad Asif
    Neffati, Omnia Saidani
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [43] Automatic Speech Recognition trained with Convolutional Neural Network and predicted with Recurrent Neural Network
    Soundarya, M.
    Karthikeyan, P. R.
    Thangarasu, Gunasekar
    [J]. 2023 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENERGY SYSTEMS, ICEES, 2023, : 41 - 45
  • [44] Multiple attention convolutional-recurrent neural networks for speech emotion recognition
    Zhang, Zhihao
    Wang, Kunxia
    [J]. 2022 10TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2022,
  • [45] COMPACT CONVOLUTIONAL RECURRENT NEURAL NETWORKS VIA BINARIZATION FOR SPEECH EMOTION RECOGNITION
    Zhao, Huan
    Xiao, Yufeng
    Han, Jing
    Zhang, Zixing
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6690 - 6694
  • [46] Emotion Classification Based on Convolutional Neural Network Using Speech Data
    Vrebcevic, N.
    Mijic, I.
    Petrinovic, D.
    [J]. 2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1007 - 1012
  • [47] Revolutionizing Speech Emotion Recognition: A Novel Hilbert Curve Approach for Two-Dimensional Representation and Convolutional Neural Network Classification
    Tyagi, Suryakant
    Szenasi, Sandor
    [J]. ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2024, 2024, 157 : 75 - 85
  • [48] Fully Convolutional Network With Gated Recurrent Unit for Hatching Egg Activity Classification
    Geng, Lei
    Wang, Haiyue
    Xiao, Zhitao
    Zhang, Fang
    Wu, Jun
    Liu, Yanbei
    [J]. IEEE ACCESS, 2019, 7 : 92378 - 92387
  • [49] Ensemble of Gated Recurrent Unit and Convolutional Neural Network for Sarcasm Detection in Bangla
    Farhan, Niloy
    Awishi, Ishrat Tasnim
    Mehedi, Md Humaion Kabir
    Alam, Md. Mustakin
    Rasel, Annajiat Alim
    [J]. 2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 624 - 629
  • [50] Cryptocurrency Price Prediction with Convolutional Neural Network and Stacked Gated Recurrent Unit
    Kang, Chuen Yik
    Lee, Chin Poo
    Lim, Kian Ming
    [J]. DATA, 2022, 7 (11)