A novel convolutional neural network with gated recurrent unit for automated speech emotion recognition and classification

被引:11
|
作者
Prakash, P. Ravi [1 ]
Anuradha, D. [2 ]
Iqbal, Javid [3 ]
Galety, Mohammad Gouse [4 ]
Singh, Ruby [5 ]
Neelakandan, S. [6 ]
机构
[1] Prasad V Potluri Siddhartha Inst Technol, Dept IT, Vijayawada, India
[2] Panimalar Engn Coll, Dept Comp Sci & Business Syst, Chennai, Tamil Nadu, India
[3] UCSI Univ, Inst Comp Sci & Digital Technol ICSDI, Kuala Lumpur, Malaysia
[4] Catholic Univ Erbil, Coll IT & CS, Dept Informat Technol, Erbil, Iraq
[5] SRM Inst Sci & Technol, Dept CSE, Ghaziabad, Uttar Pradesh, India
[6] RMK Engn Coll, Dept CSE, Sriperumbudur, India
关键词
Emotion recognition; speech recognition; deep learning; classification model; Berlin emotion dataset; DOMAIN-ADVERSARIAL; FEATURES; MODELS;
D O I
10.1080/23307706.2022.2085198
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automated Speech Emotion Recognition (SER) becomes more popular and has increased applicability. SER concentrates on the automatic identification of the emotional state of a human being using speech signals. It mainly depends upon the in-depth analysis of the speech signal, extracts features containing emotional details from the speech signal, and utilises pattern recognition techniques for emotional state identification. The major problem in automatic SER is to extract discriminate, powerful, and emotional salient features from the acoustical content of speech signals. The proposed model aims to detect and classify three emotional states of speech such as happy, neutral, and sad. The presented model makes use of Convolution neural network - Gated Recurrent unit (CNN-GRU) based feature extraction technique which derives a set of feature vectors. A comprehensive simulation takes place using the Berlin German Database and SJTU Chinese Database which comprises numerous audio files under a collection of different emotion labels.
引用
收藏
页码:54 / 63
页数:10
相关论文
共 50 条
  • [31] Real Time Human Activity Recognition Using Convolutional Neural Network and Deep Gated Recurrent Unit
    Fajar, Rasyid
    Suciati, Nanik
    Navastara, Dini Adni
    [J]. 2020 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICELTICS 2020), 2020, : 58 - 63
  • [32] Attention-Based Convolutional Neural Network and Bidirectional Gated Recurrent Unit for Human Activity Recognition
    Tao, Shuai
    Zhao, Zhiqiang
    Qin, Jing
    Ji, Changqing
    Wang, Zumin
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1128 - 1134
  • [33] Integrating Convolutional Neural Network and Gated Recurrent Unit for Hyperspectral Image Spectral-Spatial Classification
    Zhou, Feng
    Hang, Renlong
    Liu, Qingshan
    Yuan, Xiaotong
    [J]. PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 409 - 420
  • [34] A Convolutional Gated Recurrent Network for Speech Enhancement
    Yuan W.-H.
    Hu S.-D.
    Shi Y.-L.
    Li Z.
    Liang C.-Y.
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (07): : 1276 - 1283
  • [35] Improving Speech Emotion Recognition Using Graph Attentive Bi-directional Gated Recurrent Unit Network
    Su, Bo-Hao
    Chang, Chun-Min
    Lin, Yun-Shao
    Lee, Chi-Chun
    [J]. INTERSPEECH 2020, 2020, : 506 - 510
  • [36] Speech Emotion Recognition from Spectrograms with Deep Convolutional Neural Network
    Badshah, Abdul Malik
    Ahmad, Jamil
    Rahim, Nasir
    Baik, Sung Wook
    [J]. 2017 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON), 2017, : 125 - 129
  • [37] Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network
    Yang, Zijun
    Zhou, Shi
    Zhang, Lifeng
    Serikawa, Seiichi
    [J]. Cognitive Robotics, 2024, 4 : 30 - 41
  • [38] Speech Emotion Recognition in Neurological Disorders Using Convolutional Neural Network
    Zisad, Sharif Noor
    Hossain, Mohammad Shahadat
    Andersson, Karl
    [J]. BRAIN INFORMATICS, BI 2020, 2020, 12241 : 287 - 296
  • [39] Recognition of meal information using recurrent neural network and gated recurrent unit
    Zhang, Liyang
    Suzuki, Hiroyuki
    Koyama, Akio
    [J]. INTERNET OF THINGS, 2021, 13
  • [40] Convolutional Neural Network with Spectrogram and Perceptual Features for Speech Emotion Recognition
    Zhang, Linjuan
    Wang, Longbiao
    Dang, Jianwu
    Guo, Lili
    Guan, Haotian
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 62 - 71