Speech Emotion Recognition using Convolutional Recurrent Neural Networks and Spectrograms

被引:4
|
作者
Qamhan, Mustafa A. [1 ]
Meftah, Ali H. [1 ]
Selouani, Sid-Ahmed [2 ]
Alotaibi, Yousef A. [1 ]
Zakariah, Mohammed [1 ]
Seddiq, Yasser Mohammad [3 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[2] Univ Moncton, 218 Bvd J Gauthier, Shippegan, NB E8S 1P6, Canada
[3] King Abdulaziz City Sci & Technol, Riyadh, Saudi Arabia
关键词
emotion; classification; Arabic; spectrograms; CNN; LSTM;
D O I
10.1109/ccece47787.2020.9255752
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this study, a speech emotion recognition technique based on a deep learning neural network that uses the King Saud University Emotions' Arabic dataset is presented. The convolutional neural network and long short-term memory (LSTM) are used to design the primary system of the convolutional recurrent neural network (CRNN). This study further investigates the use of linearly spaced spectrograms as inputs to the emotional speech recognizers. The performance of the CRNN system is compared with the results obtained through an experiment evaluating the human capability to perceive the emotion from speech. This human perceptual evaluation is considered as the baseline system. The overall CRNN system achieves 84.55% and 77.51% accuracies for file and segment levels, respectively. These values of accuracy are considerably close to the human emotion perception scores.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Gender Differentiated Convolutional Neural Networks for Speech Emotion Recognition
    Mishra, Puneet
    Sharma, Ruchir
    2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 142 - 148
  • [22] FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition
    Dossou, Bonaventure F. P.
    Gbenou, Yeno K. S.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3526 - 3531
  • [23] AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION
    Mirsamadi, Seyedmahdad
    Barsoum, Emad
    Zhang, Cha
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2227 - 2231
  • [24] Segment-Based Speech Emotion Recognition Using Recurrent Neural Networks
    Tzinis, Efthymios
    Potamianos, Alexandros
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 190 - 195
  • [25] Emotion recognition in speech using neural networks
    Nicholson, J
    Takahashi, K
    Nakatsu, R
    AFFECTIVE MINDS, 2000, : 215 - 220
  • [26] Emotion recognition in speech using neural networks
    Nicholson, J
    Takahashi, K
    Nakatsu, R
    NEURAL COMPUTING & APPLICATIONS, 2000, 9 (04): : 290 - 296
  • [27] Emotion Recognition in Speech Using Neural Networks
    J. Nicholson
    K. Takahashi
    R. Nakatsu
    Neural Computing & Applications, 2000, 9 : 290 - 296
  • [28] Speech Emotion Recognition and Deep Learning: An Extensive Validation Using Convolutional Neural Networks
    Ri, Francesco Ardan Dal
    Ciardi, Fabio Cifariello
    Conci, Nicola
    IEEE ACCESS, 2023, 11 : 116638 - 116649
  • [29] Multi-Channel 2-D Convolutional Recurrent Neural Networks for Speech Emotion Recognition
    Zhou, Weidong
    Zhou, Houpan
    Xia, Pengfei
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5884 - 5889
  • [30] EEG Emotion Recognition using Parallel Hybrid Convolutional-Recurrent Neural Networks
    Putri, Nursilva Aulianisa
    Djamal, Esmeralda Contessa
    Nugraha, Fikri
    Kasyidi, Fatan
    2022 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ITS APPLICATIONS (ICODSA), 2022, : 24 - 29