Speech Emotion Recognition using Convolutional Recurrent Neural Networks and Spectrograms

被引:4
|
作者
Qamhan, Mustafa A. [1 ]
Meftah, Ali H. [1 ]
Selouani, Sid-Ahmed [2 ]
Alotaibi, Yousef A. [1 ]
Zakariah, Mohammed [1 ]
Seddiq, Yasser Mohammad [3 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[2] Univ Moncton, 218 Bvd J Gauthier, Shippegan, NB E8S 1P6, Canada
[3] King Abdulaziz City Sci & Technol, Riyadh, Saudi Arabia
关键词
emotion; classification; Arabic; spectrograms; CNN; LSTM;
D O I
10.1109/ccece47787.2020.9255752
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this study, a speech emotion recognition technique based on a deep learning neural network that uses the King Saud University Emotions' Arabic dataset is presented. The convolutional neural network and long short-term memory (LSTM) are used to design the primary system of the convolutional recurrent neural network (CRNN). This study further investigates the use of linearly spaced spectrograms as inputs to the emotional speech recognizers. The performance of the CRNN system is compared with the results obtained through an experiment evaluating the human capability to perceive the emotion from speech. This human perceptual evaluation is considered as the baseline system. The overall CRNN system achieves 84.55% and 77.51% accuracies for file and segment levels, respectively. These values of accuracy are considerably close to the human emotion perception scores.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Deep Convolutional Neural Networks for Feature Extraction in Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    HUMAN-COMPUTER INTERACTION. RECOGNITION AND INTERACTION TECHNOLOGIES, HCI 2019, PT II, 2019, 11567 : 117 - 132
  • [32] Improvement on Speech Emotion Recognition Based on Deep Convolutional Neural Networks
    Niu, Yafeng
    Zou, Dongsheng
    Niu, Yadong
    He, Zhongshi
    Tan, Hua
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON COMPUTING AND ARTIFICIAL INTELLIGENCE (ICCAI 2018), 2018, : 13 - 18
  • [33] Facial Emotion Recognition using Convolutional Neural Networks
    Rzayeva, Zeynab
    Alasgarov, Emin
    2019 IEEE 13TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT 2019), 2019, : 91 - 95
  • [34] Facial Emotion Recognition using Convolutional Neural Networks
    Gopichand, G.
    Reddy, I. Ravi Prakash
    Santhi, H.
    Akula, Vijaya Krishna
    IMPENDING INQUISITIONS IN HUMANITIES AND SCIENCES, ICIIHS-2022, 2024, : 198 - 203
  • [35] Facial emotion recognition using convolutional neural networks
    Sarvakar K.
    Senkamalavalli R.
    Raghavendra S.
    Santosh Kumar J.
    Manjunath R.
    Jaiswal S.
    Materials Today: Proceedings, 2023, 80 : 3560 - 3564
  • [36] Parallelized Convolutional Recurrent Neural Network With Spectral Features for Speech Emotion Recognition
    Jiang, Pengxu
    Fu, Hongliang
    Tao, Huawei
    Lei, Peizhi
    Zhao, Li
    IEEE ACCESS, 2019, 7 : 90368 - 90377
  • [37] Emotion recognition from speech using deep recurrent neural networks with acoustic features
    Byun, Sung-Woo
    Shin, Bo-Ra
    Lee, Seok-Pil
    Han, Hyuk-Soo
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 123 : 43 - 44
  • [38] Speech emotion recognition using recurrent neural networks with directional self-attention
    Li, Dongdong
    Liu, Jinlin
    Yang, Zhuo
    Sun, Linyu
    Wang, Zhe
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 173
  • [39] Prediction of User Emotion and Dialogue Success Using Audio Spectrograms and Convolutional Neural Networks
    Lykartsis, Athanasios
    Kotti, Margarita
    20TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2019), 2019, : 336 - 344
  • [40] Speech emotion recognition using spiking neural networks
    Buscicchio, Cosimo A.
    Gorecki, Przemyslaw
    Caponetti, Laura
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 38 - 46