Speech Emotion Recognition using Convolutional Recurrent Neural Networks and Spectrograms

被引:4
|
作者
Qamhan, Mustafa A. [1 ]
Meftah, Ali H. [1 ]
Selouani, Sid-Ahmed [2 ]
Alotaibi, Yousef A. [1 ]
Zakariah, Mohammed [1 ]
Seddiq, Yasser Mohammad [3 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[2] Univ Moncton, 218 Bvd J Gauthier, Shippegan, NB E8S 1P6, Canada
[3] King Abdulaziz City Sci & Technol, Riyadh, Saudi Arabia
关键词
emotion; classification; Arabic; spectrograms; CNN; LSTM;
D O I
10.1109/ccece47787.2020.9255752
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this study, a speech emotion recognition technique based on a deep learning neural network that uses the King Saud University Emotions' Arabic dataset is presented. The convolutional neural network and long short-term memory (LSTM) are used to design the primary system of the convolutional recurrent neural network (CRNN). This study further investigates the use of linearly spaced spectrograms as inputs to the emotional speech recognizers. The performance of the CRNN system is compared with the results obtained through an experiment evaluating the human capability to perceive the emotion from speech. This human perceptual evaluation is considered as the baseline system. The overall CRNN system achieves 84.55% and 77.51% accuracies for file and segment levels, respectively. These values of accuracy are considerably close to the human emotion perception scores.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Speech Emotion Recognition Using Convolution Neural Networks and Multi-Head Convolutional Transformer
    Ullah, Rizwan
    Asif, Muhammad
    Shah, Wahab Ali
    Anjam, Fakhar
    Ullah, Ibrar
    Khurshaid, Tahir
    Wuttisittikulkij, Lunchakorn
    Shah, Shashi
    Ali, Syed Mansoor
    Alibakhshikenari, Mohammad
    [J]. SENSORS, 2023, 23 (13)
  • [42] Temporal Feedback Convolutional Recurrent Neural Networks for Speech Command Recognition
    Kim, Taejun
    Nam, Juhan
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 437 - 441
  • [43] An Experimental Study of Speech Emotion Recognition Based on Deep Convolutional Neural Networks
    Zheng, W. Q.
    Yu, J. S.
    Zou, Y. X.
    [J]. 2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 827 - 831
  • [44] Convolutional Neural Networks for Speech Recognition
    Abdel-Hamid, Ossama
    Mohamed, Abdel-Rahman
    Jiang, Hui
    Deng, Li
    Penn, Gerald
    Yu, Dong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1533 - 1545
  • [45] Speech Emotion Recognition in Neurological Disorders Using Convolutional Neural Network
    Zisad, Sharif Noor
    Hossain, Mohammad Shahadat
    Andersson, Karl
    [J]. BRAIN INFORMATICS, BI 2020, 2020, 12241 : 287 - 296
  • [46] Facial emotion recognition using convolutional neural networks (FERC)
    Ninad Mehendale
    [J]. SN Applied Sciences, 2020, 2
  • [47] DEEP CONVOLUTIONAL RECURRENT NEURAL NETWORK WITH ATTENTION MECHANISM FOR ROBUST SPEECH EMOTION RECOGNITION
    Huang, Che-Wei
    Narayanan, Shrikanth
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 583 - 588
  • [48] Facial emotion recognition using convolutional neural networks (FERC)
    Mehendale, Ninad
    [J]. SN APPLIED SCIENCES, 2020, 2 (03)
  • [49] Music emotion recognition using deep convolutional neural networks
    Li, Ting
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2024, 24 (4-5) : 3063 - 3078
  • [50] Speech emotion recognition based on improved masking EMD and convolutional recurrent neural network
    Sun, Congshan
    Li, Haifeng
    Ma, Lin
    [J]. FRONTIERS IN PSYCHOLOGY, 2023, 13