EFFICIENT ARABIC EMOTION RECOGNITION USING DEEP NEURAL NETWORKS

被引:0
|
作者
Hifny, Yasser [1 ]
Ali, Ahmed [2 ]
机构
[1] Univ Helwan, Helwan, Egypt
[2] HBKU, Qatar Comp Res Inst, Doha, Qatar
关键词
Speech emotion recognition; deep neural networks (DNN); convolutional neural networks (CNN); bidirection long short-term memory (BLSTM) networks; attention; SPEECH;
D O I
10.1109/icassp.2019.8683632
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Emotion recognition from speech signal based on deep learning is an active research area. Convolutional neural networks (CNNs) may be the dominant method in this area. In this paper, we implement two neural architectures to address this problem. The first architecture is an attention-based CNN-LSTM-DNN model. In this novel architecture, the convolutional layers extract salient features and the bi-directional long short-term memory (BLSTM) layers handle the sequential phenomena of the speech signal. This is followed by an attention layer, which extracts a summary vector that is fed to the fully connected dense layer (DNN), which finally connects to a softmax output layer. The second architecture is based on a deep CNN model. The results on an Arabic speech emotion recognition task show that our innovative approach can lead to significant improvements (2.2% absolute improvements) over a strong deep CNN baseline system. On the other hand, the deep CNN models are significantly faster than the attention based CNN-LSTM-DNN models in training and classification.
引用
收藏
页码:6710 / 6714
页数:5
相关论文
共 50 条
  • [1] Emotion Recognition Using Pretrained Deep Neural Networks
    Dobes, Marek
    Sabolova, Natalia
    [J]. ACTA POLYTECHNICA HUNGARICA, 2023, 20 (04) : 195 - 204
  • [2] Visual Emotion Recognition Using Deep Neural Networks
    Iliev, Alexander I.
    Mote, Ameya
    [J]. DIGITAL PRESENTATION AND PRESERVATION OF CULTURAL AND SCIENTIFIC HERITAGE, 2022, 12 : 77 - 88
  • [3] Multimodal Emotion Recognition Using Deep Neural Networks
    Tang, Hao
    Liu, Wei
    Zheng, Wei-Long
    Lu, Bao-Liang
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 811 - 819
  • [4] Human Facial Emotion Recognition using Deep Neural Networks
    Benisha, S.
    Mirnalinee, T. T.
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (03) : 303 - 309
  • [5] Music emotion recognition using deep convolutional neural networks
    Li, Ting
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2024, 24 (4-5) : 3063 - 3078
  • [6] Arabic handwritten characters recognition using Deep Belief Neural Networks
    Elleuch, Mohamed
    Tagougui, Najiba
    Kherallah, Monji
    [J]. 2015 IEEE 12TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2015,
  • [7] Handwritten Arabic Numeral Recognition using Deep Learning Neural Networks
    Ashiquzzaman, Akm
    Tushar, Abdul Kawsar
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2017,
  • [8] Multimodal Arabic emotion recognition using deep learning
    Al Roken, Noora
    Barlas, Gerassimos
    [J]. SPEECH COMMUNICATION, 2023, 155
  • [9] Speech Emotion Recognition using Convolution Neural Networks and Deep Stride Convolutional Neural Networks
    Wani, Taiba Majid
    Gunawan, Teddy Surya
    Qadri, Syed Asif Ahmad
    Mansor, Hasmah
    Kartiwi, Mira
    Ismail, Nanang
    [J]. PROCEEDING OF 2020 6TH INTERNATIONAL CONFERENCE ON WIRELESS AND TELEMATICS (ICWT), 2020,
  • [10] A Bilingual Emotion Recognition System Using Deep Learning Neural Networks
    Absa, Ahmed H. Abo
    Deriche, M.
    Mohandes, M.
    [J]. 2018 15TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS AND DEVICES (SSD), 2018, : 1241 - 1245