Deep Learning for Emotional Speech Recognition

被引:0
|
作者
Alhamada, M., I [1 ]
Khalifa, O. O. [1 ]
Abdalla, A. H.
机构
[1] Int Islamic Univ Malaysia, Fac Engn, Elect & Comp Engn, Kuala Lumpur, Malaysia
关键词
D O I
10.1063/5.0032381
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotion speech recognition is a developing field in machine learning. The main purpose of this field is to produce a convenient system that is able to effortlessly communicate and interact with humans. The reliability of the current speech emotion recognition systems is far from being achieved. However, this is a challenging task due to the gap between acoustic features and human emotions, which rely strongly on the discriminative acoustic features extracted for a given recognition task. The speech signals were process with information which is divided into two main categories, linguistic and paralinguistic; emotions belong to the latter tree. The aim of this work is to develop a system that can understand paralinguistic information for paramount better human-machine interactions. A different extracted features like MFCC as well as feature classifications methods like FIMM, GMM, LTSTM and ANN were used. In this paper, an improved architecture of CNN for speech emotion recognition were implemented. The main fmding that the proposed CNN model achieved 93.96% accuracy rate in detecting emotions.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Deep Learning for Emotional Speech Recognition
    Sanchez-Gutierrez, Maximo E.
    Marcelo Albornoz, E.
    Martinez-Licona, Fabiola
    Leonardo Rufiner, H.
    Goddard, John
    [J]. PATTERN RECOGNITION, MCPR 2014, 2014, 8495 : 311 - +
  • [2] Deep Learning Analysis Models for Speech and Emotional Recognition
    Wu, Jun
    Zhu, Tianliang
    Yu, Chengtian
    Wang, Chunzhi
    Zhou, Xianjing
    Liu, Hu
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1541 - 1545
  • [3] Emotional speech Recognition using CNN and Deep learning techniques
    Hema, C.
    Marquez, Fausto Pedro Garcia
    [J]. APPLIED ACOUSTICS, 2023, 211
  • [4] Emotional Climate Recognition in Interactive Conversational Speech Using Deep Learning
    Alhussein, Ghada
    Alkhodari, Mohanad
    Khandokher, Ahsan
    Hadjileontiadis, Leontios J.
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH (IEEE ICDH 2022), 2022, : 96 - 103
  • [5] Speech Emotion Recognition with Deep Learning
    Harar, Pavol
    Burget, Radim
    Dutta, Malay Kishore
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 137 - 140
  • [6] Speech Recognition using Deep Learning
    Lakkhanawannakun, Phoemporn
    Noyunsan, Chaluemwut
    [J]. 2019 34TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2019), 2019, : 514 - 517
  • [7] Emotional Speech Recognition Using Deep Neural Networks
    Trinh Van, Loan
    Dao Thi Le, Thuy
    Le Xuan, Thanh
    Castelli, Eric
    [J]. SENSORS, 2022, 22 (04)
  • [8] An Analysis of Emotional Speech Recognition for Tamil Language Using Deep Learning Gate Recurrent Unit
    Fernandes, Bennilo
    Mannepalli, Kasiprasad
    [J]. PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2021, 29 (03): : 1937 - 1961
  • [9] Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech
    Wang, Shijun
    Gudnason, Jon
    Borth, Damian
    [J]. INTERSPEECH 2023, 2023, : 351 - 355
  • [10] Korean speech recognition using deep learning
    Lee, Suji
    Han, Seokjin
    Park, Sewon
    Lee, Kyeongwon
    Lee, Jaeyong
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (02) : 213 - 227