Deep Learning for Emotional Speech Recognition

被引:0
|
作者
Alhamada, M., I [1 ]
Khalifa, O. O. [1 ]
Abdalla, A. H.
机构
[1] Int Islamic Univ Malaysia, Fac Engn, Elect & Comp Engn, Kuala Lumpur, Malaysia
关键词
D O I
10.1063/5.0032381
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotion speech recognition is a developing field in machine learning. The main purpose of this field is to produce a convenient system that is able to effortlessly communicate and interact with humans. The reliability of the current speech emotion recognition systems is far from being achieved. However, this is a challenging task due to the gap between acoustic features and human emotions, which rely strongly on the discriminative acoustic features extracted for a given recognition task. The speech signals were process with information which is divided into two main categories, linguistic and paralinguistic; emotions belong to the latter tree. The aim of this work is to develop a system that can understand paralinguistic information for paramount better human-machine interactions. A different extracted features like MFCC as well as feature classifications methods like FIMM, GMM, LTSTM and ANN were used. In this paper, an improved architecture of CNN for speech emotion recognition were implemented. The main fmding that the proposed CNN model achieved 93.96% accuracy rate in detecting emotions.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Dysarthric Speech Recognition Based on Deep Metric Learning
    Takashima, Yuki
    Takashima, Ryoichi
    Takiguchi, Tetsuya
    Ariki, Yasuo
    [J]. INTERSPEECH 2020, 2020, : 4796 - 4800
  • [32] Ensemble deep learning with HuBERT for speech emotion recognition
    Yang, Janghoon
    [J]. 2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 153 - 154
  • [33] Classical and Deep Learning Methods for Speech Command Recognition
    Xie, Jie
    Li, Qijing
    Hu, Kai
    Zhu, Mingying
    [J]. 2021 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2021), 2021, : 41 - 45
  • [34] Evaluating deep learning architectures for Speech Emotion Recognition
    Fayek, Haytham M.
    Lech, Margaret
    Cavedon, Lawrence
    [J]. NEURAL NETWORKS, 2017, 92 : 60 - 68
  • [35] Applications of Deep Learning Approaches in Speech Recognition: A Survey
    Al-Janabi, Sameer I. Ali
    Lateef, Ali Azawii Abdul
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION NETWORKS (ICCCN 2021), 2022, 394 : 189 - 196
  • [36] SPEECH EMOTION RECOGNITION-A DEEP LEARNING APPROACH
    Asiya, U. A.
    Kiran, V. K.
    [J]. PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 867 - 871
  • [37] On Comparison of Deep Learning Architectures for Distant Speech Recognition
    Sustika, Rika
    Yuliani, Asri R.
    Zaenudin, Efendi
    Pardede, Hilman F.
    [J]. 2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 17 - 21
  • [38] Lightweight Deep Learning Framework for Speech Emotion Recognition
    Akinpelu, Samson
    Viriri, Serestina
    Adegun, Adekanmi
    [J]. IEEE ACCESS, 2023, 11 : 77086 - 77098
  • [39] Deep Learning Techniques for Speech Emotion Recognition : A Review
    Pandey, Sandeep Kumar
    Shekhawat, H. S.
    Prasanna, S. R. M.
    [J]. 2019 29TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2019, : 197 - 202
  • [40] Controlling the emotional expressiveness of synthetic speech: a deep learning approach
    Tits, Noe
    [J]. 4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2022, 20 (01): : 165 - 166