Deep Learning for Emotional Speech Recognition

被引:0
|
作者
Alhamada, M., I [1 ]
Khalifa, O. O. [1 ]
Abdalla, A. H.
机构
[1] Int Islamic Univ Malaysia, Fac Engn, Elect & Comp Engn, Kuala Lumpur, Malaysia
关键词
D O I
10.1063/5.0032381
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotion speech recognition is a developing field in machine learning. The main purpose of this field is to produce a convenient system that is able to effortlessly communicate and interact with humans. The reliability of the current speech emotion recognition systems is far from being achieved. However, this is a challenging task due to the gap between acoustic features and human emotions, which rely strongly on the discriminative acoustic features extracted for a given recognition task. The speech signals were process with information which is divided into two main categories, linguistic and paralinguistic; emotions belong to the latter tree. The aim of this work is to develop a system that can understand paralinguistic information for paramount better human-machine interactions. A different extracted features like MFCC as well as feature classifications methods like FIMM, GMM, LTSTM and ANN were used. In this paper, an improved architecture of CNN for speech emotion recognition were implemented. The main fmding that the proposed CNN model achieved 93.96% accuracy rate in detecting emotions.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Deep learning for Depression Recognition from Speech
    Tian, Han
    Zhu, Zhang
    Jing, Xu
    [J]. MOBILE NETWORKS & APPLICATIONS, 2023,
  • [22] Deep Learning of Attitude in Children's Emotional Speech
    Ntalampiras, Stavros
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA 2020), 2020,
  • [23] Using learning automata in brain emotional learning for speech emotion recognition
    Farhoudi Z.
    Setayeshi S.
    Rabiee A.
    [J]. International Journal of Speech Technology, 2017, 20 (3) : 553 - 562
  • [24] DISTRIBUTED DEEP LEARNING STRATEGIES FOR AUTOMATIC SPEECH RECOGNITION
    Zhang, Wei
    Cui, Xiaodong
    Finkler, Ulrich
    Kingsbury, Brian
    Saon, George
    Kung, David
    Picheny, Michael
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5706 - 5710
  • [25] Recognition of English speech - using a deep learning algorithm
    Wang, Shuyan
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [26] DEEP VARIATIONAL FILTER LEARNING MODELS FOR SPEECH RECOGNITION
    Agrawal, Purvi
    Ganapathy, Sriram
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5731 - 5735
  • [27] Survey of Deep Representation Learning for Speech Emotion Recognition
    Latif, Siddique
    Rana, Rajib
    Khalifa, Sara
    Jurdak, Raja
    Qadir, Junaid
    Schuller, Bjorn
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1634 - 1654
  • [28] Deep Learning of Speech Features for Improved Phonetic Recognition
    Lee, Jaehyung
    Lee, Soo-Young
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1256 - 1259
  • [29] Ensemble deep learning with HuBERT for speech emotion recognition
    Yang, Janghoon
    [J]. 2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 153 - 154
  • [30] Dysarthric Speech Recognition Based on Deep Metric Learning
    Takashima, Yuki
    Takashima, Ryoichi
    Takiguchi, Tetsuya
    Ariki, Yasuo
    [J]. INTERSPEECH 2020, 2020, : 4796 - 4800