Speech Emotion Recognition Based on Speech Segment Using LSTM with Attention Model

被引:0
|
作者
Atmaja, Bagus Tris [1 ,2 ]
Akagi, Masato [3 ]
机构
[1] Japan Adv Inst Sci & Tech, Nomi, Japan
[2] Inst Teknol Sepuluh Nopember, Surabaya, Indonesia
[3] Japan Adv Inst Sci & Tech JAIST, Sch Informat Sci, Nomi, Japan
关键词
voice segments; silence removal; speech emotion recognition; attention model;
D O I
10.1109/icsigsys.2019.8811080
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic speech emotion recognition has become popular as it enables natural interaction between human-machine interaction. One modality of recognizing emotion is speech. However, the speech also contains silence that may not relevant to emotion. Two ways to improve performance is by removing silence and/or paying more attention to speech segment while ignoring the silence. In this paper, we propose both, a combination of silence removal and attention model to improve speech emotion recognition performance. The results show that utilizing combination silence removal and attention model outperforms the use of either noise removal only or attention model only.
引用
收藏
页码:40 / 44
页数:5
相关论文
共 50 条
  • [1] Attention-Based Dense LSTM for Speech Emotion Recognition
    Xie, Yue
    Liang, Ruiyu
    Liang, Zhenlin
    Zhao, Li
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (07): : 1426 - 1429
  • [2] Siamese Attention-Based LSTM for Speech Emotion Recognition
    Nizamidin, Tashpolat
    Zhao, Li
    Liang, Ruiyu
    Xie, Yue
    Hamdulla, Askar
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2020, E103A (07) : 937 - 941
  • [3] Speech Emotion Recognition Based on Acoustic Segment Model
    Zheng, Siyuan
    Du, Jun
    Zhou, Hengshun
    Bai, Xue
    Lee, Chin-Hui
    Li, Shipeng
    [J]. 2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [4] Empirical Interpretation of Speech Emotion Perception with Attention Based Model for Speech Emotion Recognition
    Jalal, Md Asif
    Milner, Rosanna
    Hain, Thomas
    [J]. INTERSPEECH 2020, 2020, : 4113 - 4117
  • [5] Speech Emotion Classification Using Attention-Based LSTM
    Xie, Yue
    Liang, Ruiyu
    Liang, Zhenlin
    Huang, Chengwei
    Zou, Cairong
    Schuller, Bjoern
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1675 - 1685
  • [6] Attention-LSTM-Attention Model for Speech Emotion Recognition and Analysis of IEMOCAP Database
    Yu, Yeonguk
    Kim, Yoon-Joong
    [J]. ELECTRONICS, 2020, 9 (05):
  • [7] Attention guided 3D CNN-LSTM model for accurate speech based emotion recognition
    Atila, Orhan
    Sengur, Abdulkadir
    [J]. APPLIED ACOUSTICS, 2021, 182
  • [8] Speech Emotion Recognition using MFCC features and LSTM network
    Kumbhar, Harshawardhan S.
    Bhandari, Sheetal U.
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [9] Combined CNN LSTM with attention for speech emotion recognition based on feature-level fusion
    Liu, Yanlin
    Chen, Aibin
    Zhou, Guoxiong
    Yi, Jizheng
    Xiang, Jin
    Wang, Yaru
    [J]. Multimedia Tools and Applications, 2024, 83 (21) : 59839 - 59859
  • [10] SPEECH EMOTION RECOGNITION USING AUTOENCODER BOTTLENECK FEATURES AND LSTM
    Huang, Kun-Yi
    Wu, Chung-Hsien
    Yang, Tsung-Hsien
    Su, Ming-Hsiang
    Chou, Jia-Hui
    [J]. 2016 INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT), 2018, : 1 - 4