Long-Time Speech Emotion Recognition Using Feature Compensation and Accentuation-Based Fusion

被引:1
|
作者
Sun, Jiu [1 ]
Zhu, Jinxin [1 ]
Shao, Jun [1 ]
机构
[1] Yancheng Inst Technol, Sch Informat Technol, Yancheng 224051, Peoples R China
关键词
Speech emotion recognition; Feature compensation; Long-time emotion recognition; Accentuation-based fusion;
D O I
10.1007/s00034-023-02480-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we study the speech emotion feature optimization using stochastic optimization algorithms, and feature compensation using deep neural networks. We also proposed to use accentuation-based fusion for long-time speech emotion recognition. Firstly, the extraction method of emotional features is studied, and a series of speech features are constructed for the recognition of emotion. Secondly, we propose a method of sample adaptation through denoising autoencoder to enhance the versatility of features through the mapping of sample features to improve adaptive ability. Thirdly, GA and SFLA are used to optimize the combination of features to improve the emotion recognition results at the utterance level. Finally, we use transformer model to implement accentuation-based emotion fusion in long-time speech. The continuous long-time speech corpus, as well as the public available EMO-DB, are used for experiments. Results show that the proposed method can effectively improve the performance of long-time speech emotion recognition.
引用
收藏
页码:916 / 940
页数:25
相关论文
共 50 条
  • [31] Speech emotion recognition based on multi-feature and multi-lingual fusion
    Wang, Chunyi
    Ren, Ying
    Zhang, Na
    Cui, Fuwei
    Luo, Shiying
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (04) : 4897 - 4907
  • [32] Graph-Based Multi-Feature Fusion Method for Speech Emotion Recognition
    Liu, Xueyu
    Lin, Jie
    Wang, Chao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (16)
  • [33] Metric Learning Based Feature Representation with Gated Fusion Model for Speech Emotion Recognition
    Gao, Yuan
    Liu, JiaXing
    Wang, Longbiao
    Dang, Jianwu
    INTERSPEECH 2021, 2021, : 4503 - 4507
  • [34] Novel feature fusion method for speech emotion recognition based on multiple kernel learning
    Zhao, L. (zhaoli@seu.edu.cn), 1600, Southeast University (29):
  • [35] Speech Emotion Recognition with Fusion of Acoustic- and Linguistic-Feature-Based Decisions
    Nagase, Ryotaro
    Fukumori, Takahiro
    Yamashita, Yoichi
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 725 - 730
  • [36] Speech Emotion Recognition Using Multihead Attention in Both Time and Feature Dimensions
    Xie, Yue
    Liang, Ruiyu
    Liang, Zhenlin
    Zhao, Xiaoyan
    Zeng, Wenhao
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (05) : 1098 - 1101
  • [37] EEG based emotion recognition using fusion feature extraction method
    Qiang Gao
    Chu-han Wang
    Zhe Wang
    Xiao-lin Song
    En-zeng Dong
    Yu Song
    Multimedia Tools and Applications, 2020, 79 : 27057 - 27074
  • [38] EEG based emotion recognition using fusion feature extraction method
    Gao, Qiang
    Wang, Chu-han
    Wang, Zhe
    Song, Xiao-lin
    Dong, En-zeng
    Song, Yu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) : 27057 - 27074
  • [39] Research on Feature Fusion Speech Emotion Recognition Technology for Smart Teaching
    Zhang, Shaoyun
    Li, Chao
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [40] An Investigation of a Feature-Level Fusion for Noisy Speech Emotion Recognition
    Sekkate, Sara
    Khalil, Mohammed
    Adib, Abdellah
    Ben Jebara, Sofia
    COMPUTERS, 2019, 8 (04)