Feature Fusion of Speech Emotion Recognition Based on Deep Learning

Authors
Liu, Gang [1]
He, Wei [1]
Jin, Bicheng [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Pattern Recognit & Intelligent Syst Lab, Sch Informat & Commun Engn, Beijing, Peoples R China
Keywords
Feature fusion; Hyper-prosodic features; Spectrogram; SER; Deep learning;
DOI
Not available
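Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808; 0809;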
Abstract
Speech emotion recognition (SER) is an active research topic. A key factor in improving the performance of SER systems is the choice of speech emotion features. To build a robust SER system, it is essential to select features that represent the emotional attributes of speech well. Researchers have done considerable work, proposing a variety of emotional features and making great progress. Although each kind of feature has proven effective, most methods rely on a single feature type. In this paper, we propose a feature fusion method based on deep learning that combines spectral-based features with pitch-based hyper-prosodic features. Experiments show that this method improves the performance of speech emotion recognition systems.
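The abstract does not specify the network or the exact feature definitions, so the following is only a minimal sketch of feature-level fusion under stated assumptions. It assumes that "hyper-prosodic features" can be approximated by utterance-level statistics of a pitch contour, and it replaces the paper's deep-learning encoder with simple mean and standard-deviation pooling of a log spectrogram. All function names (`log_spectrogram`, `frame_pitch`, `hyper_prosodic`, `fuse`) and parameter values are hypothetical and chosen for illustration.

```python
import numpy as np

def log_spectrogram(signal, frame_len=256, hop=128):
    """Log-magnitude spectrogram from a Hann-windowed short-time FFT."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # Shape: (n_frames, frame_len // 2 + 1)
    return np.log(np.abs(np.fft.rfft(frames, axis=1)) + 1e-8)

def frame_pitch(signal, sr, frame_len=512, hop=256, fmin=60.0, fmax=400.0):
    """Crude per-frame pitch estimate (Hz) from the autocorrelation peak."""
    lag_min = int(sr / fmax)
    lag_max = int(sr / fmin)
    pitches = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start : start + frame_len]
        frame = frame - frame.mean()
        # Keep non-negative lags only.
        ac = np.correlate(frame, frame, mode="full")[frame_len - 1:]
        lag = lag_min + int(np.argmax(ac[lag_min : lag_max + 1]))
        pitches.append(sr / lag)
    return np.array(pitches)

def hyper_prosodic(pitch):
    """Assumed stand-in: utterance-level statistics of pitch and its delta."""
    delta = np.diff(pitch)
    return np.array([
        pitch.mean(), pitch.std(), pitch.min(), pitch.max(),
        delta.mean(), delta.std(),
    ])

def fuse(spec, prosodic):
    """Feature-level fusion: pooled spectral vector concatenated with prosodic stats.

    Pooling here is a placeholder for a learned encoder (an assumption,
    not the method described in the paper).
    """
    spectral_vec = np.concatenate([spec.mean(axis=0), spec.std(axis=0)])
    return np.concatenate([spectral_vec, prosodic])

# Usage on a synthetic 200 Hz tone (1 second at 16 kHz).
sr = 16000
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 200.0 * t)
fused = fuse(log_spectrogram(tone), hyper_prosodic(frame_pitch(tone, sr)))
print(fused.shape)
```

In a real system the fused vector would feed a classifier, and the two feature streams would typically be normalized first so that neither dominates because of scale.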
Pages: 193-197
Page count: 5