Feature Fusion of Speech Emotion Recognition Based on Deep Learning

被引:0
|
作者
Liu, Gang [1 ]
He, Wei [1 ]
Jin, Bicheng [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Pattern Recognit & Intelligent Syst Lab, Sch Informat & Commun Engn, Beijing, Peoples R China
关键词
Feature fusion; Hyper-prosodic features; Spectrogram; SER; Deep learning;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech emotion recognition (SER) is a hot topic in academia. One of the key issues in improving the performance of SER systems is the choice of speech emotion features. In order to establish a robust speech emotion recognition system, it is essential to select the features which can be a perfect representation of speech emotion attributes. Researchers has done a lot of work, proposed a variety of emotional features and made great progress. Although each kind of features were proven to be effective, most of methods are based on a single type. In this paper, we proposed a method of feature fusion based on deep learning, combining spectral-based features and pitch-based hyper-prosodic features. The experiments show that this method improves the performance of speech emotion recognition system.
引用
收藏
页码:193 / 197
页数:5
相关论文
共 50 条
  • [31] Multimodal emotion recognition from facial expression and speech based on feature fusion
    Tang, Guichen
    Xie, Yue
    Li, Ke
    Liang, Ruiyu
    Zhao, Li
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) : 16359 - 16373
  • [32] Multimodal emotion recognition from facial expression and speech based on feature fusion
    Guichen Tang
    Yue Xie
    Ke Li
    Ruiyu Liang
    Li Zhao
    [J]. Multimedia Tools and Applications, 2023, 82 : 16359 - 16373
  • [33] Teager_Mel and PLP Fusion Feature Based Speech Emotion Recognition
    Chen, Xiao
    Li, Haifeng
    Ma, Lin
    Liu, Xinlei
    Chen, Jing
    [J]. 2015 FIFTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2015, : 1109 - 1114
  • [34] Feature fusion methods research based on deep belief networks for speech emotion recognition under noise condition
    Yongming Huang
    Kexin Tian
    Ao Wu
    Guobao Zhang
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2019, 10 : 1787 - 1798
  • [35] Feature fusion methods research based on deep belief networks for speech emotion recognition under noise condition
    Huang, Yongming
    Tian, Kexin
    Wu, Ao
    Zhang, Guobao
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (05) : 1787 - 1798
  • [36] Emotion Recognition in Speech with Deep Learning Architectures
    Erdal, Mehmet
    Kaechele, Markus
    Schwenker, Friedhelm
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, 2016, 9896 : 298 - 311
  • [37] Speech Emotion Recognition Using Deep Learning
    Alagusundari, N.
    Anuradha, R.
    [J]. ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 313 - 325
  • [38] Speech Emotion Recognition Using Deep Learning
    Ahmed, Waqar
    Riaz, Sana
    Iftikhar, Khunsa
    Konur, Savas
    [J]. ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
  • [39] Music Emotion Recognition Based on Feature Fusion Broad Learning Method
    郁进明
    张晨光
    海涵
    [J]. Journal of Donghua University(English Edition), 2023, 40 (03) : 343 - 350
  • [40] A Feature Fusion Model with Data Augmentation for Speech Emotion Recognition
    Tu, Zhongwen
    Liu, Bin
    Zhao, Wei
    Yan, Raoxin
    Zou, Yang
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (07):