Speech Emotion Recognition Based on Robust Discriminative Sparse Regression

被引:18
|
作者
Song, Peng [1 ]
Zheng, Wenming [2 ]
Yu, Yanwei [3 ]
Ou, Shifeng [4 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China
[2] Southeast Univ, Res Ctr Learning Sci, Key Lab Child Dev & Learning Sci, Minist Educ, Nanjing 210096, Peoples R China
[3] Ocean Univ China, Dept Comp Sci & Technol, Qingdao 266100, Peoples R China
[4] Yantai Univ, Sch Optoelect Informat Sci & Technol, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Speech recognition; Emotion recognition; Prediction algorithms; Signal processing algorithms; Task analysis; Linear regression; Feature selection; graph Laplacian; regression analysis; semi-supervised learning; speech emotion recognition; FEATURE-SELECTION; LINEAR-REGRESSION; FACE RECOGNITION; NEURAL-NETWORK; CLASSIFICATION; FEATURES; SIGNALS;
D O I
10.1109/TCDS.2020.2990928
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech emotion recognition has recently attracted much interest due to the widespread of multimedia data. It generally involves two basic problems: 1) feature extraction and 2) emotion classification. Most previous algorithms just focus on solving one of these two problems. In this article, we aim to deal with these two problems in a joint learning framework, and present a novel regression algorithm, namely, robust discriminative sparse regression (RDSR). In RDSR, we propose a sparse regression algorithm to make our model robust to outliers and noises, and introduce a feature selection regularization constraint simultaneously to select the most discriminative and relevant features. In addition, to well predict the labels, we exploit the local and global consistency over labels, and incorporate it into the proposed framework. To solve the objective function of RDSR, we design an efficient alternative optimization algorithm. Finally, experimental results on several public emotion data sets verify the effectiveness and the superiority of our proposed method.
引用
收藏
页码:343 / 353
页数:11
相关论文
共 50 条
  • [1] Speech Emotion Recognition Based on Sparse Representation
    Yan, Jingjie
    Wang, Xiaolan
    Gu, Weiyi
    Ma, Lili
    [J]. ARCHIVES OF ACOUSTICS, 2013, 38 (04) : 465 - 470
  • [2] Robust emotion recognition in noisy speech via sparse representation
    Zhao, Xiaoming
    Zhang, Shiqing
    Lei, Bicheng
    [J]. NEURAL COMPUTING & APPLICATIONS, 2014, 24 (7-8): : 1539 - 1553
  • [3] Robust emotion recognition in noisy speech via sparse representation
    Xiaoming Zhao
    Shiqing Zhang
    Bicheng Lei
    [J]. Neural Computing and Applications, 2014, 24 : 1539 - 1553
  • [4] Speech Emotion Recognition with Discriminative Feature Learning
    Zhou, Huan
    Liu, Kai
    [J]. INTERSPEECH 2020, 2020, : 4094 - 4097
  • [5] Discriminative Feature Learning for Speech Emotion Recognition
    Zhang, Yuying
    Zou, Yuexian
    Peng, Junyi
    Luo, Danqing
    Huang, Dongyan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV, 2019, 11730 : 198 - 210
  • [6] Discriminative auditory-based features for robust speech recognition
    Mak, BKW
    Tam, YC
    Li, PQ
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (01): : 27 - 36
  • [7] Speech Emotion Recognition Based on Transfer Emotion-Discriminative Features Subspace Learning
    Zhang, Kexin
    Liu, Yunxiang
    [J]. IEEE ACCESS, 2023, 11 : 56336 - 56343
  • [8] Towards Robust Speech-Based Emotion Recognition
    Tabatabaei, Talieh S.
    Krishnan, Sridhar
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [9] Speech Emotion Recognition Based on Sparse Transfer Learning Method
    Song, Peng
    Zheng, Wenming
    Liang, Ruiyu
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (07) : 1409 - 1412
  • [10] Discriminative auditory features for robust speech recognition
    Mak, B
    Tam, YC
    Li, Q
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 381 - 384