COMBINING FEATURE SELECTION AND REPRESENTATION FOR SPEECH EMOTION RECOGNITION

被引:0
|
作者
Han, Wenjing [1 ]
Ruan, Huabin [2 ]
Yu, Xiaojie [1 ]
Zhu, Xuan [1 ]
机构
[1] Samsung R&D Inst China Beijing SRC B, Language Comp Lab, Beijing, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
关键词
speech emotion recognition; multiple kernel learning; denoising autoencoder; feature selection; feature representation;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a feature selection and representation combination method to generate discriminative features for speech emotion recognition. In feature selection stage, a Multiple Kernel Learning (MKL) based strategy is used to obtain the optimal feature subset. Specifically, features selected at least n times among 10-fold cross validation are collected to build a new feature subset named n-subset, then the n-subset resulting in the highest classification accuracy is viewed as the optimal one. In feature representation stage, the optimal feature subset is mapped to a hidden representation using a denoising autoencoder (DAE). The model parameters are learned by minimizing the squared error between the original and the reconstructed input. The hidden representation is then used as the final feature set in the MKL model for emotion recognition. Our experimental results show significant performance improvement compared to using the original features in both of the inner-corpus and cross-corpus scenarios.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Feature representation for speech emotion Recognition
    Abdollahpour, Mehdi
    Zamani, Lafar
    Rad, Hamidreza Saligheh
    [J]. 2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1465 - 1468
  • [2] Feature selection for emotion recognition of mandarin speech
    College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
    不详
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban), 2007, 11 (1816-1822):
  • [3] Statistical feature selection for mandarin speech emotion recognition
    Xie, B
    Chen, L
    Chen, GC
    Chen, C
    [J]. ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 591 - 600
  • [4] Harmony search for feature selection in speech emotion recognition
    Tao, Yongsen
    Wang, Kunxia
    Yang, Jing
    An, Ning
    Li, Lian
    [J]. 2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 362 - 367
  • [5] Survey on discriminative feature selection for speech emotion recognition
    Xu, Xin
    Li, Ya
    Xu, Xiaoying
    Wen, Zhengqi
    Che, Hao
    Liu, Shanfeng
    Tao, Jianhua
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 345 - +
  • [6] A novel feature selection method for speech emotion recognition
    Ozseven, Turgut
    [J]. APPLIED ACOUSTICS, 2019, 146 : 320 - 326
  • [7] Evolutionary feature selection for emotion recognition in multilingual speech analysis
    Brester, Christina
    Semenkin, Eugene
    Kovalev, Igor
    Zelenkov, Pavel
    Sidorov, Maxim
    [J]. 2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 2406 - 2411
  • [8] A Hierarchical Approach with Feature Selection for Emotion Recognition from Speech
    Giannoulis, Panagiotis
    Potamianos, Gerasimos
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1203 - 1206
  • [9] Enhancing Emotion Recognition from Speech through Feature Selection
    Kostoulas, Theodoros
    Ganchev, Todor
    Lazaridis, Alexandros
    Fakotakis, Nikos
    [J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 338 - 344
  • [10] Acoustic feature selection for automatic emotion recognition from speech
    Rong, Jia
    Li, Gang
    Chen, Yi-Ping Phoebe
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2009, 45 (03) : 315 - 328