Exploration of Complementary Features for Speech Emotion Recognition Based on Kernel Extreme Learning Machine

被引:44
|
作者
Guo, Lili [1 ]
Wang, Longbiao [1 ]
Dang, Jianwu [1 ,2 ]
Liu, Zhilei [1 ]
Guan, Haotian [3 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin Key Lab Cognit Comp & Applicat, Tianjin 300350, Peoples R China
[2] Japan Adv Inst Sci & Technol, Nomi, Ishikawa 9231292, Japan
[3] Huiyan Technol Tianjin Co Ltd, Tianjin 300384, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Speech emotion recognition; auditory-based features; spectrogram-based features; complementary features; kernel extreme learning machine; NEURAL-NETWORK; CLASSIFICATION;
D O I
10.1109/ACCESS.2019.2921390
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous studies of speech emotion recognition using either empirical features (e.g., F0, energy, and voice probability) or spectrogram-based statistical features. The empirical features can highlight the human knowledge of emotion recognition, while the statistical features enable a general representation, but they do not emphasize human knowledge sufficiently. However, the use of these two kinds of features together can complement some features that may be unconsciously used by humans in daily life but have not been realized yet. Based on this consideration, this paper proposes a dynamic fusion framework to utilize the potential advantages of the complementary spectrogram-based statistical features and the auditory-based empirical features. In addition, a kernel extreme learning machine (KELM) is adopted as the classifier to distinguish emotions. To validate the proposed framework, we conduct experiments on two public emotional databases, including Emo-DB and IEMOCAP databases. The experimental results demonstrate that the proposed fusion framework significantly outperforms the existing state-of-the-art methods. The results also show that the proposed method, by integrating the auditory-based features with spectrogram-based features, could achieve a notably improved performance over the conventional methods.
引用
收藏
页码:75798 / 75809
页数:12
相关论文
共 50 条
  • [1] SER: Speech Emotion Recognition Application Based on Extreme Learning Machine
    Ainurrochman
    Febriansyah, Irfanur Ilham
    Yuhana, Umi Laili
    [J]. PROCEEDINGS OF 2021 13TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2021, : 179 - 183
  • [2] A FEATURE FUSION METHOD BASED ON EXTREME LEARNING MACHINE FOR SPEECH EMOTION RECOGNITION
    Guo, Lili
    Wang, Longbiao
    Dang, Jianwu
    Zhang, Linjuan
    Guan, Haotian
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2666 - 2670
  • [3] Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition
    Xu, Xinzhou
    Deng, Jun
    Coutinho, Eduardo
    Wu, Chen
    Zhao, Li
    Schuller, Bjoern W.
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (03) : 795 - 808
  • [4] Relative Entropy Normalized Gaussian Supervector for Speech Emotion Recognition using Kernel Extreme Learning Machine
    Li, Ruru
    Yang, Dali
    Li, Xinxing
    Wang, Renyu
    Xu, Mingxing
    Zheng, Thomas Fang
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [5] Speech emotion recognition based on feature selection and extreme learning machine decision tree
    Liu, Zhen-Tao
    Wu, Min
    Cao, Wei-Hua
    Mao, Jun-Wei
    Xu, Jian-Ping
    Tan, Guan-Zheng
    [J]. NEUROCOMPUTING, 2018, 273 : 271 - 280
  • [6] Speech based Emotion Recognition using Machine Learning
    Deshmukh, Girija
    Gaonkar, Apurva
    Golwalkar, Gauri
    Kulkarni, Sukanya
    [J]. PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 812 - 817
  • [7] Speech Emotion Recognition Using Deep Neural Network and Extreme Learning Machine
    Han, Kun
    Yu, Dong
    Tashev, Ivan
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 223 - 227
  • [8] Speech Emotion Recognition Based on Deep Learning and Kernel Nonlinear PSVM
    Han Zhiyan
    Wang Jian
    [J]. PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 1426 - 1430
  • [9] Investigating voice features for Speech emotion recognition based on four kinds of machine learning methods
    Chen, Haiyan
    Liu, Zheng
    Kang, Xin
    Nishide, Shun
    Ren, Fuji
    [J]. PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 195 - 199
  • [10] A Subset of Acoustic Features for Machine Learning-based and Statistical Approaches in Speech Emotion Recognition
    Costantini, Giovanni
    Cesarini, Valerio
    Casali, Daniele
    [J]. BIOSIGNALS: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL 4: BIOSIGNALS, 2022, : 257 - 264