A Feature Selection Algorithm Based on Differential Evolution for English Speech Emotion Recognition

被引:0
|
作者
Yue, Liya [1 ]
Hu, Pei [2 ]
Chu, Shu-Chuan [3 ]
Pan, Jeng-Shyang [3 ,4 ]
机构
[1] Nanyang Inst Technol, Fanli Business Sch, Nanyang 473004, Peoples R China
[2] Nanyang Inst Technol, Sch Comp & Software, Nanyang 473004, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[4] Chaoyang Univ Technol, Dept Informat Management, Taichung, Taiwan
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 22期
关键词
speech emotion recognition; feature selection; differential evolution; mutation; STRESS RECOGNITION; PSO;
D O I
10.3390/app132212410
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The automatic identification of emotions from speech holds significance in facilitating interactions between humans and machines. To improve the recognition accuracy of speech emotion, we extract mel-frequency cepstral coefficients (MFCCs) and pitch features from raw signals, and an improved differential evolution (DE) algorithm is utilized for feature selection based on K-nearest neighbor (KNN) and random forest (RF) classifiers. The proposed multivariate DE (MDE) adopts three mutation strategies to solve the slow convergence of the classical DE and maintain population diversity, and employs a jumping method to avoid falling into local traps. The simulations are conducted on four public English speech emotion datasets: eNTERFACE05, Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), Surrey Audio-Visual Expressed Emotion (SAEE), and Toronto Emotional Speech Set (TESS), and they cover a diverse range of emotions. The MDE algorithm is compared with PSO-assisted biogeography-based optimization (BBO_PSO), DE, and the sine cosine algorithm (SCA) on emotion recognition error, number of selected features, and running time. From the results obtained, MDE obtains the errors of 0.5270, 0.5044, 0.4490, and 0.0420 in eNTERFACE05, RAVDESS, SAVEE, and TESS based on the KNN classifier, and the errors of 0.4721, 0.4264, 0.3283 and 0.0114 based on the RF classifier. The proposed algorithm demonstrates excellent performance in emotion recognition accuracy, and it finds meaningful acoustic features from MFCCs and pitch.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A novel feature selection method for speech emotion recognition
    Ozseven, Turgut
    [J]. APPLIED ACOUSTICS, 2019, 146 : 320 - 326
  • [22] A DFE-based algorithm for feature selection in speech recognition
    delaTorre, A
    Peinado, AM
    Rubio, AJ
    Sanchez, V
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1519 - 1522
  • [23] A Salient Feature Extraction Algorithm for Speech Emotion Recognition
    Liang, Ruiyu
    Tao, Huawei
    Tang, Guichen
    Wang, Qingyun
    Zhao, Li
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (09): : 1715 - 1718
  • [24] Contemporary Stochastic Feature Selection Algorithms for Speech-based Emotion Recognition
    Sidorov, Maxim
    Brester, Christina
    Schmitt, Alexander
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2699 - 2703
  • [25] Adaptive emotion recognition in speech by feature selection based on KL-divergence
    Noda, Tetsuya
    Yano, Yoshikazu
    Doki, Shinji
    Okuma, Shigeru
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 1921 - +
  • [26] Speech feature selection and emotion recognition based on weighted binary cuckoo search
    Zhang, Zicheng
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (01) : 1499 - 1507
  • [27] A modified feature selection method based on metaheuristic algorithms for speech emotion recognition
    Yildirim, Serdar
    Kaya, Yasin
    Kilic, Fatih
    [J]. APPLIED ACOUSTICS, 2021, 173
  • [28] Speech Emotion Recognition Based on Feature Fusion
    Shen, Qi
    Chen, Guanggen
    Chang, Lin
    [J]. PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, MACHINERY AND ENERGY ENGINEERING (MSMEE 2017), 2017, 123 : 1071 - 1074
  • [29] Evolutionary feature selection for emotion recognition in multilingual speech analysis
    Brester, Christina
    Semenkin, Eugene
    Kovalev, Igor
    Zelenkov, Pavel
    Sidorov, Maxim
    [J]. 2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 2406 - 2411
  • [30] A Hierarchical Approach with Feature Selection for Emotion Recognition from Speech
    Giannoulis, Panagiotis
    Potamianos, Gerasimos
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1203 - 1206