Speech Emotion Recognition Using Unsupervised Feature Selection Algorithms

被引:9
|
作者
Bandela, Surekha Reddy [1 ]
Kumar, T. Kishore [1 ]
机构
[1] Natl Inst Technol Warangal, Dept ECE, Warangal, Andhra Pradesh, India
关键词
Speech Emotion Recognition (SER); INTERSPEECH Paralinguistic Feature Set; GTCC; feature selection; feature optimization; FSASL; UFSOL; SuFS;
D O I
10.13164/re.2020.0353
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The use of the combination of different speech features is a common practice to improve the accuracy of Speech Emotion Recognition (SER). Sometimes, this leads to an abrupt increase in the processing time and some of these features contribute less to emotion recognition often resulting in an incorrect prediction of emotion due to which the accuracy of the SER system decreases substantially. Hence, there is a need to select the appropriate feature set that can contribute significantly to emotion recognition. This paper presents the use of Feature Selection with Adaptive Structure Learning (FSASL) and Unsupervised Feature Selection with Ordinal Locality (UFSOL) algorithms for feature dimension reduction to improve SER performance with reduced feature dimension. A novel Subset Feature Selection (SuFS) algorithm is proposed to reduce further the feature dimension and achieve a comparable better accuracy when used along with the FSASL and UFSOL algorithms. 1582 INTERSPEECH 2010 Paralinguistic, 20 Gammatone Cepsfral Coefficients and Support Vector Machine classifier with 10-Fold Cross-Validation and Hold-Out Validation are considered in this work. The EMO-DB and IEMOCAP databases are used to evaluate the performance of the proposed SER system in terms of classification accuracy and computational time. From the result analysis, it is evident that the proposed SER system outperforms the existing ones.
引用
收藏
页码:353 / 364
页数:12
相关论文
共 50 条
  • [31] ENSEMBLE FEATURE SELECTION FOR DOMAIN ADAPTATION IN SPEECH EMOTION RECOGNITION
    Abdelwahab, Mohammed
    Busso, Carlos
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5000 - 5004
  • [32] Speech Emotion Recognition Using Speech Feature and Word Embedding
    Atmaja, Bagus Tris
    Shirai, Kiyoaki
    Akagi, Masato
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 519 - 523
  • [33] Speech emotion recognition using amplitude modulation parameters and a combined feature selection procedure
    Mencattini, Arianna
    Martinelli, Eugenio
    Costantini, Giovanni
    Todisco, Massimiliano
    Basile, Barbara
    Bozzali, Marco
    Di Natale, Corrado
    KNOWLEDGE-BASED SYSTEMS, 2014, 63 : 68 - 81
  • [34] An optimal two stage feature selection for speech emotion recognition using acoustic features
    Kuchibhotla S.
    Vankayalapati H.D.
    Anne K.R.
    International Journal of Speech Technology, 2016, 19 (4) : 657 - 667
  • [35] Multi-Stage Recognition of Speech Emotion Using Sequential Forward Feature Selection
    Liogiene, Tatjana
    Tamulevicius, Gintautas
    ELECTRICAL CONTROL AND COMMUNICATION ENGINEERING, 2016, 10 (01) : 35 - 41
  • [36] A Hybrid Meta-Heuristic Feature Selection Method Using Golden Ratio and Equilibrium Optimization Algorithms for Speech Emotion Recognition
    Dey, Arijit
    Chattopadhyay, Soham
    Singh, Pawan Kumar
    Ahmadian, Ali
    Ferrara, Massimiliano
    Sarkar, Ram
    IEEE ACCESS, 2020, 8 : 200953 - 200970
  • [37] Emotion Recognition Related to Stock Trading Using Machine Learning Algorithms With Feature Selection
    Torres, Edgar P.
    Torres, Edgar Alejandro
    Hernandez-Alvarez, Myriam
    Yoo, Sang Guun
    IEEE ACCESS, 2020, 8 (08): : 199719 - 199732
  • [38] Unsupervised domain adaptation for speech emotion recognition using PCANet
    Huang, Zhengwei
    Xue, Wentao
    Mao, Qirong
    Zhan, Yongzhao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (05) : 6785 - 6799
  • [39] Unsupervised domain adaptation for speech emotion recognition using PCANet
    Zhengwei Huang
    Wentao Xue
    Qirong Mao
    Yongzhao Zhan
    Multimedia Tools and Applications, 2017, 76 : 6785 - 6799
  • [40] Speech emotion recognition using a novel feature set
    Yang, J. (jsjyj0801@163.com), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):