Emotion Recognition in Speech Using MFCC with SVM, DSVM and Auto-encoder

被引:0
|
作者
Aouani, Hadhami [1 ]
Ben Ayed, Yassine [2 ]
机构
[1] ISIMS Univ Sfax, Higher Inst Comp Sci & Multimedia, Sfax, Tunisia
[2] MIRACL Univ Sfax, Multimedia InfoRmat Syst & Adv Comp Lab, Sfax, Tunisia
关键词
Emotion recognition; MFCC; SVM; Deep Support Vector Machine; Basic auto-encoder; Stacked Auto-encoder;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotions recognition from speech is one of the most important sub domains in the field of signal processing. In this work, our system is a two-stage approach, namely feature extraction and classification engine. Firstly, two sets of feature are investigated which are: 39 Mel-frequency Cepstral Coefficient (MFCC) coefficients and 65 MFCC features extracted based on the work of [20]. Secondly, we use the Support Vector Machine (SVM) as the main classifier engine since it is the most common technique in the field of speech recognition. Besides that, we investigate the importance of the recent advances in machine learning including the deep kernel learning, as well as the various types of auto-encoder (the basic auto-encoder and the stacked auto-encoder). A large set of experiments are conducted on the SAVEE audio database. The experimental results show that DSVM method outperforms the standard SVM with a classification rate of 69.84% and 68.25% using 39 MFCC, respectively. Additionally, the auto-encoder method outperforms the standard SVM, yielding a classification rate of 73.01%.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Bunch graph based dimensionality reduction using auto-encoder for character recognition
    Bhadoria, Robin Singh
    Samanta, Sovan
    Pathak, Yadhunath
    Shukla, Piyush Kumar
    Zubi, Ahmad Ali
    Kaur, Manjit
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 32093 - 32115
  • [42] An effective approach for the diagnosis of melanoma using the sparse auto-encoder for features detection and the SVM for classification
    Zghal, Nadia Smaoui
    Kallel, Imene Khanfir
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,
  • [43] Ultrasound-Based Silent Speech Interface using Sequential Convolutional Auto-encoder
    Xu, Kele
    Wu, Yuxiang
    Gao, Zhifeng
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2194 - 2195
  • [44] Multi-Modal Domain Adaptation Variational Auto-encoder for EEG-Based Emotion Recognition
    Wang, Yixin
    Qiu, Shuang
    Li, Dan
    Du, Changde
    Lu, Bao-Liang
    He, Huiguang
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (09) : 1612 - 1626
  • [45] RECOGNITION OF SAR TARGET BASED ON MULTILAYER AUTO-ENCODER AND SNN
    Sun, Zhijun
    Xue, Lei
    Xu, Yangming
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2013, 9 (11): : 4331 - 4341
  • [46] Speaker Dependent Speech Emotion Recognition using MFCC and Support Vector Machine
    Dahake, Prajakta P.
    Shaw, Kailash
    Malathi, P.
    2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 1080 - 1084
  • [47] Robust Variational Auto-Encoder for Radar HRRP Target Recognition
    Zhai Y.
    Chen B.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (06): : 1149 - 1155
  • [48] Robust Variational Auto-Encoder for Radar HRRP Target Recognition
    Zhai, Ying
    Chen, Bo
    Zhang, Hao
    Wang, Zhengjue
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, ISCIDE 2017, 2017, 10559 : 356 - 367
  • [49] Local Feature Hashing With Binary Auto-Encoder for Face Recognition
    Chen, Jing
    Zu, Yunxiao
    IEEE ACCESS, 2020, 8 : 37526 - 37540
  • [50] Study on Image Recognition Based on Stacked Sparse Auto-encoder
    Cao, Gui-Ming
    Ding, Xiang-Qian
    Gong, Hui-Li
    PROCEEDINGS OF THE 3RD ANNUAL INTERNATIONAL CONFERENCE ON ELECTRONICS, ELECTRICAL ENGINEERING AND INFORMATION SCIENCE (EEEIS 2017), 2017, 131 : 372 - 378