Emotion Recognition in Speech Using MFCC with SVM, DSVM and Auto-encoder

被引:0
|
作者
Aouani, Hadhami [1 ]
Ben Ayed, Yassine [2 ]
机构
[1] ISIMS Univ Sfax, Higher Inst Comp Sci & Multimedia, Sfax, Tunisia
[2] MIRACL Univ Sfax, Multimedia InfoRmat Syst & Adv Comp Lab, Sfax, Tunisia
关键词
Emotion recognition; MFCC; SVM; Deep Support Vector Machine; Basic auto-encoder; Stacked Auto-encoder;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotions recognition from speech is one of the most important sub domains in the field of signal processing. In this work, our system is a two-stage approach, namely feature extraction and classification engine. Firstly, two sets of feature are investigated which are: 39 Mel-frequency Cepstral Coefficient (MFCC) coefficients and 65 MFCC features extracted based on the work of [20]. Secondly, we use the Support Vector Machine (SVM) as the main classifier engine since it is the most common technique in the field of speech recognition. Besides that, we investigate the importance of the recent advances in machine learning including the deep kernel learning, as well as the various types of auto-encoder (the basic auto-encoder and the stacked auto-encoder). A large set of experiments are conducted on the SAVEE audio database. The experimental results show that DSVM method outperforms the standard SVM with a classification rate of 69.84% and 68.25% using 39 MFCC, respectively. Additionally, the auto-encoder method outperforms the standard SVM, yielding a classification rate of 73.01%.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Research on speech emotion recognition based on deep auto-encoder
    Wang, Fei
    Ye, Xiaofeng
    Sun, Zhaoyu
    Huang, Yujia
    Zhang, Xing
    Shang, Shengxing
    2016 IEEE INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2016, : 308 - 312
  • [2] Multimodal Emotion Recognition Method Based on Convolutional Auto-Encoder
    Zhou, Jian
    Wei, Xianwei
    Cheng, Chunling
    Yang, Qidong
    Li, Qun
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (01) : 351 - 358
  • [3] Convolutional Auto-Encoder and Adversarial Domain Adaptation for Cross-Corpus Speech Emotion Recognition
    Wang, Yang
    Fu, Hongliang
    Tao, Huawei
    Yang, Jing
    Ge, Hongyi
    Xie, Yue
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (10) : 1803 - 1806
  • [4] Multimodal Emotion Recognition Method Based on Convolutional Auto-Encoder
    Jian Zhou
    Xianwei Wei
    Chunling Cheng
    Qidong Yang
    Qun Li
    International Journal of Computational Intelligence Systems, 2018, 12 (1) : 351 - 358
  • [5] Deep Feature Learning for Tibetan Speech Recognition using Sparse Auto-encoder
    Wang, H.
    Zhao, Y.
    Liu, X. F.
    Xu, X. N.
    Wang, L.
    Zhou, N.
    Xu, Y. M.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, AUTOMATION AND MECHANICAL ENGINEERING (EAME 2015), 2015, 13 : 342 - 345
  • [6] Emotion Recognition in Speech Using MFCC and Classifiers
    Ajitha, G.
    Prashanth, Addagatla
    Radhika, Chelle
    Chaitanya, Kancharapu
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING ( ICCVBIC 2021), 2022, 1420 : 197 - 207
  • [7] Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition
    Xie, Xurong
    Ruzi, Rukiye
    Liu, Xunying
    Wang, Lan
    INTERSPEECH 2021, 2021, : 4808 - 4812
  • [8] Speech Based Human Emotion Recognition Using MFCC
    Likitha, M. S.
    Gupta, Raksha R.
    Hasitha, K.
    Raju, A. Upendra
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 2257 - 2260
  • [9] Emotion Recognition in Speech Using MFCC and Wavelet Features
    Kishore, K. V. Krishna
    Satish, P. Krishna
    PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 842 - 847
  • [10] Speech Emotion Recognition Using ANN on MFCC Features
    Dolka, Harshit
    Xavier, Arul V. M.
    Juliet, Sujitha
    ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 431 - 435