Efficient Speech Emotion Recognition Using Binary Support Vector Machines & Multiclass SVM

被引:0
|
作者
Kanth, N. Ratna [1 ]
Saraswathi, S. [2 ]
机构
[1] Pondicherry Univ, Dept CSE, Kalapet, Puducherry, India
[2] Pondicherry Engn Coll, Dept Informat Technol, Pillaichavadi, Puducherry, India
关键词
Speech Emotion Recognition; Emotional Speech Corpus; Feature Selection; Correlation Based Feature Selection; Support Vector Machine;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the construction of Binary Support Vector Machines and its significance for efficient Speech Emotion Recognition (SER). German Emotional Speech Corpus EmoDB has been used in this study. Seven Binary Support Vector Machines (SVMs) corresponding to each of the seven emotions in the EmoDB, namely Anger-Not Anger, Boredom-Not Boredom, Disgust-Not Disgust, Fear-Not Fear, Happy-Not Happy, Sad-Not Sad and Neutral-Not Neutral are constructed. Features are selected for these seven Binary SVMs using Correlation Based Feature Selection (CFS) with Sequential Forward Selection (SFS). One Multiclass SVM is also constructed. Ten fold cross validation has been used and achieved an average accuracy of 95.32% for the Binary SVMs and 62.85% for the Multiclass SVM. The seven Binary SVMs and the Multiclass SVM are fused together using a combinator algorithm. All the SVMs are run in parallel by giving the SVM specific features as input. Fused model produced an average accuracy of 92.25% for the Binary SVMs and 77.07% for the Multiclass SVM on the test set. On the same test set using the combinator algorithm, the fused model has achieved an overall accuracy of 87.86% which is a significant improvement over the accuracies achieved in the previous studies.
引用
收藏
页码:542 / 547
页数:6
相关论文
共 50 条
  • [1] Speech Emotion Recognition Using Support Vector Machines
    Yu, Caiming
    Tian, Qingxi
    Cheng, Fang
    Zhang, Shiqing
    [J]. ADVANCED RESEARCH ON COMPUTER SCIENCE AND INFORMATION ENGINEERING, PT I, 2011, 152 : 215 - 220
  • [2] On the Optimization of Multiclass Support Vector Machines Dedicated to Speech Recognition
    Mezzoudj, Freha
    Benyettou, Assia
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2012, PT II, 2012, 7664 : 1 - 8
  • [3] Bagged support vector machines for emotion recognition from speech
    Bhavan, Anjali
    Chauhan, Pankaj
    Hitkul
    Shah, Rajiv Ratn
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 184
  • [4] Speech Recognition using Support Vector Machines
    Aida-zade, Kamil
    Xocayev, Anar
    Rustamov, Samir
    [J]. 2016 IEEE 10TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2016, : 108 - 111
  • [5] Design and Evaluation of Speech based Emotion Recognition System using Support Vector Machines
    Harshini, D.
    Pranjali, B.
    Ranjitha, M.
    Rushali, J.
    Manikandan, J.
    [J]. 2019 IEEE 16TH INDIA COUNCIL INTERNATIONAL CONFERENCE (IEEE INDICON 2019), 2019,
  • [6] Visual speech recognition using support vector machines
    Gordan, M
    Kotropoulos, C
    Pitas, I
    [J]. DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 1093 - 1096
  • [7] Speech Emotion Recognition by Using Combinations of C5.0, Neural Network (NN), and Support Vector Machines (SVM) Classification Methods
    Javidi, Mohammad Masoud
    Roshan, Ebrahim Fazlizadeh
    [J]. JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2013, 6 (03): : 191 - 200
  • [8] Speech Emotion Recognition Based on Fuzzy Least Squares Support Vector Machines
    Zhang, Shiqing
    [J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 1299 - 1302
  • [9] Mandarin Digits Speech Recognition Using Support Vector Machines
    谢湘
    匡镜明
    [J]. Journal of Beijing Institute of Technology, 2005, (01) : 9 - 12
  • [10] A Study of Speech, Speaker and Emotion Recognition using Mel Frequency Cepstrum Coefficients and Support Vector Machines
    Rajasekhar, Ashwini
    Hota, Malaya Kumar
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 114 - 118