A Study of Support Vector Machines for Emotional Speech Recognition

被引:0
|
作者
Kurpukdee, Nattapong [1 ,2 ]
Kasuriya, Sawit [2 ]
Chunwijitra, Vataya [2 ]
Wutiwiwatchai, Chai [2 ]
Lamsrichan, Poonlap [1 ]
机构
[1] Kasetsart Univ, ICTES Program, TAIST Tokyo Tech, Bangkok, Thailand
[2] NSTDA, NECTEC, 112 Pahonyothin Rd, Pathum Thani 12120, Thailand
关键词
Emotional Speech Recognition (ESR) and Classification; Utterance features; SVM (Support Vector Machines); Binary Support Vector Machines (BSVM); FEATURES; SVM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, efficiency comparison of Support Vector Machines (SVM) and Binary Support Vector Machines (BSVM) techniques in utterance-based emotion recognition is studied. Acoustic features including energy, Mel-frequency cepstral coefficients (MFCC), Perceptual linear predictive (PLP), Filter bank (FBANK), pitch, their first and second derivatives are used as frame-based features. Four basic emotions including anger, happiness, neutral and sadness in Interactive Emotional Dyadic Motion Capture (IEMOCAP) database are selected for training and evaluating in our experiments. The best accuracy of emotional speech recognition is 58.40% in average from SVM with polynomial kernel. Energy features combination with FBANK, pitch and their first and second derivatives features are the most suitable for computing utterance feature. Binary Support Vector Machines (BSVM) techniques show accuracy improvement in some emotions, such as sadness and happiness emotion.
引用
收藏
页数:6
相关论文
共 50 条
  • [11] Speech Emotion Recognition Using Support Vector Machines
    Yu, Caiming
    Tian, Qingxi
    Cheng, Fang
    Zhang, Shiqing
    [J]. ADVANCED RESEARCH ON COMPUTER SCIENCE AND INFORMATION ENGINEERING, PT I, 2011, 152 : 215 - 220
  • [12] DEEP NEURAL SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION
    Zhang, Shi-Xiong
    Liu, Chaojun
    Yao, Kaisheng
    Gong, Yifan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4275 - 4279
  • [13] INFINITE STRUCTURED SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION
    Yang, J.
    van Dalen, R. C.
    Zhang, S. -X.
    Gales, M. J. F.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [14] Mixture of Support Vector Machines for HMM based speech recognition
    Krueger, Sven E.
    Schaffoener, Martin
    Katz, Marcel
    Andelic, Edin
    Wendemuth, Andreas
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 326 - +
  • [15] Bagged support vector machines for emotion recognition from speech
    Bhavan, Anjali
    Chauhan, Pankaj
    Hitkul
    Shah, Rajiv Ratn
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 184
  • [16] Recognition of bimodal produced speech based on Support Vector Machines
    Galic, Jovan
    Pavlovic, Dragana Sumarac
    Jovicic, Slobodan T.
    Markovic, Branko
    Grozdic, Dorde
    [J]. 2017 25TH TELECOMMUNICATION FORUM (TELFOR), 2017, : 362 - 365
  • [17] On the Optimization of Multiclass Support Vector Machines Dedicated to Speech Recognition
    Mezzoudj, Freha
    Benyettou, Assia
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2012, PT II, 2012, 7664 : 1 - 8
  • [18] Application of Support Vector Machines classifiers to visual speech recognition
    Gordan, M
    Kotropoulos, C
    Pitas, I
    [J]. 2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2002, : 129 - 132
  • [19] Mandarin Digits Speech Recognition Using Support Vector Machines
    谢湘
    匡镜明
    [J]. Journal of Beijing Institute of Technology, 2005, (01) : 9 - 12
  • [20] An overview of speech recognition system based on the Support Vector Machines
    Sonkamble, Balwant A.
    Doye, D. D.
    [J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, : 768 - +