COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION

Cited by: 0
Authors
Trabelsi, I. [1]
Ben Ayed, D. [1]
Ellouze, N. [1]
Affiliations
[1] Univ Tunis El Manar, ENIT, Lab Signal Image & Technol Informat LRSITI, Tunis 1002, Tunisia
Source
Keywords
Speech; Emotions; SVM; GMM; Kernel; Sequence;
DOI
Not available
Chinese Library Classification number
T [Industrial Technology];
Discipline classification code
08;
Abstract
Speech emotion recognition aims to automatically identify the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication because it provides feedback information in many applications. This paper compares two standard methods from speaker recognition and verification, Gaussian Mixture Models (GMM) and Support Vector Machines (SVM), applied to emotion recognition. An extensive comparison of GMM and the GMM/SVM sequence kernel is conducted. The main goal is to analyze and compare the influence of initial parameter settings, such as the number of mixture components, the number of iterations, and the volume of training data, on these two methods. Experiments are performed on the Berlin Emotional Database, which contains different emotions expressed in German. The emotions used in this study are anger, fear, joy, boredom, neutral, disgust, and sadness. Experimental results show the effectiveness of combining GMM and SVM for classifying sound data sequences, compared with systems based on GMM alone.
Pages: 1221-1233
Page count: 13