COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION

被引:0
|
作者
Trabelsi, I. [1 ]
Ben Ayed, D. [1 ]
Ellouze, N. [1 ]
机构
[1] Univ Tunis El Manar, ENIT, Lab Signal Image & Technol Informat LRSITI, Tunis 1002, Tunisia
来源
关键词
Speech; Emotions; SVM; GMM; Kernel; Sequence;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Speech emotion recognition aims at automatically identifying the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication, because it provides feedback information in many applications. This paper makes a comparison of two standard methods used for speaker recognition and verification: Gaussian Mixture Models (GMM) and Support Vector Machines (SVM) for emotion recognition. An extensive comparison of two methods: GMM and GMM/SVM sequence kernel is conducted. The main goal here is to analyze and compare influence of initial setting of parameters such as number of mixture components, used number of iterations and volume of training data for these two methods. Experimental studies are performed over the Berlin Emotional Database, expressing different emotions, in German language. The emotions used in this study are anger, fear, joy, boredom, neutral, disgust, and sadness. Experimental results show the effectiveness of the combination of GMM and SVM in order to classify sound data sequences when compared to systems based on GMM.
引用
下载
收藏
页码:1221 / 1233
页数:13
相关论文
共 50 条
  • [1] COMPARISON OF ADAPTATION METHODS FOR GMM-SVM BASED SPEECH EMOTION RECOGNITION
    Jiang, Jianbo
    Wu, Zhiyong
    Xu, Mingxing
    Jia, Jia
    Cai, Lianhong
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 269 - 273
  • [2] Comparing Feature Dimension Reduction Algorithms for GMM-SVM based Speech Emotion Recognition
    Jiang, Jianho
    Wu, Zhiyong
    Xu, Mingxing
    Jia, Jia
    Cai, Lianhong
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [3] GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1300 - 1312
  • [4] The GMM-SVM Supervector Approach for the Recognition of the Emotional Status from Speech
    Schwenker, Friedhelm
    Scherer, Stefan
    Magdi, Yasmine M.
    Palm, Guenther
    ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 : 894 - +
  • [5] Performances Evaluation of GMM-UBM and GMM-SVM for Speaker Recognition in Realistic World
    Asbai, Nassim
    Amrouche, Abderrahmane
    Debyeche, Mohamed
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 284 - 291
  • [6] GMM supervector based SVM with spectral features for speech emotion recognition
    Hu, Hao
    Xu, Ming-Xing
    Wu, Wei
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 413 - +
  • [7] Automatic Laughter Detection in Spontaneous Speech Using GMM-SVM Method
    Neuberger, Tilda
    Beke, Andras
    TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 113 - 120
  • [8] A SAMPLE AND FEATURE SELECTION SCHEME FOR GMM-SVM BASED LANGUAGE RECOGNITION
    Song, Yan
    Dai, Li-Rong
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 326 - 329
  • [9] A hybrid GMM-SVM speaker identification system
    Mashao, DJ
    2004 IEEE AFRICON: 7TH AFRICON CONFERENCE IN AFRICA, VOLS 1 AND 2: TECHNOLOGY INNOVATION, 2004, : 319 - 322
  • [10] Speaker Recognition and Speech Emotion Recognition Based on GMM
    Xu, Shupeng
    Liu, Yan
    Liu, Xiping
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 434 - 436