Emotion Recognition from Speech using Gaussian Mixture Model and Vector Quantization

被引:0
|
作者
Agrawal, Surabhi [1 ]
Dongaonkar, Shabda [1 ]
机构
[1] GHRCEM, Dept Comp, Pune, Maharashtra, India
关键词
Anchor models; emotional speech; emotion recognition; GMM model;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, there is a demand to evaluate the effectiveness of anchor models applied to the multiclass drawback of Emotion recognition from speech. Within the anchor models system, associate in nursing emotion category is characterized by its line of similarity relative to different emotion categories. Generative models like Gaussian Mixture Models (GMMs) are typically used as front-end systems to get feature vectors want to train complicated back-end systems like Support Vector Machine (SVMs) to enhance the classification performance. There is a tendency to show that within the context of extremely unbalanced knowledge categories, these back-end systems will improve the performance achieved by GMMs as long as associate in nursing acceptable sampling or importance coefficient technique is applied. The experiments conducted on audio sample of speech show that anchor models considerably improves the performance of GMMs by half dozen 2% relative. There is a tendency to be employing a hybrid approach for recognizing emotion from speech that may be a combination of Vector quantization (VQ) and mathematician Mixture Models GMM. A quick review of labor applied within the space of recognition victimization VQ-GMM hybrid approach is mentioned here.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] CONTEXTUAL VECTOR QUANTIZATION FOR SPEECH RECOGNITION WITH DISCRETE HIDDEN MARKOV MODEL
    HUO, QA
    CHAN, CK
    [J]. PATTERN RECOGNITION, 1995, 28 (04) : 513 - 517
  • [32] Speech Emotion Recognition Based on Gaussian Kernel nonlinear Proximal Support Vector Machine
    Han, Zhiyan
    Wang, Jian
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 2513 - 2516
  • [33] Classification of stressed speech using Gaussian mixture model
    Patro, H
    Raja, GS
    Dandapat, S
    [J]. INDICON 2005 Proceedings, 2005, : 342 - 346
  • [34] Bagged support vector machines for emotion recognition from speech
    Bhavan, Anjali
    Chauhan, Pankaj
    Hitkul
    Shah, Rajiv Ratn
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 184
  • [35] Performance Analysis of Speech Digit Recognition using Cepstrum and Vector Quantization
    Rudresh, M. D.
    Latha, A. S.
    Suganya, J.
    Nayana, C. G.
    [J]. 2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 632 - 637
  • [36] Scalable distributed speech recognition using Gaussian mixture model-based block quantisation
    So, Stephen
    Paliwal, Kuldip K.
    [J]. SPEECH COMMUNICATION, 2006, 48 (06) : 746 - 758
  • [37] Speech Enhancement for Automatic Speech Recognition Using Complex Gaussian Mixture Priors for Noise and Speech
    Astudillo, Ramon F.
    Hoffmann, Eugen
    Mandelartz, Philipp
    Orglmeister, Reinhold
    [J]. ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 60 - 67
  • [38] SPEECH EMOTION RECOGNITION WITH I-VECTOR FEATURE AND RNN MODEL
    Zhang, Teng
    Wu, Ji
    [J]. 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 524 - 528
  • [39] Gaussian mixture language models for speech recognition
    Afify, Mohamed
    Siohan, Olivier
    Sarikaya, Ruhi
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 29 - +
  • [40] SUBSPACE GAUSSIAN MIXTURE MODELS FOR SPEECH RECOGNITION
    Povey, Daniel
    Burget, Lukas
    Agarwal, Mohit
    Akyazi, Pinar
    Feng, Kai
    Ghoshal, Arnab
    Glembek, Ondrej
    Goel, Nagendra Kumar
    Karafiat, Martin
    Rastrow, Ariya
    Rose, Richard C.
    Schwarz, Petr
    Thomas, Samuel
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4330 - 4333