Emotion Recognition from Speech using Gaussian Mixture Model and Vector Quantization

被引:0
|
作者
Agrawal, Surabhi [1 ]
Dongaonkar, Shabda [1 ]
机构
[1] GHRCEM, Dept Comp, Pune, Maharashtra, India
关键词
Anchor models; emotional speech; emotion recognition; GMM model;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, there is a demand to evaluate the effectiveness of anchor models applied to the multiclass drawback of Emotion recognition from speech. Within the anchor models system, associate in nursing emotion category is characterized by its line of similarity relative to different emotion categories. Generative models like Gaussian Mixture Models (GMMs) are typically used as front-end systems to get feature vectors want to train complicated back-end systems like Support Vector Machine (SVMs) to enhance the classification performance. There is a tendency to show that within the context of extremely unbalanced knowledge categories, these back-end systems will improve the performance achieved by GMMs as long as associate in nursing acceptable sampling or importance coefficient technique is applied. The experiments conducted on audio sample of speech show that anchor models considerably improves the performance of GMMs by half dozen 2% relative. There is a tendency to be employing a hybrid approach for recognizing emotion from speech that may be a combination of Vector quantization (VQ) and mathematician Mixture Models GMM. A quick review of labor applied within the space of recognition victimization VQ-GMM hybrid approach is mentioned here.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Speech emotion recognition using Gaussian mixture vector autoregressive models
    El Ayadi, Moataz M. H.
    Kamel, Mohamed S.
    Karray, Fakhri
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 957 - +
  • [2] Robust Features for Emotion Recognition from Speech by Using Gaussian Mixture Model Classification
    Navyasri, M.
    RajeswarRao, R.
    DaveeduRaju, A.
    Ramakrishnamurthy, M.
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS (ICTIS 2017) - VOL 2, 2018, 84 : 437 - 444
  • [3] i-vector Algorithm with Gaussian Mixture Model for Efficient Speech Emotion Recognition
    Gomes, Joan
    El-Sharkawy, Mohamed
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2015, : 476 - 480
  • [4] The Research of Speech Emotion Recognition Based on Gaussian Mixture Model
    Zhang, Wanli
    Li, Guoxin
    Gao, Wei
    [J]. MECHANICAL COMPONENTS AND CONTROL ENGINEERING III, 2014, 668-669 : 1126 - +
  • [5] Application of Vector Quantization in Emotion Recognition from Human Speech
    Khanna, Preeti
    Kumar, M. Sasi
    [J]. INFORMATION INTELLIGENCE, SYSTEMS, TECHNOLOGY AND MANAGEMENT, 2011, 141 : 118 - +
  • [6] EMOTION RECOGNITION FROM SPEECH VIA BOOSTED GAUSSIAN MIXTURE MODELS
    Tang, Hao
    Chu, Stephen M.
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 294 - +
  • [7] Variational Gaussian Mixture Models for Speech Emotion Recognition
    Mishra, Harendra Kumar
    Sekhar, C. Chandra
    [J]. ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 183 - 186
  • [8] Self Learning Speech Recognition Model Using Vector Quantization
    Saleem, M.
    Rehman, Zia Ur
    Zahoor, Usama
    Mazhar, Amna
    Anjum, M. R.
    [J]. 2016 SIXTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2016, : 199 - 203
  • [9] Improved Emotion Recognition Using Gaussian Mixture Model and Extreme Learning Machine in Speech and Glottal Signals
    Muthusamy, Hariharan
    Polat, Kemal
    Yaacob, Sazali
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [10] Waveform quantization of speech using Gaussian mixture models
    Samuelsson, J
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 165 - 168