An i-vector GPLDA System for Speech based Emotion Recognition

被引:0
|
作者
Gamage, Kalani Wataraka [1 ,2 ]
Sethu, Vidhyasaharan [1 ]
Phu Ngoc Le [1 ,2 ]
Ambikairajah, Eliathamby [1 ,2 ]
机构
[1] UNSW, Sch Elect Engn & Telecommun, Kensington, NSW, Australia
[2] Natl ICT Australia NICTA, ATP Res Lab, Sydney, NSW, Australia
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose the use of a Gaussian Probabilistic Linear Discriminant Analysis (GPLDA) back-end for utterance level emotion classification based on i-vectors representing the distribution of frame level MFCC features. Experimental results based on the IEMOCAP corpus show that the GPLDA back-end outperforms an SVM based back-end while being less sensitive to i-vector dimensionality, making the proposed framework more robust to parameter tuning during system development.
引用
收藏
页码:289 / 292
页数:4
相关论文
共 50 条
  • [1] SPEECH EMOTION RECOGNITION WITH I-VECTOR FEATURE AND RNN MODEL
    Zhang, Teng
    Wu, Ji
    [J]. 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 524 - 528
  • [2] Emotion Recognition in I-vector Spaced
    Mackova, Lenka
    Cizmar, Anton
    Juhar, Jozef
    [J]. PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA 2016), 2016, : 372 - 375
  • [3] i-vector Algorithm with Gaussian Mixture Model for Efficient Speech Emotion Recognition
    Gomes, Joan
    El-Sharkawy, Mohamed
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2015, : 476 - 480
  • [4] I-vector Based Utterance Verification for Large-Vocabulary Speech Recognition System
    Choi, Woo Yong
    Song, Hwa Jeon
    Chung, Hoon
    Kang, Jeomja
    Park, Jeon Gue
    [J]. 2016 FIRST IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND THE INTERNET (ICCCI 2016), 2016, : 316 - 319
  • [5] Using i-Vector Space Model for Emotion Recognition
    Xia, Rui
    Liu, Yang
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2227 - 2230
  • [6] Robust i-vector based Adaptation of DNN Acoustic Model for Speech Recognition
    Garimella
    Mandal, Arindam
    Strom, Nikko
    Hoffmeister, Bjorn
    Matsoukas, Spyros
    Parthasarathi, Hari Krishnan
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2877 - 2881
  • [7] An i-Vector based Approach to Training Data Clustering for Improved Speech Recognition
    Zhang, Yu
    Xu, Jian
    Yan, Zhi-Jie
    Huo, Qiang
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 796 - 799
  • [8] I-vector Based Speaker Gender Recognition
    Wang, Minghe
    Chen, Ying
    Tang, Zhenmin
    Zhang, Erhua
    [J]. 2015 IEEE ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2015, : 729 - 732
  • [9] I-Vector Dependent Feature Space Transformations for Adaptive Speech Recognition
    Li, Xiangang
    Wu, Xihong
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3635 - 3639
  • [10] I-Vector Speaker and Language Recognition System on Android
    Vazquez-Machado, Christian
    Colon-Hernandez, Pedro
    Torres-Carrasquillo, Pedro A.
    [J]. 2016 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2016,