An i-vector GPLDA System for Speech based Emotion Recognition

被引:0
|
作者
Gamage, Kalani Wataraka [1 ,2 ]
Sethu, Vidhyasaharan [1 ]
Phu Ngoc Le [1 ,2 ]
Ambikairajah, Eliathamby [1 ,2 ]
机构
[1] UNSW, Sch Elect Engn & Telecommun, Kensington, NSW, Australia
[2] Natl ICT Australia NICTA, ATP Res Lab, Sydney, NSW, Australia
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose the use of a Gaussian Probabilistic Linear Discriminant Analysis (GPLDA) back-end for utterance level emotion classification based on i-vectors representing the distribution of frame level MFCC features. Experimental results based on the IEMOCAP corpus show that the GPLDA back-end outperforms an SVM based back-end while being less sensitive to i-vector dimensionality, making the proposed framework more robust to parameter tuning during system development.
引用
下载
收藏
页码:289 / 292
页数:4
相关论文
共 50 条
  • [41] I-VECTOR ESTIMATION AS AUXILIARY TASK FOR MULTI-TASK LEARNING BASED ACOUSTIC MODELING FOR AUTOMATIC SPEECH RECOGNITION
    Pironkov, Gueorgui
    Dupont, Stephane
    Dutoit, Thierry
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 1 - 7
  • [42] Generalized Discriminant Analysis (GDA) for Improved i-Vector Based Speaker Recognition
    Bahmaninezhad, Fahimeh
    Hansen, John H. L.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3643 - 3647
  • [43] Generalized cosine similarity in I-vector based automatic speaker recognition systems
    Drgas, Szymon
    Dabrowski, Adam
    2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 73 - 77
  • [44] Generalizing I-Vector Estimation for Rapid Speaker Recognition
    Xu, Longting
    Lee, Kong Aik
    Li, Haizhou
    Yang, Zhen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 749 - 759
  • [45] Supervised I-vector modeling for language and accent recognition
    Ramoji, Shreyas
    Ganapathy, Sriram
    COMPUTER SPEECH AND LANGUAGE, 2020, 60
  • [46] On the use of the i-vector speech representation for instrumental quality measurement
    Anderson R. Avila
    Jahangir Alam
    Douglas O’Shaughnessy
    Tiago H. Falk
    Quality and User Experience, 2020, 5 (1)
  • [47] I-vector based speaker recognition using advanced channel compensation techniques
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    McLaren, Mitchell
    Vogt, Robbie
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01): : 121 - 140
  • [48] Speaker recognition based on discriminant i-vector local distance preserving projection
    Li, Zhiyi
    He, Liang
    Zhang, Weiqiang
    Liu, Jia
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2012, 52 (05): : 598 - 601
  • [49] Hindi speech recognition using time delay neural network acoustic modeling with i-vector adaptation
    Ankit Kumar
    Rajesh Kumar Aggarwal
    International Journal of Speech Technology, 2022, 25 : 67 - 78
  • [50] Hindi speech recognition using time delay neural network acoustic modeling with i-vector adaptation
    Kumar, Ankit
    Aggarwal, Rajesh Kumar
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 67 - 78