An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition

被引:60
|
作者
You, Chang Huai [1 ]
Lee, Kong Aik [1 ]
Li, Haizhou [1 ]
机构
[1] ASTAR, Agcy Sci Technol & Res, Inst Infocomm Res, I2R, Singapore 138632, Singapore
关键词
Gaussian mixture model; National Institute of Standards and Technology (NIST) evaluation; speaker recognition; supervector; support vector machine; SUPPORT VECTOR MACHINES;
D O I
10.1109/LSP.2008.2006711
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Gaussian mixture model (GMM) and support vector machine (SVM) have become popular classifiers in text-independent speaker recognition. A GMM-supervector characterizes a speaker's voice with the parameters of GMM, which include mean vectors, covariance matrices, and mixture weights. GMM-supervector SVM benefits from both GMM and SVM frameworks to achieve the state-of-the-art performance. Conventional Kullback-Leibler (KL) kernel in GMM-supervector SVM classifier limits the adaptation of GMM to mean value and leaves covariance unchanged. In this letter, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. This leads to a new kernel for SVM classifier. Comparing with the KL kernel, the new kernel allows us to exploit the information not only from the mean but also from the covariance. We demonstrate the effectiveness of the new kernel on the 2006 National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) dataset.
引用
收藏
页码:49 / 52
页数:4
相关论文
共 50 条
  • [1] A GMM SUPERVECTOR KERNEL WITH THE BHATTACHARYYA DISTANCE FOR SVM BASED SPEAKER RECOGNITION
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4221 - 4224
  • [2] Speaker Verification Using SVM Kernel with GMM-Supervector Based on the Mahalanobis Distance
    Kim, Hyoung-Gook
    Shin, Dong
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2010, 29 (03): : 216 - 221
  • [3] GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1300 - 1312
  • [4] Structural MAP Adaptation in GMM-Supervector based Speaker Recognition
    Ferras, Marc
    Shinoda, Koichi
    Furui, Sadaoki
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5432 - 5435
  • [5] Enhanced Speaker Verification Using GMM-Supervector Based Modified Adaptive GMM Training
    Trinh, Tan Dat
    Park, Min Kyung
    Kim, Jin Young
    Lee, Kyong Rok
    Cho, Keeseong
    MOBILE AND WIRELESS TECHNOLOGY 2015, 2015, 310 : 147 - 153
  • [6] SVM based speaker verification using a GMM supervector kernel and nap variability compensation
    Campbell, W. M.
    Sturim, D. E.
    Reynolds, D. A.
    Solomonoff, A.
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 97 - 100
  • [7] Dialect Recognition Using a Phone-GMM-Supervector-Based SVM Kernel
    Biadsy, Fadi
    Hirschberg, Julia
    Collins, Michael
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 753 - +
  • [8] GMM-based Bhattacharyya kernel Fisher Discriminant Analysis for speaker recognition
    Chao, YH
    Wang, HM
    Chang, RC
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 649 - 652
  • [9] SVM based speaker selection using GMM supervector for rapid speaker adaptation
    Wang, Jian
    Lei, Jianjun
    Guo, Jun
    Yang, Zhen
    SIMULATED EVOLUTION AND LEARNING, PROCEEDINGS, 2006, 4247 : 617 - 624
  • [10] Audio-based Emotion Recognition using GMM Supervector an SVM Linear Kernel
    Dinh-Son Tran
    Yang, Hyung-Jeong
    Kim, Soo-Hyung
    Lee, Guee Sang
    Luu-Ngoc Do
    Ngoc-Huynh Ho
    Van Quan Nguyen
    2ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING (ICMLSC 2018), 2015, : 169 - 173