An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition

Cited by: 60
Authors
You, Chang Huai [1]
Lee, Kong Aik [1]
Li, Haizhou [1]
Affiliations
[1] ASTAR, Agcy Sci Technol & Res, Inst Infocomm Res, I2R, Singapore 138632, Singapore
Keywords
Gaussian mixture model; National Institute of Standards and Technology (NIST) evaluation; speaker recognition; supervector; support vector machine
DOI
10.1109/LSP.2008.2006711
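Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract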
The Gaussian mixture model (GMM) and the support vector machine (SVM) have become popular classifiers in text-independent speaker recognition. A GMM-supervector characterizes a speaker's voice with the parameters of a GMM, which include mean vectors, covariance matrices, and mixture weights. The GMM-supervector SVM benefits from both the GMM and SVM frameworks to achieve state-of-the-art performance. The conventional Kullback-Leibler (KL) kernel in the GMM-supervector SVM classifier limits the adaptation of the GMM to the mean values and leaves the covariances unchanged. In this letter, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. This leads to a new kernel for the SVM classifier. Compared with the KL kernel, the new kernel allows us to exploit information not only from the mean but also from the covariance. We demonstrate the effectiveness of the new kernel on the 2006 National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) dataset.
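The abstract does not give the kernel formula, so the following is only a minimal sketch of the idea it describes, under stated assumptions: diagonal covariances, equal treatment of all mixture components (mixture weights ignored), and a "mean interval" vector built from the mean-dependent term of the Bhattacharyya distance between each speaker-adapted Gaussian and the corresponding UBM Gaussian. The function names are hypothetical and are not taken from the paper.

```python
import math


def bhattacharyya_diag(mu1, var1, mu2, var2):
    """Bhattacharyya distance between two Gaussians with diagonal covariances."""
    mean_term = 0.0
    cov_term = 0.0
    for m1, v1, m2, v2 in zip(mu1, var1, mu2, var2):
        v = 0.5 * (v1 + v2)  # averaged variance for this dimension
        mean_term += (m1 - m2) ** 2 / v
        cov_term += math.log(v / math.sqrt(v1 * v2))
    return 0.125 * mean_term + 0.5 * cov_term


def mean_interval_supervector(spk_means, spk_vars, ubm_means, ubm_vars):
    """Stack variance-normalized mean offsets from the UBM (hypothetical helper).

    Each argument is a list with one entry per mixture component; each entry
    is a list with one value per feature dimension. Both the adapted mean and
    the adapted variance of the speaker model affect the result.
    """
    supervector = []
    for mu_s, var_s, mu_u, var_u in zip(spk_means, spk_vars, ubm_means, ubm_vars):
        for ms, vs, mu, vu in zip(mu_s, var_s, mu_u, var_u):
            supervector.append((ms - mu) / math.sqrt(0.5 * (vs + vu)))
    return supervector


def linear_kernel(a, b):
    """Inner product of two supervectors, usable as an SVM kernel value."""
    return sum(x * y for x, y in zip(a, b))
```

With this construction, the squared norm of a supervector equals eight times the sum of the per-component mean terms of the Bhattacharyya distance, which is how the covariance of the adapted model enters the kernel alongside the mean.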
Pages: 49-52
Number of pages: 4
Related papers
50 in total
  • [41] Convergence between SVM-based and distance-based paradigms for speaker recognition
    Charlet, Delphine
    Zhao, Xianyu
    Dong, Yuan
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1389 - +
  • [42] A multi-class MLLR kernel for SVM speaker recognition
    Karam, Zahi N.
    Campbell, William M.
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4117 - +
  • [43] Evaluation of GMM-based Features for SVM Speaker Verification
    Liu, Minghui
    Huang, Zhongwei
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5027 - 5030
  • [44] SVM-based text-independent speaker verification using derivative kernel in the reference GMM space
    Xu, Minqiang
    Dai, Beiqian
    Xu, Dongxing
    Yang, Shiqing
    Liu, Qingsong
    2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 422 - 425
  • [45] GMM-based SVM for face recognition
    Bredin, Herve
    Dehak, Najim
    Chollet, Gerard
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 1111 - +
  • [46] Text-independent speaker recognition using probabilistic SVM with GMM adjustment
    Hou, FL
    Wang, BX
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 305 - 308
  • [47] Speaker Recognition Based on Fusion of a Deep and Shallow Recombination Gaussian Supervector
    Sun, Linhui
    Bu, Yunyi
    Zou, Bo
    Fu, Sheng
    Li, Pingan
    ELECTRONICS, 2021, 10 (01) : 1 - 21
  • [48] Homogenous ensemble phonotactic language recognition based on SVM supervector reconstruction
    Liu, Wei-Wei
    Zhang, Wei-Qiang
    Johnson, Michael T.
    Liu, Jia
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 13
  • [49] A GMM-based Probabilistic Sequence Kernel for Speaker Verification
    Lee, Kong-Aik
    You, Changhuai
    Li, Haizhou
    Kinnunen, Tomi
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1553 - 1556
  • [50] Homogenous ensemble phonotactic language recognition based on SVM supervector reconstruction
    Wei-Wei Liu
    Wei-Qiang Zhang
    Michael T Johnson
    Jia Liu
    EURASIP Journal on Audio, Speech, and Music Processing, 2014