An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition

被引:60
|
作者
You, Chang Huai [1 ]
Lee, Kong Aik [1 ]
Li, Haizhou [1 ]
机构
[1] ASTAR, Agcy Sci Technol & Res, Inst Infocomm Res, I2R, Singapore 138632, Singapore
关键词
Gaussian mixture model; National Institute of Standards and Technology (NIST) evaluation; speaker recognition; supervector; support vector machine; SUPPORT VECTOR MACHINES;
D O I
10.1109/LSP.2008.2006711
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Gaussian mixture model (GMM) and support vector machine (SVM) have become popular classifiers in text-independent speaker recognition. A GMM-supervector characterizes a speaker's voice with the parameters of GMM, which include mean vectors, covariance matrices, and mixture weights. GMM-supervector SVM benefits from both GMM and SVM frameworks to achieve the state-of-the-art performance. Conventional Kullback-Leibler (KL) kernel in GMM-supervector SVM classifier limits the adaptation of GMM to mean value and leaves covariance unchanged. In this letter, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. This leads to a new kernel for SVM classifier. Comparing with the KL kernel, the new kernel allows us to exploit the information not only from the mean but also from the covariance. We demonstrate the effectiveness of the new kernel on the 2006 National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) dataset.
引用
收藏
页码:49 / 52
页数:4
相关论文
共 50 条
  • [21] Multi-feature Fusion using Multi-GMM Supervector for SVM Speaker Verification
    Liu, Minghui
    Huang, Zhongwei
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4332 - 4335
  • [22] Investigating the Effect of Data Partitioning for GMM Supervector Based Speaker Verification
    Dikici, Erinc
    Saraclar, Murat
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 464 - 469
  • [23] A Method to Integrate GMM, SVM and DTW for Speaker Recognition
    Ding, Ing-Jr
    Yen, Chih-Ta
    Ou, Da-Cheng
    INTERNATIONAL JOURNAL OF ENGINEERING AND TECHNOLOGY INNOVATION, 2014, 4 (01) : 38 - 47
  • [24] Human Action Recognition based on GMM-UBM supervector using SVM with non-linear GMM KL and GUMI
    Bui, Nam N.
    Kim, Young J.
    SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2015), 2015, 9631
  • [25] Data-driven UBM Generation via Tied Gaussians for GMM-Supervector Based Accent Identification
    Zheng, Rong
    Zhang, Ce
    Xu, Bo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 852 - 855
  • [26] Singing speaker clustering based on subspace learning in the GMM mean supervector space
    Mehrabani, Mahnoosh
    Hansen, John H. L.
    SPEECH COMMUNICATION, 2013, 55 (05) : 653 - 666
  • [27] SVM AGAINST GMM/SVM FOR DIALECT INFLUENCE ON AUTOMATIC SPEAKER RECOGNITION TASK
    Zergat, Kawthar
    Amrouche, Abderrahmane
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2014, 13 (02)
  • [28] New scheme based on GMM-PCA-SVM modelling for automatic speaker recognition
    Zergat, Kawthar
    Amrouche, Abderrahmane
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (04) : 373 - 381
  • [29] Speaker Cluster based GMM Tokenization for Speaker Recognition
    Ma, Bin
    Zhu, Donglai
    Tong, Rong
    Li, Haizhou
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
  • [30] A Non-linear GMM KL and GUMI Kernel for SVM Using GMM-UBM Supervector in Home Acoustic Event Classification
    Ngoc Nam Bui
    Kim, Jin Young
    Trinh, Tan Dat
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2014, E97A (08): : 1791 - 1794