A Hybrid Modeling Strategy for GMM-SVM Speaker Recognition with Adaptive Relevance Factor

被引:0
|
作者
You, Chang Huai [1 ]
Li, Haizhou [1 ]
Lee, Kong Aik [1 ]
机构
[1] ASTAR, Inst Infocomm Res, Human Language Technol Dept, Singapore 138632, Singapore
关键词
speaker recognition; Gaussian mixture model; maximum a posteriori; VERIFICATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In Gaussian mixture model (GMM) approach to speaker recognition, it has been found that the maximum a posteriori (MAP) estimation is greatly affected by undesired variability due to varying duration of utterance as well as other hidden factors related to recording devices, session environment, and phonetic contents. We propose an adaptive relevance factor (RF) to compensate for this variability. In the other side, in realistic application, it is likely that the different channel corresponds to its different training and test conditions in terms of quantity and quality of the speech signals. In this connection, we develop a hybrid model that combines multiple complementary systems, each of which focuses on specific condition(s). We show the effectiveness of the proposed method on the core task of the National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) 2008.
引用
收藏
页码:2754 / 2757
页数:4
相关论文
共 50 条
  • [1] Effect of Relevance Factor of Maximum a posteriori Adaptation for GMM-SVM in Speaker and Language Recognition
    You, Chang Huai
    Li, Haizhou
    Ma, Bin
    Lee, Kong Aik
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2063 - 2066
  • [2] A STUDY ON GMM-SVM WITH ADAPTIVE RELEVANCE FACTOR AND ITS COMPARISON WITH I-VECTOR AND JFA FOR SPEAKER RECOGNITION
    You, Chang Huai
    Li, Haizhou
    Ma, Bin
    Lee, Kong Aik
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7683 - 7687
  • [3] A hybrid GMM-SVM speaker identification system
    Mashao, DJ
    [J]. 2004 IEEE AFRICON: 7TH AFRICON CONFERENCE IN AFRICA, VOLS 1 AND 2: TECHNOLOGY INNOVATION, 2004, : 319 - 322
  • [4] A hybrid system based on GMM-SVM for Speaker Identification
    Chakroun, Rania
    Zouari, Leila Beltaifa
    Frikha, Mondher
    Ben Hamida, Ahmed
    [J]. 2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 654 - 658
  • [5] GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1300 - 1312
  • [6] Performances Evaluation of GMM-UBM and GMM-SVM for Speaker Recognition in Realistic World
    Asbai, Nassim
    Amrouche, Abderrahmane
    Debyeche, Mohamed
    [J]. NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 284 - 291
  • [7] A NEW STUDY OF GMM-SVM SYSTEM FOR TEXT-DEPENDENT SPEAKER RECOGNITION
    Sun, Hanwu
    Lee, Kong Aik
    Ma, Bin
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4195 - 4199
  • [8] MiniVectors: an Improved GMM-SVM Approach for Speaker Verification
    Anguera, Xavier
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2323 - 2326
  • [9] Combining Deep Speaker Specific Representations with GMM-SVM for Speaker Verification
    Price, Ryan
    Biswas, Sangeeta
    Shinoda, Koichi
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2787 - 2791
  • [10] Client dependent GMM-SVM models for speaker verification
    Le, Q
    Bengio, S
    [J]. ARTIFICIAL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 443 - 451