Secondary classification for GMM based speaker recognition

被引:0
|
作者
Pelecanos, Jason [1 ]
Povey, Dan [1 ]
Ramaswamy, Ganesh [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Conversat Biometr Grp, Yorktown Hts, NY 10598 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper discusses the use of a secondary classifier to reweight the frame-based scores of a speaker recognition system according to which region in feature space they belong. The score mapping function is constructed to perform a likelihood ratio (LR) correction of the original LR scores. This approach has the ability to limit the effect of rogue model components and regions of feature space that may not be robust to different audio environments, handset types or speakers. Prior information available from tests on a development data set can be used to determine a log-likelihood-ratio mapping function that more appropriately weights each speech frame. The computational overhead for this approach in online mode is close to negligible for significant performance gains shown for the NIST 2004 Speaker Recognition Evaluation data.
引用
收藏
页码:109 / 112
页数:4
相关论文
共 50 条
  • [1] Speaker Cluster based GMM Tokenization for Speaker Recognition
    Ma, Bin
    Zhu, Donglai
    Tong, Rong
    Li, Haizhou
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
  • [2] Speaker Recognition and Speech Emotion Recognition Based on GMM
    Xu, Shupeng
    Liu, Yan
    Liu, Xiping
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 434 - 436
  • [3] Speaker recognition based on the combination of GMM and SVDD
    Zhou, Yuhuan
    Zhang, Xiongwei
    Wang, Jinming
    Gong, Yong
    Zhou, Yi
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2011, 87 (03): : 329 - 332
  • [4] Speaker Recognition Based on GMM with an Embedded TDNN
    Chen, Cunbao
    Zhao, Li
    [J]. NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 746 - 753
  • [5] Experimental Study on GMM-Based Speaker Recognition
    Ye, Wenxing
    Wu, Dapeng
    Nucci, Antonio
    [J]. MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010, 2010, 7708
  • [6] On the use of orthogonal GMM in speaker recognition
    Liu, L
    He, JL
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 845 - 848
  • [7] Signal bias removal based GMM for robust speaker recognition
    Kim, YJ
    Chung, JH
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4163 - 4163
  • [8] GMM and kernel-based speaker recognition with the ISIP toolkit
    Imbiriba, T
    Klautau, A
    Parihar, N
    Raghavan, S
    Picone, J
    [J]. MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 371 - 380
  • [9] Structural MAP Adaptation in GMM-Supervector based Speaker Recognition
    Ferras, Marc
    Shinoda, Koichi
    Furui, Sadaoki
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5432 - 5435
  • [10] Score Regulation based on GMM Token Ratio Similarity for Speaker Recognition
    Yang, Yingchun
    Deng, Licai
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 424 - 424