Speaker verification using adapted Gaussian mixture models

Cited by: 2852
Authors
Reynolds, DA [1]
Quatieri, TF [1]
Dunn, RB [1]
Affiliations
[1] MIT, Lincoln Lab, Speech Syst Technol Grp, Lexington, MA 02420 USA
Keywords
speaker recognition; Gaussian mixture models; likelihood ratio detector; universal background model; handset normalization; NIST evaluation
DOI
10.1006/dspr.1999.0361
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
In this paper we describe the major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs). The system is built around the likelihood ratio test for verification, using simple but effective GMMs for likelihood functions, a universal background model (UBM) for alternative speaker representation, and a form of Bayesian adaptation to derive speaker models from the UBM. The development and use of a handset detector and score normalization to greatly improve verification performance is also described and discussed. Finally, representative performance benchmarks and system behavior experiments on NIST SRE corpora are presented. © 2000 Academic Press.
Pages: 19-41
Page count: 23
Related papers
50 items in total
  • [21] Speaker recognition for VoIP transmission using Gaussian mixture models
    Staroniewicz, P
    COMPUTER RECOGNITION SYSTEMS, PROCEEDINGS, 2005: 739 - 745
  • [22] Comparative evaluation of maximum a posteriori vector quantization and Gaussian mixture models in speaker verification
    Kinnunen, Tomi
    Saastamoinen, Juhani
    Hautamaki, Ville
    Vinni, Mikko
    Franti, Pasi
    PATTERN RECOGNITION LETTERS, 2009, 30 (04) : 341 - 347
  • [23] Text-Independent Speaker Verification Using Variational Gaussian Mixture Model
    Moattar, Mohammad Hossein
    Homayounpour, Mohammad Mehdi
    ETRI JOURNAL, 2011, 33 (06) : 914 - 923
  • [24] Comparison of speaker segmentation methods based on the Bayesian information criterion and adapted Gaussian mixture models
    Grasic, Matej
    Kos, Marko
    Zgank, Andrej
    Kacic, Zdravko
    PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008: 161 - 164
  • [25] TEXT INDEPENDENT SPEAKER VERIFICATION USING ENHANCED SORTED GAUSSIAN MIXTURE MODEL
    Saeidi, R.
    Ganchev, T.
    Mohammadi, H. R. Sadegh
    ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 1191 - +
  • [26] ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
    REYNOLDS, DA
    ROSE, RC
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01): 72 - 83
  • [27] Skew Gaussian mixture models for speaker recognition
    Matza, Avi
    Bistritz, Yuval
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011: 12 - 15
  • [28] Skew Gaussian mixture models for speaker recognition
    Matza, Avi
    Bistritz, Yuval
    IET SIGNAL PROCESSING, 2014, 8 (08) : 860 - 867
  • [29] Phoneme based speaker verification by Gaussian mixture models with adaptation of subsets of dominant parameters and phonemes
    Gutman, D
    Bistritz, Y
    22ND CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, PROCEEDINGS, 2002: 246 - 246
  • [30] Efficient text-independent speaker verification with structural Gaussian mixture models and neural network
    Xiang, B
    Berger, T
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): 447 - 456