A study of interspeaker variability in speaker verification

被引:420
|
作者
Kenny, Patrick [1 ]
Ouellet, Pierre [1 ]
Dehak, Najim [1 ]
Gupta, Vishwa [1 ]
Dumouchel, Pierre [1 ]
机构
[1] Ctr Rech Informat Montreal, Montreal, PQ H3A 1B9, Canada
关键词
channel factors; Gaussian mixture model (GMM); speaker factors; speaker verification;
D O I
10.1109/TASL.2008.925147
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a new approach to the problem of estimating the hyperparameters which define the interspeaker variability model in joint factor analysis. We tested the proposed estimation technique on the NIST 2006 speaker recognition evaluation data and obtained 10%-15% reductions in error rates on the core condition and the extended data condition (as measured both by equal error rates and the NIST detection cost function). We show that when a large joint factor analysis model is trained in this way and tested on the core condition, the extended data condition and the cross-channel condition, it is capable of performing at least as well as fusions of multiple systems of other types. (The comparisons are based on the best results on these tasks that have been reported in the literature.) In the case of the cross-channel condition, a factor analysis model with 300 speaker factors and 200 channel factors can achieve equal error rates of less than 3.0%. This is a substantial improvement over the best results that have previously been reported on this task.
引用
收藏
页码:980 / 988
页数:9
相关论文
共 50 条
  • [31] Total Variability Layer in Deep Neural Network Embeddings for Speaker Verification
    Travadi, Ruchir
    Narayanan, Shrikanth
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (06) : 893 - 897
  • [32] Session variability subspace projection based model compensation for speaker verification
    Deng, Jing
    Zheng, Thomas Fang
    Wu, Wenhu
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 57 - +
  • [33] Within-Session Variability Modelling for Factor Analysis Speaker Verification
    Vogt, Robbie
    Pelecanos, Jason
    Scheffer, Nicolas
    Kajarekar, Sachin
    Sridharan, Sridha
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1531 - +
  • [34] SPEAKER VERIFICATION
    CHAPMAN, WD
    LI, KP
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1966, 40 (05): : 1282 - &
  • [35] Multi-channel speaker verification based on total variability modelling
    Correia, Joana
    Brutti, Alessio
    Abad, Alberto
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2312 - 2316
  • [36] Incorporating Local Acoustic Variability Information into Short Duration Speaker Verification
    Ma, Jianbo
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    Lee, Kong Aik
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1502 - 1506
  • [37] SVM speaker verification using session variability modelling and GMM supervectors
    McLaren, M.
    Vogt, R.
    Sridharan, S.
    ADVANCES IN BIOMETRICS, PROCEEDINGS, 2007, 4642 : 1077 - +
  • [38] Phoneme dependent inter-session variability reduction for speaker verification
    Lu, Haoze
    Zhang, Wenbin
    Horiuchi, Yasuo
    Kuroiwa, Shingo
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2015, 7 (02) : 83 - 96
  • [39] Compensation of Intrinsic Variability with Factor Analysis Modeling for Robust Speaker Verification
    Chen, Sheng
    Xu, Mingxing
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1574 - 1577
  • [40] Speaker verification
    Atkins, Wendy
    Biometric Technology Today, 2001, 9 (03) : 8 - 11