A study of interspeaker variability in speaker verification

被引:420
|
作者
Kenny, Patrick [1 ]
Ouellet, Pierre [1 ]
Dehak, Najim [1 ]
Gupta, Vishwa [1 ]
Dumouchel, Pierre [1 ]
机构
[1] Ctr Rech Informat Montreal, Montreal, PQ H3A 1B9, Canada
关键词
channel factors; Gaussian mixture model (GMM); speaker factors; speaker verification;
D O I
10.1109/TASL.2008.925147
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a new approach to the problem of estimating the hyperparameters which define the interspeaker variability model in joint factor analysis. We tested the proposed estimation technique on the NIST 2006 speaker recognition evaluation data and obtained 10%-15% reductions in error rates on the core condition and the extended data condition (as measured both by equal error rates and the NIST detection cost function). We show that when a large joint factor analysis model is trained in this way and tested on the core condition, the extended data condition and the cross-channel condition, it is capable of performing at least as well as fusions of multiple systems of other types. (The comparisons are based on the best results on these tasks that have been reported in the literature.) In the case of the cross-channel condition, a factor analysis model with 300 speaker factors and 200 channel factors can achieve equal error rates of less than 3.0%. This is a substantial improvement over the best results that have previously been reported on this task.
引用
收藏
页码:980 / 988
页数:9
相关论文
共 50 条
  • [1] Generalized Variability Model for Speaker Verification
    Ma, Jianbo
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    Lee, Kong Aik
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (12) : 1775 - 1779
  • [2] Session Variability in Automatic Speaker Verification
    Hayet, Djellali
    Radia, Amirouche
    Akila, Djebbar
    Tayeb, Laskri Mohamed
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 185 - 190
  • [3] Speaker and session variability in GMM-based speaker verification
    Kenny, Patrick
    Boulianne, Gilles
    Ouellet, Pierre
    Dumouchel, Pierre
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
  • [4] Intra-speaker variability effects on Speaker Verification performance
    Kahn, Juliette
    Audibert, Nicolas
    Rossato, Solange
    Bonastre, Jean-Francois
    ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 109 - 116
  • [5] Speaker Verification Based on SVM and Total Variability
    Zhang, Sheng
    Xu, Jie
    Guo, Wu
    Hu, Guoping
    Ma, Xiaokong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 418 - 418
  • [6] Experiments in session variability modelling for speaker verification
    Vogt, Robbie
    Sridharan, Sridha
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 897 - 900
  • [7] Explicit modelling of session variability for speaker verification
    Vogt, Robbie
    Sridharan, Sridha
    COMPUTER SPEECH AND LANGUAGE, 2008, 22 (01): : 17 - 38
  • [8] Local spectral variability features for speaker verification
    Sahidullah, Md
    Kinnunen, Tomi
    DIGITAL SIGNAL PROCESSING, 2016, 50 : 1 - 11
  • [9] Restoring the Residual Speaker Information in Total Variability Modeling for Speaker Verification
    Zhang, Ce
    Zheng, Rong
    Xu, Bo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 132 - 135
  • [10] Capture interspeaker information with a neural network for speaker identification
    Wang, L
    Chen, K
    Chi, HS
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (02): : 436 - 445