UNSUPERVISED DOMAIN ADAPTATION FOR GENDER-AWARE PLDA MIXTURE MODELS

Cited by: 0
Authors
Li, Longxin [1]
Mak, Man-Wai [1]
Affiliations
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China
Keywords
I-vectors; DNN-driven mixture of PLDA; spectral clustering; domain adaptation; speaker verification
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Probabilistic linear discriminant analysis (PLDA) is a state-of-the-art back-end for i-vector based speaker verification. However, this back-end is still problematic when (1) the model is deployed to a new environment (in-domain) that is very different from the training one (out-of-domain) and (2) there are insufficient labeled data from the new environment. To address these problems, this paper proposes using out-of-domain training data to pre-train a PLDA mixture model and applying the mixture model to the in-domain training data to compute a pairwise score matrix for spectral clustering. The hypothesized speaker labels produced by spectral clustering are then used to re-train the mixture model to fit the new environment. To refine the mixture model, the spectral clustering and re-training steps are repeated a number of times. To make the mixture model amenable to both genders, a deep neural network (DNN) is trained to produce gender posteriors given an i-vector. The gender posteriors then replace the posterior probabilities of the indicator variables in the PLDA mixture model. Evaluations based on NIST 2016 SRE suggest that at the end of the iterative re-training, the PLDA mixture model becomes fully adapted to the new domain. Results also show that the PLDA scores can be readily incorporated into spectral clustering, resulting in high-quality speaker clusters that could not possibly be achieved by agglomerative hierarchical clustering.
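The cluster-then-retrain loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: cosine similarity stands in for the PLDA mixture scores, the number of speakers is assumed to be known, and `cosine_scores`, `spectral_cluster`, `iterative_adaptation` and `retrain_fn` are hypothetical names introduced here for illustration only.

```python
import numpy as np

def cosine_scores(X):
    """Stand-in scorer (assumption): cosine similarity, not PLDA mixture scores."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    return Xn @ Xn.T

def spectral_cluster(scores, n_clusters, n_iter=50):
    """Cluster items given a symmetric pairwise score matrix."""
    # Map scores to a non-negative affinity matrix with a zero diagonal.
    A = np.exp(scores - scores.max())
    np.fill_diagonal(A, 0.0)
    # Symmetrically normalized affinity: D^{-1/2} A D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
    M = A * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    # Embed each item using the eigenvectors of the largest eigenvalues.
    _, vecs = np.linalg.eigh(M)
    U = vecs[:, -n_clusters:]
    U = U / np.linalg.norm(U, axis=1, keepdims=True)
    # Deterministic k-means with farthest-point initialization.
    centers = [U[0]]
    for _ in range(1, n_clusters):
        dist = np.min([np.sum((U - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(U[np.argmax(dist)])
    centers = np.array(centers)
    for _ in range(n_iter):
        sq = ((U[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = np.argmin(sq, axis=1)
        for k in range(n_clusters):
            if np.any(labels == k):
                centers[k] = U[labels == k].mean(axis=0)
    return labels

def iterative_adaptation(X, n_speakers, n_rounds, score_fn, retrain_fn):
    """Alternate clustering and re-training; retrain_fn returns a new scorer."""
    labels = None
    for _ in range(n_rounds):
        labels = spectral_cluster(score_fn(X), n_speakers)
        score_fn = retrain_fn(X, labels)  # hypothetical re-training hook
    return labels, score_fn

# Toy data: 3 synthetic "speakers", 4 vectors each, in a 10-dimensional space.
rng = np.random.default_rng(0)
true = np.repeat(np.arange(3), 4)
X = 5.0 * np.eye(3, 10)[true] + 0.1 * rng.standard_normal((12, 10))

# The re-training hook here is a no-op that returns the same stand-in scorer.
labels, _ = iterative_adaptation(
    X, n_speakers=3, n_rounds=2,
    score_fn=cosine_scores,
    retrain_fn=lambda X, labels: cosine_scores,
)
```

In a real system, `score_fn` would return PLDA mixture log-likelihood-ratio scores and `retrain_fn` would re-estimate the mixture model from the hypothesized speaker labels; both are left abstract here because the record gives no implementation details.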
Pages: 5269-5273
Number of pages: 5