UNSUPERVISED DOMAIN ADAPTATION FOR GENDER-AWARE PLDA MIXTURE MODELS

被引：0

作者：

Li, Longxin ^{[1
]}

Mak, Man-Wai ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

关键词：

I-vectors; DNN-driven mixture of PLDA; spectral clustering; domain adaptation; speaker verification;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Probabilistic linear discriminant analysis (PLDA) is a state-of-art back-end for i-vector based speaker verification. However, this back-end is still problematic when (1) the model is deployed to new environment (in-domain) that is very different from the training one (out-of-domain) and (2) there are insufficient labeled data from the new environment. To address these problems, this paper proposes using out-of-domain training data to pre-train a PLDA mixture model and applying the mixture model on the in-domain training data to compute a pairwise score matrix for spectral clustering. The hypothesized speaker labels produced by spectral clustering are then used for re-training the mixture model to fit the new environment. To refine the mixture model, the spectral clustering and re-training processes are repeated a number of times. To make the mixture model amenable to both genders, a deep neural network (DNN) is trained to produce gender posteriors given an i-vector. The gender posteriors then replace the posterior probabilities of the indicator variables in the PLDA mixture model. Evaluations based on NIST 2016 SRE suggest that at the end of the iterative re-training, the PLDA mixture model becomes fully adapted to the new domain. Results also show that the PLDA scores can be readily incorporated into spectral clustering, resulting in high quality speaker clusters that could not be possibly achieved by agglomerative hierarchical clustering.

引用

页码：5269 / 5273

页数：5

共 50 条

[1] Domain Adaptation of PLDA models in Broadcast Diarization by means of Unsupervised Speaker Clustering
Vinals, Ignacio
Ortega, Alfonso
Villalba, Jesus
Miguel, Antonio
Lleida, Eduardo
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2829 - 2833
[2] Unsupervised adaptation of PLDA models for broadcast diarization
Ignacio Viñals
Alfonso Ortega
Jesús Villalba
Antonio Miguel
Eduardo Lleida
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2019
[3] Unsupervised adaptation of PLDA models for broadcast diarization
Vinals, Ignacio
Ortega, Alfonso
Villalba, Jesus
Miguel, Antonio
Lleida, Eduardo
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (01)
[4] THE CORAL plus ALGORITHM FOR UNSUPERVISED DOMAIN ADAPTATION OF PLDA
Lee, Kong Aik
Wang, Qiongqiong
Koshinaka, Takafumi
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5821 - 5825
[5] Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
Wang, Qiongqiong
Koshinaka, Takafumi
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3727 - 3731
[6] UNSUPERVISED DOMAIN ADAPTATION OF NEURAL PLDA USING SEGMENT PAIRS FOR SPEAKER VERIFICATION
Ulgen, I. Rasim
Arslan, Levent M.
[J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 571 - 576
[7] Unsupervised Bayesian Adaptation of PLDA for Speaker Verification
Borgstrorn, Bengt J.
[J]. INTERSPEECH 2021, 2021, : 1039 - 1043
[8] UNSUPERVISED DOMAIN ADAPTATION WITH COPULA MODELS
Tran, Cuong D.
Rudovic, Ognjen
Pavlovic, Vladimir
[J]. 2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
[9] Gender-aware Re-ranking
Kharitonov, Eugene
Serdyukov, Pavel
[J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1081 - 1082
[10] Discriminative and Geometry-Aware Unsupervised Domain Adaptation
Luo, Lingkun
Chen, Liming
Hu, Shiqiang
Lu, Ying
Wang, Xiaofang
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 3914 - 3927

← 1 2 3 4 5 →