A GENERALIZED FRAMEWORK FOR DOMAIN ADAPTATION OF PLDA IN SPEAKER RECOGNITION

被引：0

作者：

Wang, Qiongqiong ^{[1
]}

Okabe, Koji ^{[1
]}

Lee, Kong Aik ^{[1
]}

Koshinaka, Takafumi ^{[1
]}

机构：

[1] NEC Corp Ltd, Biometr Res Labs, Tokyo, Japan

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年

关键词：

Speak verification; domain adaptation; correlation alignment; regularization; generalized framework;

D O I：

10.1109/icassp40776.2020.9054113

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper proposes a generalized framework for domain adaptation of Probabilistic Linear Discriminant Analysis (PLDA) in speaker recognition. It not only includes several existing supervised and unsupervised domain adaptation methods but also makes possible more flexible usage of available data in different domains. In particular, we introduce here the two new techniques described below. (1) Correlation-alignment-based interpolation and (2) covariance regularization. The proposed correlation-alignment-based-interpolation method decreases minC(primary) up to 30.5% as compared with that from an out-of-domain PLDA model before adaptation, and minC(primary) is also 5.5% lower than with a conventional linear interpolation method with optimal interpolation weights. Further, the proposed regularization technique ensures robustness in interpolations w.r.t. varying interpolation weights, which in practice is essential.

引用

页码：6619 / 6623

页数：5

共 50 条

[1] Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition
Wang, Qiongqiong
Okabe, Koji
Lee, Kong Aik
Koshinaka, Takafumi
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 3936 - 3947
[2] Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
Wang, Qiongqiong
Koshinaka, Takafumi
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3727 - 3731
[3] Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition
McCree, Alan
Sell, Gregory
Garcia-Romero, Daniel
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1552 - 1556
[4] Iterative PLDA Adaptation for Speaker Diarization
Le Lan, Gael
Charlet, Delphine
Larcher, Anthony
Meignier, Sylvain
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2175 - 2179
[5] UNSUPERVISED DOMAIN ADAPTATION OF NEURAL PLDA USING SEGMENT PAIRS FOR SPEAKER VERIFICATION
Ulgen, I. Rasim
Arslan, Levent M.
[J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 571 - 576
[6] Domain Adaptation of PLDA models in Broadcast Diarization by means of Unsupervised Speaker Clustering
Vinals, Ignacio
Ortega, Alfonso
Villalba, Jesus
Miguel, Antonio
Lleida, Eduardo
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2829 - 2833
[7] Unsupervised Bayesian Adaptation of PLDA for Speaker Verification
Borgstrorn, Bengt J.
[J]. INTERSPEECH 2021, 2021, : 1039 - 1043
[8] On robustness of unsupervised domain adaptation for speaker recognition
Bousquet, Pierre-Michel
Rouvier, Mickael
[J]. INTERSPEECH 2019, 2019, : 2958 - 2962
[9] DOMAIN AND SPEAKER ADAPTATION FOR CORTANA SPEECH RECOGNITION
Zhao, Yong
Li, Jinyu
Zhang, Shixiong
Chen, Liping
Gong, Yifan
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5984 - 5988
[10] On Behaviour of PLDA Models in the Task of Speaker Recognition
Machlica, Lukas
Radova, Vlasta
[J]. TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 352 - 359

← 1 2 3 4 5 →