A GENERALIZED FRAMEWORK FOR DOMAIN ADAPTATION OF PLDA IN SPEAKER RECOGNITION

被引:0
|
作者
Wang, Qiongqiong [1 ]
Okabe, Koji [1 ]
Lee, Kong Aik [1 ]
Koshinaka, Takafumi [1 ]
机构
[1] NEC Corp Ltd, Biometr Res Labs, Tokyo, Japan
关键词
Speak verification; domain adaptation; correlation alignment; regularization; generalized framework;
D O I
10.1109/icassp40776.2020.9054113
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a generalized framework for domain adaptation of Probabilistic Linear Discriminant Analysis (PLDA) in speaker recognition. It not only includes several existing supervised and unsupervised domain adaptation methods but also makes possible more flexible usage of available data in different domains. In particular, we introduce here the two new techniques described below. (1) Correlation-alignment-based interpolation and (2) covariance regularization. The proposed correlation-alignment-based-interpolation method decreases minC(primary) up to 30.5% as compared with that from an out-of-domain PLDA model before adaptation, and minC(primary) is also 5.5% lower than with a conventional linear interpolation method with optimal interpolation weights. Further, the proposed regularization technique ensures robustness in interpolations w.r.t. varying interpolation weights, which in practice is essential.
引用
收藏
页码:6619 / 6623
页数:5
相关论文
共 50 条
  • [1] Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition
    Wang, Qiongqiong
    Okabe, Koji
    Lee, Kong Aik
    Koshinaka, Takafumi
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 3936 - 3947
  • [2] Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
    Wang, Qiongqiong
    Koshinaka, Takafumi
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3727 - 3731
  • [3] Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition
    McCree, Alan
    Sell, Gregory
    Garcia-Romero, Daniel
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1552 - 1556
  • [4] Iterative PLDA Adaptation for Speaker Diarization
    Le Lan, Gael
    Charlet, Delphine
    Larcher, Anthony
    Meignier, Sylvain
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2175 - 2179
  • [5] UNSUPERVISED DOMAIN ADAPTATION OF NEURAL PLDA USING SEGMENT PAIRS FOR SPEAKER VERIFICATION
    Ulgen, I. Rasim
    Arslan, Levent M.
    [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 571 - 576
  • [6] Domain Adaptation of PLDA models in Broadcast Diarization by means of Unsupervised Speaker Clustering
    Vinals, Ignacio
    Ortega, Alfonso
    Villalba, Jesus
    Miguel, Antonio
    Lleida, Eduardo
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2829 - 2833
  • [7] Unsupervised Bayesian Adaptation of PLDA for Speaker Verification
    Borgstrorn, Bengt J.
    [J]. INTERSPEECH 2021, 2021, : 1039 - 1043
  • [8] On robustness of unsupervised domain adaptation for speaker recognition
    Bousquet, Pierre-Michel
    Rouvier, Mickael
    [J]. INTERSPEECH 2019, 2019, : 2958 - 2962
  • [9] DOMAIN AND SPEAKER ADAPTATION FOR CORTANA SPEECH RECOGNITION
    Zhao, Yong
    Li, Jinyu
    Zhang, Shixiong
    Chen, Liping
    Gong, Yifan
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5984 - 5988
  • [10] On Behaviour of PLDA Models in the Task of Speaker Recognition
    Machlica, Lukas
    Radova, Vlasta
    [J]. TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 352 - 359