Dataset-Invariant Covariance Normalization for Out-domain PLDA Speaker Verification

被引:0
|
作者
Rahman, Md Hafizur [1 ]
Kanagasundaram, Ahilan [1 ]
Dean, David [1 ]
Sridharan, Sridha [1 ]
机构
[1] Queensland Univ Technol, Speech & Audio Res Lab, Brisbane, Qld, Australia
关键词
speaker verification; PLDA; DICN; domain adaptation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we introduce a novel domain-invariant covariance normalization (DICN) technique to relocate both in-domain and out-domain i-vectors into a third dataset-invariant space, providing an improvement for out-domain PLDA speaker verification with a very small number of unlabelled in-domain adaptation i-vectors. By capturing the dataset variance from a global mean using both development out-domain i-vectors and limited unlabelled in-domain i-vectors, we could obtain domain invariant representations of PLDA training data. The DICN-compensated out-domain PLDA system is shown to perform as well as in-domain PLDA training with as few as 500 unlabelled in-domain i-vectors for NIST-2010 SRE and 2000 unlabelled in-domain i-vectors for NIST-2008 SRE, and considerable relative improvement over both out-domain and in-domain PLDA development if more are available.
引用
收藏
页码:1017 / 1021
页数:5
相关论文
共 22 条
  • [1] Domain mismatch modeling of out-domain i-vectors for PLDA speaker verification
    Rahman, Md Hafizur
    Himawan, Ivan
    Dean, David
    Sridharan, Sridha
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1581 - 1585
  • [2] IMPROVING OUT-DOMAIN PLDA SPEAKER VERIFICATION USING UNSUPERVISED INTER-DATASET VARIABILITY COMPENSATION APPROACH
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4654 - 4658
  • [3] Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification
    Cai, Weicheng
    Li, Ming
    Li, Lin
    Hong, Qingyang
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1027 - 1031
  • [4] SNR-Invariant PLDA Modeling for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2317 - 2321
  • [5] DISCRIMINATIVE MULTI-DOMAIN PLDA FOR SPEAKER VERIFICATION
    Sholokhov, Alexey
    Kinnunen, Tomi
    Cumani, Sandro
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5030 - 5034
  • [6] Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
    Wang, Qiongqiong
    Koshinaka, Takafumi
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3727 - 3731
  • [7] NORMALIZATION OF TOTAL VARIABILITY MATRIX FOR I-VECTOR/PLDA SPEAKER VERIFICATION
    Rao, Wei
    Mak, Man-Wai
    Lee, Kong-Aik
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4180 - 4184
  • [8] SNR-Invariant PLDA Modeling in Nonparametric Subspace for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (10) : 1648 - 1659
  • [9] INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION
    Madikeri, Srikanth
    Ferras, Marc
    Motlicek, Petr
    Dey, Subhadeep
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5365 - 5369
  • [10] I-VECTOR KULLBACK-LEIBLER DIVISIVE NORMALIZATION FOR PLDA SPEAKER VERIFICATION
    Pan, Yilin
    Zheng, Tieran
    Chen, Chen
    [J]. 2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 56 - 60