A GENERALIZED FRAMEWORK FOR DOMAIN ADAPTATION OF PLDA IN SPEAKER RECOGNITION

Cited by: 0
Authors
Wang, Qiongqiong [1]
Okabe, Koji [1]
Lee, Kong Aik [1]
Koshinaka, Takafumi [1]
Affiliations
[1] NEC Corp Ltd, Biometr Res Labs, Tokyo, Japan
Keywords
Speaker verification; domain adaptation; correlation alignment; regularization; generalized framework
DOI
10.1109/icassp40776.2020.9054113
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper proposes a generalized framework for domain adaptation of Probabilistic Linear Discriminant Analysis (PLDA) in speaker recognition. The framework not only includes several existing supervised and unsupervised domain adaptation methods but also allows more flexible use of the data available in different domains. In particular, we introduce two new techniques: (1) correlation-alignment-based interpolation and (2) covariance regularization. The proposed correlation-alignment-based interpolation method reduces minC(primary) by up to 30.5% compared with an out-of-domain PLDA model before adaptation, and its minC(primary) is also 5.5% lower than that of a conventional linear interpolation method with optimal interpolation weights. Further, the proposed regularization technique makes the interpolation robust to varying interpolation weights, which is essential in practice.
Pages: 6619-6623
Number of pages: 5