THE CORAL plus plus ALGORITHM FOR UNSUPERVISED DOMAIN ADAPTATION OF SPEAKER RECOGNITION

被引:8
|
作者
Li, Rongjin [1 ]
Zhang, Weibin [1 ]
Chen, Dongpeng [1 ]
机构
[1] VoiceAI Technol Co Ltd, Shenzhen, Peoples R China
关键词
Speaker recognition; speaker embedding; domain adaptation; unsupervised learning;
D O I
10.1109/ICASSP43922.2022.9747792
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
State-of-the-art speaker recognition systems are trained with a large amount of human-labeled training data set. Such a training set is usually composed of various data sources to enhance the modeling capability of models. However, in practical deployment, unseen condition is almost inevitable. Domain mismatch is a common problem in real-life applications due to the statistical difference between the training and testing data sets. To alleviate the degradation caused by domain mismatch, we propose a new feature-based unsupervised domain adaptation algorithm. The algorithm we propose is a further optimization based on the well-known CORrelation ALignment (CORAL), so we call it CORAL++. On the NIST 2019 Speaker Recognition Evaluation (SRE19), we use SRE18 CTS set as the development set to verify the effectiveness of CORAL++. With the typical x-vector/PLDA setup, the CORAL++ outperforms the CORAL by 9.40% relatively on EER.
引用
收藏
页码:7172 / 7176
页数:5
相关论文
共 50 条
  • [1] THE CORAL plus ALGORITHM FOR UNSUPERVISED DOMAIN ADAPTATION OF PLDA
    Lee, Kong Aik
    Wang, Qiongqiong
    Koshinaka, Takafumi
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5821 - 5825
  • [2] On robustness of unsupervised domain adaptation for speaker recognition
    Bousquet, Pierre-Michel
    Rouvier, Mickael
    INTERSPEECH 2019, 2019, : 2958 - 2962
  • [3] UNSUPERVISED DOMAIN ADAPTATION VIA DOMAIN ADVERSARIAL TRAINING FOR SPEAKER RECOGNITION
    Wang, Qing
    Rao, Wei
    Sun, Sining
    Xie, Lei
    Chng, Eng Siong
    Li, Haizhou
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4889 - 4893
  • [4] Crowd plus plus : Unsupervised Speaker Count with Smartphones
    Xu, Chenren
    Li, Sugang
    Liu, Gang
    Zhang, Yanyong
    Miluzzo, Emiliano
    Chen, Yih-Farn
    Li, Jun
    Firner, Bernhard
    UBICOMP'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, 2013, : 43 - 52
  • [5] A Simple Unsupervised Knowledge-Free Domain Adaptation for Speaker Recognition
    Lin, Wan
    Li, Lantian
    Wang, Dong
    APPLIED SCIENCES-BASEL, 2024, 14 (03):
  • [6] A fast algorithm for unsupervised incremental speaker adaptation
    Schussler, M
    Gallwitz, F
    Harbeck, S
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1019 - 1022
  • [7] DOMAIN AND SPEAKER ADAPTATION FOR CORTANA SPEECH RECOGNITION
    Zhao, Yong
    Li, Jinyu
    Zhang, Shixiong
    Chen, Liping
    Gong, Yifan
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5984 - 5988
  • [8] Unsupervised Domain Adaptation Learning Algorithm for RGB-D Stairway Recognition
    Jing WANG
    Kuangen ZHANG
    Instrumentation, 2019, 6 (02) : 21 - 29
  • [9] Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
    Wang, Qiongqiong
    Koshinaka, Takafumi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3727 - 3731
  • [10] EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification
    Li, Jingyu
    Liu, Wei
    Lee, Tan
    INTERSPEECH 2022, 2022, : 3694 - 3698