THE CORAL plus plus ALGORITHM FOR UNSUPERVISED DOMAIN ADAPTATION OF SPEAKER RECOGNITION

被引:8
|
作者
Li, Rongjin [1 ]
Zhang, Weibin [1 ]
Chen, Dongpeng [1 ]
机构
[1] VoiceAI Technol Co Ltd, Shenzhen, Peoples R China
关键词
Speaker recognition; speaker embedding; domain adaptation; unsupervised learning;
D O I
10.1109/ICASSP43922.2022.9747792
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
State-of-the-art speaker recognition systems are trained with a large amount of human-labeled training data set. Such a training set is usually composed of various data sources to enhance the modeling capability of models. However, in practical deployment, unseen condition is almost inevitable. Domain mismatch is a common problem in real-life applications due to the statistical difference between the training and testing data sets. To alleviate the degradation caused by domain mismatch, we propose a new feature-based unsupervised domain adaptation algorithm. The algorithm we propose is a further optimization based on the well-known CORrelation ALignment (CORAL), so we call it CORAL++. On the NIST 2019 Speaker Recognition Evaluation (SRE19), we use SRE18 CTS set as the development set to verify the effectiveness of CORAL++. With the typical x-vector/PLDA setup, the CORAL++ outperforms the CORAL by 9.40% relatively on EER.
引用
收藏
页码:7172 / 7176
页数:5
相关论文
共 50 条
  • [31] Two-Step Unsupervised Speaker Adaptation Based on Speaker and Gender Recognition and HMM Combination
    Cerva, Petr
    Nouza, Jan
    Silovsky, Jan
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2326 - 2329
  • [32] Unsupervised Domain Adaptation for Skeleton Recognition With Fourier Analysis
    Hu, Ruotong
    Wang, Xianzhi
    Ding, Xiangqian
    Zhang, Yongle
    Xin, Xiaowei
    Pang, Wei
    Yu, Shusong
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (24): : 40166 - 40175
  • [33] Unsupervised Domain Adaptation Dictionary Learning for Visual Recognition
    Zhong, Zhun
    Li, Zongmin
    Li, Runlin
    Sun, Xiaoxia
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING: PAKDD 2018 WORKSHOPS, 2018, 11154 : 16 - 26
  • [34] Unsupervised Domain Adaptation for Video Transformers in Action Recognition
    da Costa, Victor G. Turrisi
    Zara, Giacomo
    Rota, Paolo
    Oliveira-Santos, Thiago
    Sebe, Nicu
    Murino, Vittorio
    Ricci, Elisa
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1258 - 1265
  • [35] Improving Hazy Image Recognition by Unsupervised Domain Adaptation
    Yuan, Zhiyu
    Li, Yuhang
    Yang, Jianfei
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 311 - 316
  • [36] Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
    Sohn, Kihyuk
    Liu, Sifei
    Zhong, Guangyu
    Yu, Xiang
    Yang, Ming-Hsuan
    Chandraker, Manmohan
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5917 - 5925
  • [37] Unsupervised Domain Adaptation for Human Activity Recognition in Radar
    Li, Xinyu
    Jing, Xiaojun
    He, Yuan
    2020 IEEE RADAR CONFERENCE (RADARCONF20), 2020,
  • [38] SpEx plus : A Complete Time Domain Speaker Extraction Network
    Ge, Meng
    Xu, Chenglin
    Wang, Longbiao
    Chng, Eng Siong
    Dang, Jianwu
    Li, Haizhou
    INTERSPEECH 2020, 2020, : 1406 - 1410
  • [39] SEA plus plus : Multi-Graph-Based Higher-Order Sensor Alignment for Multivariate Time-Series Unsupervised Domain Adaptation
    Wang, Yucheng
    Xu, Yuecong
    Yang, Jianfei
    Wu, Min
    Li, Xiaoli
    Xie, Lihua
    Chen, Zhenghua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10781 - 10796
  • [40] UNSUPERVISED DOMAIN ADAPTATION OF NEURAL PLDA USING SEGMENT PAIRS FOR SPEAKER VERIFICATION
    Ulgen, I. Rasim
    Arslan, Levent M.
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 571 - 576