THE CORAL plus plus ALGORITHM FOR UNSUPERVISED DOMAIN ADAPTATION OF SPEAKER RECOGNITION

被引：8

作者：

Li, Rongjin ^{[1
]}

Zhang, Weibin ^{[1
]}

Chen, Dongpeng ^{[1
]}

机构：

[1] VoiceAI Technol Co Ltd, Shenzhen, Peoples R China

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年

关键词：

Speaker recognition; speaker embedding; domain adaptation; unsupervised learning;

D O I：

10.1109/ICASSP43922.2022.9747792

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

State-of-the-art speaker recognition systems are trained with a large amount of human-labeled training data set. Such a training set is usually composed of various data sources to enhance the modeling capability of models. However, in practical deployment, unseen condition is almost inevitable. Domain mismatch is a common problem in real-life applications due to the statistical difference between the training and testing data sets. To alleviate the degradation caused by domain mismatch, we propose a new feature-based unsupervised domain adaptation algorithm. The algorithm we propose is a further optimization based on the well-known CORrelation ALignment (CORAL), so we call it CORAL++. On the NIST 2019 Speaker Recognition Evaluation (SRE19), we use SRE18 CTS set as the development set to verify the effectiveness of CORAL++. With the typical x-vector/PLDA setup, the CORAL++ outperforms the CORAL by 9.40% relatively on EER.

引用

页码：7172 / 7176

页数：5

共 50 条

[31] Two-Step Unsupervised Speaker Adaptation Based on Speaker and Gender Recognition and HMM Combination
Cerva, Petr
Nouza, Jan
Silovsky, Jan
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2326 - 2329
[32] Unsupervised Domain Adaptation for Skeleton Recognition With Fourier Analysis
Hu, Ruotong
Wang, Xianzhi
Ding, Xiangqian
Zhang, Yongle
Xin, Xiaowei
Pang, Wei
Yu, Shusong
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (24): : 40166 - 40175
[33] Unsupervised Domain Adaptation Dictionary Learning for Visual Recognition
Zhong, Zhun
Li, Zongmin
Li, Runlin
Sun, Xiaoxia
TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING: PAKDD 2018 WORKSHOPS, 2018, 11154 : 16 - 26
[34] Unsupervised Domain Adaptation for Video Transformers in Action Recognition
da Costa, Victor G. Turrisi
Zara, Giacomo
Rota, Paolo
Oliveira-Santos, Thiago
Sebe, Nicu
Murino, Vittorio
Ricci, Elisa
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1258 - 1265
[35] Improving Hazy Image Recognition by Unsupervised Domain Adaptation
Yuan, Zhiyu
Li, Yuhang
Yang, Jianfei
2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 311 - 316
[36] Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
Sohn, Kihyuk
Liu, Sifei
Zhong, Guangyu
Yu, Xiang
Yang, Ming-Hsuan
Chandraker, Manmohan
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5917 - 5925
[37] Unsupervised Domain Adaptation for Human Activity Recognition in Radar
Li, Xinyu
Jing, Xiaojun
He, Yuan
2020 IEEE RADAR CONFERENCE (RADARCONF20), 2020,
[38] SpEx plus : A Complete Time Domain Speaker Extraction Network
Ge, Meng
Xu, Chenglin
Wang, Longbiao
Chng, Eng Siong
Dang, Jianwu
Li, Haizhou
INTERSPEECH 2020, 2020, : 1406 - 1410
[39] SEA plus plus : Multi-Graph-Based Higher-Order Sensor Alignment for Multivariate Time-Series Unsupervised Domain Adaptation
Wang, Yucheng
Xu, Yuecong
Yang, Jianfei
Wu, Min
Li, Xiaoli
Xie, Lihua
Chen, Zhenghua
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10781 - 10796
[40] UNSUPERVISED DOMAIN ADAPTATION OF NEURAL PLDA USING SEGMENT PAIRS FOR SPEAKER VERIFICATION
Ulgen, I. Rasim
Arslan, Levent M.
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 571 - 576

← 1 2 3 4 5 →