SELF-SUPERVISED LEARNING BASED DOMAIN ADAPTATION FOR ROBUST SPEAKER VERIFICATION

Cited by: 18
Authors
Chen, Zhengyang [1]
Wang, Shuai [1]
Qian, Yanmin [1]
Affiliations
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, SpeechLab,Dept Comp Sci & Engn, Shanghai, Peoples R China
Keywords
Domain Adaptation; Self-Supervised Learning; Speaker Verification; Contrastive Learning;
DOI
10.1109/ICASSP39728.2021.9414261
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Speaker verification systems often suffer large performance degradation when applied to a dataset from a new domain. Given an unlabeled target-domain dataset, unsupervised domain adaptation (UDA) methods, which usually rely on adversarial training, are commonly used to bridge the performance gap caused by domain mismatch. However, adversarial training uses only the distribution information of the target-domain data and cannot guarantee a performance improvement on the target domain. In this paper, we incorporate a self-supervised learning strategy into the unsupervised domain adaptation system and propose a self-supervised learning based domain adaptation approach (SSDA). Compared to the traditional UDA method, the new SSDA training strategy can fully leverage the potential label information from the target domain while simultaneously adapting the speaker discrimination ability learned on the source domain. We evaluate the proposed approach on the VoxCeleb (labeled source domain) and CnCeleb (unlabeled target domain) datasets. The best SSDA system obtains a 10.2% Equal Error Rate (EER) on CnCeleb without using any CnCeleb speaker labels, which is also a state-of-the-art result on this corpus.
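The abstract reports its headline result as an Equal Error Rate (EER): the operating point at which the false acceptance rate equals the false rejection rate. This record does not say how the authors scored their trials, so the following is only a minimal sketch of the metric itself. The function name `equal_error_rate` and the toy scores are illustrative assumptions, not taken from the paper.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by trying every observed score as a threshold.

    target_scores: similarity scores of same-speaker trials.
    nontarget_scores: similarity scores of different-speaker trials.
    Hypothetical helper for illustration; not the paper's evaluation code.
    """
    if not target_scores or not nontarget_scores:
        raise ValueError("both score lists must be non-empty")
    best_gap, best_eer = None, None
    for threshold in sorted(set(target_scores) | set(nontarget_scores)):
        # False rejection: a same-speaker trial scored below the threshold.
        frr = sum(s < threshold for s in target_scores) / len(target_scores)
        # False acceptance: a different-speaker trial scored at or above it.
        far = sum(s >= threshold for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best_gap is None or gap < best_gap:
            best_gap, best_eer = gap, (far + frr) / 2
    return best_eer


# Toy example with made-up scores (assumption, not data from the paper).
toy_targets = [0.9, 0.8, 0.7, 0.4]
toy_nontargets = [0.6, 0.3, 0.2, 0.1]
toy_eer = equal_error_rate(toy_targets, toy_nontargets)
```

This sweep is quadratic in the number of trials and is only meant to make the definition concrete. A large trial list would normally be scored with a sorted, single-pass computation.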
Pages: 5834-5838
Page count: 5
Related papers
50 items in total
  • [1] ROBUST SPEAKER VERIFICATION WITH JOINT SELF-SUPERVISED AND SUPERVISED LEARNING
    Wang, Kai
    Zhang, Xiaolei
    Zhang, Miao
    Li, Yuguang
    Lee, Jaeyun
    Cho, Kiho
    Park, Sung-UN
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7637 - 7641
  • [2] Self-supervised learning based domain regularization for mask-wearing speaker verification
    Zhang, Ruiteng
    Wei, Jianguo
    Lu, Xugang
    Lu, Wenhuan
    Jin, Di
    Zhang, Lin
    Ji, Yantao
    Xu, Junhai
    SPEECH COMMUNICATION, 2023, 152
  • [3] Robust self-supervised learning for source-free domain adaptation
    Liang Tian
    Lihua Zhou
    Hao Zhang
    Zhenbin Wang
    Mao Ye
    Signal, Image and Video Processing, 2023, 17 : 2405 - 2413
  • [4] Robust self-supervised learning for source-free domain adaptation
    Tian, Liang
    Zhou, Lihua
    Zhang, Hao
    Wang, Zhenbin
    Ye, Mao
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (05) : 2405 - 2413
  • [5] Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
    Wu, Haibin
    Li, Xu
    Liu, Andy T.
    Wu, Zhiyong
    Meng, Helen
    Lee, Hung-Yi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 202 - 217
  • [6] Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-Supervised Speaker Verification
    Mun, Sung Hwan
    Han, Min Hyun
    Lee, Dongjune
    Kim, Jihwan
    Kim, Nam Soo
    IEEE ACCESS, 2021, 9 : 167615 - 167627
  • [7] Barlow Twins self-supervised learning for robust speaker recognition
    Mohammadamini, Mohammad
    Matrouf, Driss
    Bonastre, Jean-Francois
    Dowerah, Sandipana
    Serizel, Romain
    Jouvet, Denis
    INTERSPEECH 2022, 2022, : 4033 - 4037
  • [8] SELF-SUPERVISED SPEAKER VERIFICATION WITH SIMPLE SIAMESE NETWORK AND SELF-SUPERVISED REGULARIZATION
    Sang, Mufan
    Li, Haoqi
    Liu, Fang
    Arnold, Andrew O.
    Wan, Li
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6127 - 6131
  • [9] Self-Supervised Learning for Domain Adaptation on Point Clouds
    Achituve, Idan
    Maron, Haggai
    Chechik, Gal
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 123 - 133
  • [10] Prototype Division for Self-Supervised Speaker Verification
    Zhao, Zhenduo
    Li, Zhuo
    Zhang, Xueshuai
    Wang, Wenchao
    Zhang, Pengyuan
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 880 - 884