Collaborative and adversarial network for text-independent speaker verification in domain adaptation

被引:0
|
作者
Qiang, Junhao [1 ]
Yang, Qun [1 ]
Gao, Jie [1 ]
Liu, Shaohan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
关键词
audio signal processing; speaker recognition;
D O I
10.1049/ell2.12709
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speaker verification models have achieved good results on the single genre data. But the performance degrades when model training and testing are not in the same domain. The adversarial training method is proposed to solve this problem by minimizing domain distribution differences. However, the adversarial training ignores domain-specific information for the domain-invariant speaker representations. In this paper, an improved collaborative adversarial network for domain adaptation in speaker verification is performed. Compared to the adversarial training, a collaborative discriminator is newly incorporated that learns domain-specific information at the lower layers. Further, the projection block is added to the collaborative discriminator. It reduces the noise introduced by the collaborative discriminator. Experiments are conducted in different mismatch scenarios and using different speaker encoders. All the experimental results show that the performance of this method is better than the baseline and previous work using adversarial training.
引用
收藏
页数:3
相关论文
共 50 条
  • [41] Group-based speaker embeddings for text-independent speaker verification
    Jung, Youngmoon
    Eom, Youngsik
    Lee, Yeonghyeon
    Kim, Hoirin
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 496 - 502
  • [42] Text-independent speaker verification using covariance modeling
    Zilca, RD
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (04) : 97 - 99
  • [43] Text-independent speaker verification with dynamic trajectory model
    Xiang, B
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (05) : 141 - 143
  • [44] Exploration of Local Variability in Text-Independent Speaker Verification
    Liping Chen
    Kong Aik Lee
    Bin Ma
    Wu Guo
    Haizhou Li
    Li-Rong Dai
    [J]. Journal of Signal Processing Systems, 2016, 82 : 217 - 228
  • [45] FACTORED COVARIANCE MODELING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Wang, Eryu
    Lee, Kong Aik
    Ma, Bin
    Li, Haizhou
    Guo, Wu
    Dai, Lirong
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4856 - 4859
  • [46] A CORRECTIVE LEARNING APPROACH FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Wen, Yandong
    Zhou, Tianyan
    Singh, Rita
    Raj, Bhiksha
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4894 - 4898
  • [47] Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification
    Zhu, Yingke
    Ko, Tom
    Snyder, David
    Mak, Brian
    Povey, Daniel
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3573 - 3577
  • [48] A joint factor analysis approach to progressive model adaptation in text-independent speaker verification
    Yin, Shou-Chun
    Rose, Richard
    Kenny, Patrick
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 1999 - 2010
  • [49] Speaker adaptive cohort selection for Tnorm in text-independent speaker verification
    Sturim, DE
    Reynolds, DA
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 741 - 744
  • [50] Adversarial Domain Adaptation for Speaker Verification using Partially Shared Network
    Chen, Zhengyang
    Wang, Shuai
    Qian, Yanmin
    [J]. INTERSPEECH 2020, 2020, : 3017 - 3021