Collaborative and adversarial network for text-independent speaker verification in domain adaptation

被引：0

作者：

Qiang, Junhao ^{[1
]}

Yang, Qun ^{[1
]}

Gao, Jie ^{[1
]}

Liu, Shaohan ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China

来源：

ELECTRONICS LETTERS | 2023年 / 59卷 / 02期

关键词：

audio signal processing; speaker recognition;

D O I：

10.1049/ell2.12709

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Speaker verification models have achieved good results on the single genre data. But the performance degrades when model training and testing are not in the same domain. The adversarial training method is proposed to solve this problem by minimizing domain distribution differences. However, the adversarial training ignores domain-specific information for the domain-invariant speaker representations. In this paper, an improved collaborative adversarial network for domain adaptation in speaker verification is performed. Compared to the adversarial training, a collaborative discriminator is newly incorporated that learns domain-specific information at the lower layers. Further, the projection block is added to the collaborative discriminator. It reduces the noise introduced by the collaborative discriminator. Experiments are conducted in different mismatch scenarios and using different speaker encoders. All the experimental results show that the performance of this method is better than the baseline and previous work using adversarial training.

引用

页数：3

共 50 条

[41] Group-based speaker embeddings for text-independent speaker verification
Jung, Youngmoon
Eom, Youngsik
Lee, Yeonghyeon
Kim, Hoirin
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 496 - 502
[42] Text-independent speaker verification using covariance modeling
Zilca, RD
[J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (04) : 97 - 99
[43] Text-independent speaker verification with dynamic trajectory model
Xiang, B
[J]. IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (05) : 141 - 143
[44] Exploration of Local Variability in Text-Independent Speaker Verification
Liping Chen
Kong Aik Lee
Bin Ma
Wu Guo
Haizhou Li
Li-Rong Dai
[J]. Journal of Signal Processing Systems, 2016, 82 : 217 - 228
[45] FACTORED COVARIANCE MODELING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Wang, Eryu
Lee, Kong Aik
Ma, Bin
Li, Haizhou
Guo, Wu
Dai, Lirong
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4856 - 4859
[46] A CORRECTIVE LEARNING APPROACH FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Wen, Yandong
Zhou, Tianyan
Singh, Rita
Raj, Bhiksha
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4894 - 4898
[47] Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification
Zhu, Yingke
Ko, Tom
Snyder, David
Mak, Brian
Povey, Daniel
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3573 - 3577
[48] A joint factor analysis approach to progressive model adaptation in text-independent speaker verification
Yin, Shou-Chun
Rose, Richard
Kenny, Patrick
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 1999 - 2010
[49] Speaker adaptive cohort selection for Tnorm in text-independent speaker verification
Sturim, DE
Reynolds, DA
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 741 - 744
[50] Adversarial Domain Adaptation for Speaker Verification using Partially Shared Network
Chen, Zhengyang
Wang, Shuai
Qian, Yanmin
[J]. INTERSPEECH 2020, 2020, : 3017 - 3021

← 1 2 3 4 5 →