TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES

被引:0
|
作者
Liu, Kai [1 ]
Zhou, Huan [1 ]
机构
[1] Huawei Technol, Artificial Intelligence Applicat Res Ctr, Shenzhen, Peoples R China
关键词
speaker embedding; speaker verification; generative adversarial network;
D O I
10.1109/icassp40776.2020.9054036
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A text-independent speaker verification system suffers severe performance degradation under short utterance condition. To address the problem, in this paper, we propose an adversarially learned embedding mapping model that directly maps a short embedding to an enhanced embedding with increased discriminability. In particular, a Wasserstein GAN with a bunch of loss criteria are investigated. These loss functions have distinct optimization objectives and some of them are less favoured for the speaker verification research area. Different from most prior studies, our main objective in this study is to investigate the effectiveness of those loss criteria by conducting numerous ablation studies. Experiments on Voxceleb dataset showed that some criteria are beneficial to the verification performance while some have trivial effects. Lastly, a Wasserstein GAN with chosen loss criteria, without fine-tuning, achieves meaningful advancements over the baseline, with 4% relative improvements on EER and 7% on minDCF in the challenging scenario of short 2second utterances.
引用
收藏
页码:6569 / 6573
页数:5
相关论文
共 50 条
  • [1] A deep learning approach for text-independent speaker recognition with short utterances
    Rania Chakroun
    Mondher Frikha
    [J]. Multimedia Tools and Applications, 2023, 82 : 33111 - 33133
  • [2] A deep learning approach for text-independent speaker recognition with short utterances
    Chakroun, Rania
    Frikha, Mondher
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (21) : 33111 - 33133
  • [3] End-to-End Text-Independent Speaker Verification with Triplet Loss on Short Utterances
    Zhang, Chunlei
    Koishida, Kazuhito
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1487 - 1491
  • [4] Robust features for text-independent speaker recognition with short utterances
    Rania Chakroun
    Mondher Frikha
    [J]. Neural Computing and Applications, 2020, 32 : 13863 - 13883
  • [5] Robust features for text-independent speaker recognition with short utterances
    Chakroun, Rania
    Frikha, Mondher
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (17): : 13863 - 13883
  • [6] Deep Speaker Feature Learning for Text-independent Speaker Verification
    Li, Lantian
    Chen, Yixiang
    Shi, Zing
    Tang, Zhiyuan
    Wang, Dong
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1542 - 1546
  • [7] Utilizing Tandem Features for Text-Independent Speaker Recognition on Short Utterances
    Alvarez, Arvin Kenneth
    Pelipas, Mary Tricia Ann
    Rayos del Sol, Carl Ivan
    Tomas, John Paul
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND BIG DATA (ICCBD 2019), 2019, : 105 - 110
  • [8] Mixup Learning Strategies for Text-independent Speaker Verification
    Zhu, Yingke
    Ko, Tom
    Mak, Brian
    [J]. INTERSPEECH 2019, 2019, : 4345 - 4349
  • [9] A CORRECTIVE LEARNING APPROACH FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Wen, Yandong
    Zhou, Tianyan
    Singh, Rita
    Raj, Bhiksha
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4894 - 4898
  • [10] Collaborative and adversarial network for text-independent speaker verification in domain adaptation
    Qiang, Junhao
    Yang, Qun
    Gao, Jie
    Liu, Shaohan
    [J]. ELECTRONICS LETTERS, 2023, 59 (02)