Robust Training for Speaker Verification against Noisy Labels

Cited by: 2
Authors
Fang, Zhihua [1, 2]
He, Liang [1, 2, 3]
Ma, Hanhan [1, 2]
Guo, Xiaochen [1, 2]
Li, Lin [4]
Affiliations
[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830017, Peoples R China
[2] Xinjiang Key Lab Signal Detect & Proc, Urumqi 830017, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[4] Xiamen Univ, Sch Elect Sci & Engn, Xiamen 361005, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
speaker verification; speaker embedding; noisy label; early learning; curriculum learning;
DOI
10.21437/Interspeech.2023-452
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
Deep learning models for speaker verification rely heavily on large amounts of correctly labeled data. However, noisy (incorrect) labels often occur and degrade system performance. In this paper, we propose a novel two-stage learning method to filter out noisy labels from speaker datasets. Since a DNN fits data with clean labels first, we begin by training the model on all data for several epochs. Then the predictions of this model are compared with the labels, using our proposed OR-Gate with top-k mechanism, to select the data with clean labels, and the selected data are used to train the model. This process is iterated until training is complete. Extensive experiments demonstrate the effectiveness of this method in filtering noisy labels, and it achieves excellent performance on VoxCeleb (1 and 2) with different added noise rates.
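The selection step described in the abstract might be sketched roughly as follows. This is an illustrative reading only, not the authors' implementation: the abstract does not define the OR-Gate, so the sketch assumes a sample is kept as "clean" when its label matches any one of the model's top-k predicted speakers. The function names `top_k_indices` and `select_clean`, the value of `k`, and the toy scores are all hypothetical.

```python
from typing import List, Sequence


def top_k_indices(scores: Sequence[float], k: int) -> List[int]:
    """Return the indices of the k largest scores, best first."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:k]


def select_clean(
    scores_per_sample: Sequence[Sequence[float]],
    labels: Sequence[int],
    k: int = 2,
) -> List[int]:
    """Return indices of samples whose label matches ANY top-k prediction.

    Assumption: the "OR" is taken over the k label-vs-prediction comparisons,
    so a single match is enough to keep the sample.
    """
    kept = []
    for i, (scores, label) in enumerate(zip(scores_per_sample, labels)):
        if any(label == pred for pred in top_k_indices(scores, k)):
            kept.append(i)
    return kept


# Toy posteriors for 4 utterances over 3 speakers (made-up numbers).
scores = [
    [0.7, 0.2, 0.1],  # label 0: the top-1 prediction agrees
    [0.1, 0.5, 0.4],  # label 2: only the second-best prediction agrees
    [0.6, 0.3, 0.1],  # label 2: no top-2 prediction agrees, treated as noisy
    [0.2, 0.1, 0.7],  # label 2: the top-1 prediction agrees
]
labels = [0, 2, 2, 2]

clean_idx = select_clean(scores, labels, k=2)
```

In an iterated scheme like the one the abstract outlines, the indices in `clean_idx` would define the subset used for the next round of training, after which the selection would be repeated with the updated model.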
Pages: 3192-3196
Page count: 5