Robust Training for Speaker Verification against Noisy Labels

被引:2
|
作者
Fang, Zhihua [1 ,2 ]
He, Liang [1 ,2 ,3 ]
Ma, Hanhan [1 ,2 ]
Guo, Xiaochen [1 ,2 ]
Li, Lin [4 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830017, Peoples R China
[2] Xinjiang Key Lab Signal Detect & Proc, Urumqi 830017, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[4] Xiamen Univ, Sch Elect Sci & Engn, Xiamen 361005, Peoples R China
来源
基金
国家重点研发计划;
关键词
speaker verification; speaker embedding; noisy label; early learning; curriculum learning;
D O I
10.21437/Interspeech.2023-452
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The deep learning models used for speaker verification rely heavily on large amounts of data and correct labeling. However, noisy (incorrect) labels often occur, which degrades the performance of the system. In this paper, we propose a novel twostage learning method to filter out noisy labels from speaker datasets. Since a DNN will first fit data with clean labels, we first train the model with all data for several epochs. Then, based on this model, the model predictions are compared with the labels using our proposed the OR-Gate with top-k mechanism to select the data with clean labels and the selected data is used to train the model. This process is iterated until the training is completed. We have demonstrated the effectiveness of this method in filtering noisy labels through extensive experiments and have achieved excellent performance on the VoxCeleb (1 and 2) with different added noise rates.
引用
收藏
页码:3192 / 3196
页数:5
相关论文
共 50 条
  • [31] Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy Labels
    Han, Bo
    Yao, Quanming
    Yu, Xingrui
    Niu, Gang
    Xu, Miao
    Hu, Weihua
    Tsang, Ivor W.
    Sugiyama, Masashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [32] Automatic Error Correction for Speaker Embedding Learning with Noisy Labels
    Tong, Fuchuan
    Liu, Yan
    Li, Song
    Wang, Jie
    Li, Lin
    Hong, Qingyang
    INTERSPEECH 2021, 2021, : 4628 - 4632
  • [33] TOWARDS ROBUST SPEAKER VERIFICATION WITH TARGET SPEAKER ENHANCEMENT
    Zhang, Chunlei
    Yu, Meng
    Weng, Chao
    Yu, Dong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6693 - 6697
  • [34] Penalty based robust learning with noisy labels
    Kong, Kyeongbo
    Lee, Junggi
    Kwak, Youngchul
    Cho, Young-Rae
    Kim, Seong-Eun
    Song, Woo-Jin
    NEUROCOMPUTING, 2022, 489 : 112 - 127
  • [35] Learning with noisy labels for robust fatigue detection
    Wang, Mei
    Hu, Ruimin
    Zhu, Xiaojie
    Zhu, Dongliang
    Wang, Xiaochen
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [36] Learning to rectify for robust learning with noisy labels
    Sun, Haoliang
    Guo, Chenhui
    Wei, Qi
    Han, Zhongyi
    Yin, Yilong
    PATTERN RECOGNITION, 2022, 124
  • [37] Robust Speaker Identification in Noisy and Reverberant Conditions
    Zhao, Xiaojia
    Wang, Yuxuan
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 836 - 845
  • [38] ROBUST SPEAKER IDENTIFICATION IN NOISY AND REVERBERANT CONDITIONS
    Zhao, Xiaojia
    Wang, Yuxuan
    Wang, DeLiang
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [39] Robust model for speaker verification against session-dependent utterance variation
    Matsui, T
    Aikawa, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 117 - 120
  • [40] Robust model for speaker verification against session-dependent utterance variation
    Matsui, T
    Aikawa, K
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (04): : 712 - 718