Improving Speaker Verification With Noise-Aware Label Ensembling and Sample Selection: Learning and Correcting Noisy Speaker Labels

被引:1
|
作者
Fang, Zhihua [1 ,2 ]
He, Liang [3 ]
Li, Lin [4 ,5 ]
Hu, Ying [1 ,2 ]
机构
[1] Xinjiang Univ, Sch Comp Sci & Technol, Urumqi 830017, Peoples R China
[2] Xinjiang Univ, Xinjiang Key Lab Signal Detect & Proc, Urumqi 830017, Peoples R China
[3] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Dept Elect Engn, Beijing 100084, Peoples R China
[4] Xiamen Univ, XMU Speech Lab, Xiamen 361005, Peoples R China
[5] Xiamen Univ, Sch Elect Sci & Engn, Xiamen 361005, Peoples R China
关键词
Confidence ranking; exponential moving average; label ensembling; noisy labels; speaker verification; RECOGNITION;
D O I
10.1109/TASLP.2024.3407527
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Supervised deep learning has achieved tremendous success in speaker verification. However, deep speaker models tend to overfit noisy labels when they are present in the speaker datasets. To mitigate the detrimental effects of noisy labels, in this paper, we propose a novel Label Ensembling and Sample Selection framework. Firstly, we select labels with high confidence rankings as clean samples. Additionally, we use predictions from different epochs during training to smoothly correct the noisy labels. Our method does not require staged training and achieves integration of learning from noisy labels, selecting clean labels, and correcting noisy labels. A significant number of experimental results demonstrate the robustness of our method under noisy labels. Even when the training data contains 50% noisy labels, our method can mitigate an average of 86.54% of the performance degradation compared to the standard training method. Furthermore, further ablation experiments and analysis validate the effectiveness of High Confidence Ranking for sample selection and the correctness of Label Ensembling for noisy label correction.
引用
收藏
页码:2988 / 3001
页数:14
相关论文
共 5 条
  • [1] Noise-Aware Extended U-Net With Split Encoder and Feature Refinement Module for Robust Speaker Verification in Noisy Environments
    Lim, Chan-Yeong
    Heo, Jungwoo
    Kim, Ju-Ho
    Shin, Hyun-Seo
    Yu, Ha-Jin
    IEEE ACCESS, 2024, 12 : 111673 - 111682
  • [2] Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization
    Wei, Qi
    Sun, Haoliang
    Lu, Xiankai
    Yin, Yilong
    COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 516 - 532
  • [3] On the influence of metric learning loss functions for robust self-supervised speaker verification to label noise
    Fathan, Abderrahim
    Zhu, Xiaolin
    Alam, Jahangir
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1024 - 1031
  • [4] Label-noise learning via uncertainty-aware neighborhood sample selection
    Zhang, Yiliang
    Lu, Yang
    Wang, Hanzi
    PATTERN RECOGNITION LETTERS, 2024, 186 : 191 - 197
  • [5] Meta-learning-based sample discrimination framework for improving dynamic selection of classifiers under label noise
    Xu, Che
    Zhu, Yingming
    Zhu, Peng
    Cui, Longqing
    KNOWLEDGE-BASED SYSTEMS, 2024, 295