Robust Training for Speaker Verification against Noisy Labels

被引:2
|
作者
Fang, Zhihua [1 ,2 ]
He, Liang [1 ,2 ,3 ]
Ma, Hanhan [1 ,2 ]
Guo, Xiaochen [1 ,2 ]
Li, Lin [4 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830017, Peoples R China
[2] Xinjiang Key Lab Signal Detect & Proc, Urumqi 830017, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[4] Xiamen Univ, Sch Elect Sci & Engn, Xiamen 361005, Peoples R China
来源
基金
国家重点研发计划;
关键词
speaker verification; speaker embedding; noisy label; early learning; curriculum learning;
D O I
10.21437/Interspeech.2023-452
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The deep learning models used for speaker verification rely heavily on large amounts of data and correct labeling. However, noisy (incorrect) labels often occur, which degrades the performance of the system. In this paper, we propose a novel twostage learning method to filter out noisy labels from speaker datasets. Since a DNN will first fit data with clean labels, we first train the model with all data for several epochs. Then, based on this model, the model predictions are compared with the labels using our proposed the OR-Gate with top-k mechanism to select the data with clean labels and the selected data is used to train the model. This process is iterated until the training is completed. We have demonstrated the effectiveness of this method in filtering noisy labels through extensive experiments and have achieved excellent performance on the VoxCeleb (1 and 2) with different added noise rates.
引用
收藏
页码:3192 / 3196
页数:5
相关论文
共 50 条
  • [1] BAYESIAN ESTIMATION OF PLDA WITH NOISY TRAINING LABELS, WITH APPLICATIONS TO SPEAKER VERIFICATION
    Borgstrom, Bengt J.
    Torres-Carrasquillo, Pedro
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7594 - 7598
  • [2] Bayesian Estimation of PLDA in the Presence of Noisy Training Labels, with Applications to Speaker Verification
    Borgstrom, Bengt J.
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2022, 30 : 414 - 428
  • [3] Bayesian Estimation of PLDA in the Presence of Noisy Training Labels, With Applications to Speaker Verification
    Borgstrom, Bengt J.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 414 - 428
  • [4] ROBUST FEATURE LEARNING AGAINST NOISY LABELS
    Tai, Tsung-Ming
    Jhang, Yun-Jie
    Hwang, Wen-Jyi
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2235 - 2239
  • [5] Instance-adaptive training with noise-robust losses against noisy labels
    Jin, Lifeng
    Song, Linfeng
    Xu, Kun
    Yu, Dong
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5647 - 5663
  • [6] Robust Speaker Verification Against Additive Noise
    Wang, Ming-He
    Zhang, Er-Hua
    Tang, Zhen-Min
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2019, 35 (02) : 291 - 305
  • [7] Robust discriminative training against data insufficiency in PLDA-based speaker verification
    Rohdin, Johan
    Biswas, Sangeeta
    Shinoda, Koichi
    COMPUTER SPEECH AND LANGUAGE, 2016, 35 : 32 - 57
  • [8] Robust Loss Functions for Training Decision Trees with Noisy Labels
    Wilton, Jonathan
    Ye, Nan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 15859 - 15867
  • [9] Improving Speaker Verification With Noise-Aware Label Ensembling and Sample Selection: Learning and Correcting Noisy Speaker Labels
    Fang, Zhihua
    He, Liang
    Li, Lin
    Hu, Ying
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2988 - 3001
  • [10] Robust Features Fusion for Text Independent Speaker Verification Enhancement in Noisy Environments
    Mohammadi, Mohsen
    Mohammadi, Hamid Reza Sadegh
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1863 - 1868