Robust Training for Speaker Verification against Noisy Labels

被引:2
|
作者
Fang, Zhihua [1 ,2 ]
He, Liang [1 ,2 ,3 ]
Ma, Hanhan [1 ,2 ]
Guo, Xiaochen [1 ,2 ]
Li, Lin [4 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830017, Peoples R China
[2] Xinjiang Key Lab Signal Detect & Proc, Urumqi 830017, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[4] Xiamen Univ, Sch Elect Sci & Engn, Xiamen 361005, Peoples R China
来源
基金
国家重点研发计划;
关键词
speaker verification; speaker embedding; noisy label; early learning; curriculum learning;
D O I
10.21437/Interspeech.2023-452
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The deep learning models used for speaker verification rely heavily on large amounts of data and correct labeling. However, noisy (incorrect) labels often occur, which degrades the performance of the system. In this paper, we propose a novel twostage learning method to filter out noisy labels from speaker datasets. Since a DNN will first fit data with clean labels, we first train the model with all data for several epochs. Then, based on this model, the model predictions are compared with the labels using our proposed the OR-Gate with top-k mechanism to select the data with clean labels and the selected data is used to train the model. This process is iterated until the training is completed. We have demonstrated the effectiveness of this method in filtering noisy labels through extensive experiments and have achieved excellent performance on the VoxCeleb (1 and 2) with different added noise rates.
引用
收藏
页码:3192 / 3196
页数:5
相关论文
共 50 条
  • [21] Robust Speaker Recognition in Noisy Conditions by Means of Online Training with Noise Profiles
    Al-Noori, Ahmed H. Y.
    Duncan, Philip
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2019, 67 (04): : 174 - 189
  • [22] DISENTANGLED SPEAKER EMBEDDING FOR ROBUST SPEAKER VERIFICATION
    Yi, Lu
    Mak, Man-Wai
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7662 - 7666
  • [23] Robust speaker recognition in noisy conditions
    Ming, Ji
    Hazen, Timothy J.
    Glass, James R.
    Reynolds, Douglas A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05): : 1711 - 1723
  • [24] Robust Federated Learning with Parameter Classification and Weighted Aggregation against Noisy Labels
    Li, Qun
    Duan, Congying
    Chen, Siguang
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2445 - 2450
  • [25] Robust speaker identification and verification
    Wang, Jia-Ching
    Yang, Chung-Hsien
    Wang, Jhing-Fa
    Lee, Hsiao-Ping
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2007, 2 (02) : 52 - 59
  • [26] FA-ExU-Net: The Simultaneous Training of an Embedding Extractor and Enhancement Model for a Speaker Verification System Robust to Short Noisy Utterances
    Kim, Ju-ho
    Heo, Jungwoo
    Shin, Hyun-seo
    Lim, Chan-yeong
    Yu, Ha-Jin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2269 - 2282
  • [27] Self-Training of Graph Neural Networks Using Similarity Reference for Robust Training with Noisy Labels
    Park, Hyoungseob
    Jeong, Minki
    Kim, Youngeun
    Kim, Changick
    Proceedings - International Conference on Image Processing, ICIP, 2020, 2020-October : 1951 - 1955
  • [28] SELF-TRAINING OF GRAPH NEURAL NETWORKS USING SIMILARITY REFERENCE FOR ROBUST TRAINING WITH NOISY LABELS
    Park, Hyoungseob
    Jeong, Minki
    Kim, Youngeun
    Kim, Changick
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1951 - 1955
  • [29] Instance Discrimination Based Robust Training for Facial Expression Recognition Under Noisy Labels
    Vikas G.N.
    Gera D.
    Balasubramanian S.
    SN Computer Science, 4 (1)
  • [30] Training Robust Object Detectors From Noisy Category Labels and Imprecise Bounding Boxes
    Xu, Youjiang
    Zhu, Linchao
    Yang, Yi
    Wu, Fei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5782 - 5792