IMPROVED NOISY ITERATIVE PSEUDO-LABELING FOR SEMI-SUPERVISED SPEECH RECOGNITION

被引:1
|
作者
Li, Tian [1 ]
Meng, Qingliang [1 ]
Sun, Yujian [1 ]
机构
[1] Shumei AI Res Inst, Beijing, Peoples R China
关键词
pseudo-labeling; semi-supervised learning; end-to-end speech recognition; deep learning;
D O I
10.1109/SLT54892.2023.10022417
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the high annotation cost in ASR, the implementation of semi-supervised training has been a hot issue in research and industry. In a multitude of recent investigations, it has been established that pseudo-labeling, a fundamental sub-direction of semi-supervised learning, is effective in ASR. However, if the iterative PL is utilized, the expense of doing data experiments is prohibitively high, making the promotion to diverse situations of ASR tasks problematic. In this paper, we propose an empirical scoring method based on hypothesis distribution testing to guide iterative PL training, therefore lowering the cost of data experiments and boosting ASR performance. Meanwhile, we conducted extensive experiments to determine the necessity and limitation of model perturbation in the initial training and the PL stages. On the Librispeech 100/860 task, our method improves the 12+6 transformer-based CTC+S2S architecture performance from 4.8%/10.1% to 3.9%/9.6% on test-clean and test-other.
引用
收藏
页码:167 / 173
页数:7
相关论文
共 50 条
  • [41] PLBR: A Semi-Supervised Document Key Information Extraction via Pseudo-Labeling Bias Rectification
    Guo, Pengcheng
    Song, Yonghong
    Wang, Boyu
    Liu, Jiaohao
    Zhang, Qi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 9025 - 9036
  • [42] Better Pseudo-Labeling for Semi-Supervised Domain Generalization in Medical Magnetic Resonance Image Segmentation
    Hu, Liangqing
    Meng, Zuqiang
    Tan, Chaohong
    Zhou, Yumin
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2025, 18 (01)
  • [43] Class-Distribution-Aware Pseudo-Labeling for Semi-Supervised Multi-Label Learning
    Xie, Ming-Kun
    Xiao, Jia-Hao
    Liu, Hao-Zhe
    Niu, Gang
    Sugiyama, Masashi
    Huang, Sheng-Jun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [44] SelectiveKD: A Semi-supervised Framework for Cancer Detection in DBT Through Knowledge Distillation and Pseudo-labeling
    Dillard, Laurent
    Lee, Hyeonsoo
    Lee, Weonsuk
    Kim, Tae Soo
    Diba, Ali
    Kooi, Thijs
    CANCER PREVENTION, DETECTION, AND INTERVENTION, CAPTION 2024, 2025, 15199 : 154 - 163
  • [45] Noisy-Consistent Pseudo Labeling Model for Semi-supervised Skin Lesion Classification
    Zhu, Qi
    Li, Sen
    Li, Zhantao
    Min, Xianjun
    Li, Qian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2023 WORKSHOPS, 2023, 14394 : 241 - 252
  • [46] SPLAL: Similarity-based pseudo-labeling with alignment loss for semi-supervised medical image classification
    Mahmood, Md Junaid
    Raj, Pranaw
    Agarwal, Divyansh
    Kumari, Suruchi
    Singh, Pravendra
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [47] Improving semi-supervised remote sensing scene classification via Multilevel Feature Fusion and pseudo-labeling
    Feng, Jiangfan
    Luo, Hongxin
    Gu, Zhujun
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2025, 136
  • [48] A semi-supervised medical image classification method based on combined pseudo-labeling and distance metric consistency
    Boya Ke
    Huijuan Lu
    Cunqian You
    Wenjie Zhu
    Li Xie
    Yudong Yao
    Multimedia Tools and Applications, 2024, 83 : 33313 - 33331
  • [49] Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification
    Zhang, Xuerong
    Huang, Li
    Lv, Jing
    Yang, Ming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 61 - 75
  • [50] Semi-supervised learning with pseudo-labeling compares favorably with large language models for regulatory sequence prediction
    Phan, Han
    Brouard, Celine
    Mourad, Raphael
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)