Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data

Cited by: 12
Authors
Chen, Yuhao [1]
Tan, Xin [2]
Zhao, Borui [1]
Chen, Zhaowei [1]
Song, Renjie [1]
Liang, Jiajun [1]
Lu, Xuequan [3]
Affiliations
[1] MEGVII Technology, Beijing, People's Republic of China
[2] East China Normal University, Shanghai, People's Republic of China
[3] Deakin University, Geelong, Victoria, Australia
DOI
10.1109/CVPR52729.2023.00729
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Semi-supervised learning (SSL) has attracted enormous attention due to its vast potential for mitigating the dependence on large labeled datasets. The latest methods (e.g., FixMatch) combine consistency regularization and pseudo-labeling to achieve remarkable success. However, these methods all waste complicated examples, because every pseudo-label must pass a high confidence threshold that filters out noisy ones. Hence, examples with ambiguous predictions do not contribute to training. To better leverage all unlabeled examples, we propose two novel techniques: Entropy Meaning Loss (EML) and Adaptive Negative Learning (ANL). EML incorporates the prediction distribution of non-target classes into the optimization objective to avoid competition with the target class, thus generating more high-confidence predictions for selecting pseudo-labels. ANL introduces an additional negative pseudo-label for all unlabeled data to leverage low-confidence examples. It adaptively allocates this label by dynamically evaluating the top-k performance of the model. EML and ANL do not introduce any additional parameters or hyperparameters. We integrate these techniques with FixMatch and develop a simple yet powerful framework called FullMatch. Extensive experiments on several common SSL benchmarks (CIFAR-10/100, SVHN, STL-10 and ImageNet) demonstrate that FullMatch exceeds FixMatch by a large margin. Integrated with FlexMatch (an advanced FixMatch-based framework), we achieve state-of-the-art performance. Source code is available at https://github.com/megvii-research/FullMatch.
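The thresholding problem and the negative pseudo-label idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the linked repository for that); it is a minimal illustration under stated assumptions. The function names, the 0.95 threshold, the fixed `k`, and the `-log(1 - p)` loss form are all assumptions chosen for illustration, and the abstract states that the actual method picks `k` adaptively from the model's top-k performance rather than fixing it.

```python
import math


def softmax(logits):
    """Convert raw scores to a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def positive_pseudo_label(probs, threshold=0.95):
    """Return the argmax class if its confidence clears the threshold, else None.

    The threshold value is an assumption for illustration.
    """
    conf = max(probs)
    return probs.index(conf) if conf >= threshold else None


def negative_pseudo_labels(probs, k):
    """Classes ranked outside the top-k, treated as 'not this class' labels.

    Here k is a fixed, hypothetical input; the abstract describes choosing
    it adaptively.
    """
    ranked = sorted(range(len(probs)), key=lambda c: probs[c], reverse=True)
    return sorted(ranked[k:])


def negative_learning_loss(probs, negatives, eps=1e-12):
    """Mean of -log(1 - p_c) over the negative classes (an assumed loss form)."""
    if not negatives:
        return 0.0
    total = sum(math.log(max(1.0 - probs[c], eps)) for c in negatives)
    return -total / len(negatives)


confident = softmax([8.0, 1.0, 0.5, 0.2])   # one class clearly dominates
ambiguous = softmax([2.0, 1.8, 0.1, -1.0])  # two classes compete

print(positive_pseudo_label(confident))      # passes the threshold
print(positive_pseudo_label(ambiguous))      # rejected: no positive label
print(negative_pseudo_labels(ambiguous, 2))  # still yields negative labels
```

In this sketch the ambiguous example is discarded by the confidence threshold, yet it still supplies a training signal through the classes the model ranks lowest, which is the intuition the abstract gives for leveraging low-confidence examples.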
Pages: 7548-7557
Number of pages: 10