Debiased Learning from Naturally Imbalanced Pseudo-Labels

被引:25
|
作者
Wang, Xudong [1 ,2 ]
Wu, Zhirong [3 ]
Lian, Long [1 ,2 ]
Yu, Stella X. [1 ,2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] ICSI, New Delhi, India
[3] Microsoft Res, Redmond, WA USA
关键词
CAUSAL INFERENCE; STATISTICS;
D O I
10.1109/CVPR52688.2022.01424
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pseudo-labels are confident predictions made on unlabeled target data by a classifier trained on labeled source data. They are widely used for adapting a model to unlabeled data, e.g., in a semi-supervised learning setting. Our key insight is that pseudo-labels are naturally imbalanced due to intrinsic data similarity, even when a model is trained on balanced source data and evaluated on balanced target data. If we address this previously unknown imbalanced classification problem arising from pseudo-labels instead of ground-truth training labels, we could remove model biases towards false majorities created by pseudo-labels. We propose a novel and effective debiased learning method with pseudo-labels, based on counterfactual reasoning and adaptive margins: The former removes the classifier response bias, whereas the latter adjusts the margin of each class according to the imbalance of pseudo-labels. Validated by extensive experimentation, our simple debiased learning delivers significant accuracy gains over the state-of-the-art on ImageNet-1K: 26% for semi-supervised learning with 0.2% annotations and 9% for zero-shot learning. Our code is available at: https://github.com/frank-xwang/debiased-pseudo-labeling.
引用
下载
收藏
页码:14627 / 14637
页数:11
相关论文
共 50 条
  • [1] Learning Articulated Shape with Keypoint Pseudo-labels from Web Images
    Stathopoulos, Anastasis
    Pavlakos, Georgios
    Han, Ligong
    Metaxas, Dimitris
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13092 - 13101
  • [2] Efficient Transfer by Robust Label Selection and Learning with Pseudo-Labels
    Huizinga, Wyke
    Kruithof, Maarten
    Burghouts, Gertjan
    Schutte, Klamer
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2660 - 2664
  • [3] Semi-Supervised Learning of Semantic Correspondence with Pseudo-Labels
    Kim, Jiwon
    Ryoo, Kwangrok
    Seo, Junyoung
    Lee, Gyuseong
    Kim, Daehwan
    Cho, Hansang
    Kim, Seungryong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19667 - 19677
  • [4] A Novel Low-Query-Budget Active Learner with Pseudo-Labels for Imbalanced Data
    Tharwat, Alaa
    Schenck, Wolfram
    MATHEMATICS, 2022, 10 (07)
  • [5] Direct Hashing without Pseudo-Labels
    Zheng, Feng
    Huang, Heng
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4539 - 4546
  • [6] PLAD: Learning to Infer Shape Programs with Pseudo-Labels and Approximate Distributions
    Jones, R. Kenny
    Walke, Homer
    Ritchie, Daniel
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9861 - 9870
  • [7] Pseudo Labels for Imbalanced Multi-Label Learning
    Zeng, Wenrong
    Chen, Xuewen
    Cheng, Hong
    2014 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2014, : 25 - 31
  • [8] Learning from pseudo-labels: deep networks improve consistency in longitudinal brain volume estimation
    Zhan, Geng
    Wang, Dongang
    Cabezas, Mariano
    Bai, Lei
    Kyle, Kain
    Ouyang, Wanli
    Barnett, Michael
    Wang, Chenyu
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [9] Learning Consistency From High-Confidence Pseudo-Labels for Weakly Supervised Object Localization
    Sun, Kangbo
    Zhu, Jie
    IEEE ACCESS, 2023, 11 : 16657 - 16666
  • [10] Graph Topology Noise Aware Learning by Feature Clustering and Pseudo-labels Generator
    He, Changqin
    Kou, Guang
    Zhang, Haoyu
    Hu, Zhihui
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,