Transfer and share: semi-supervised learning from long-tailed data

被引:6
|
作者
Wei, Tong [1 ]
Liu, Qian-Yu [2 ]
Shi, Jiang-Xin [2 ]
Tu, Wei-Wei [3 ]
Guo, Lan-Zhe [2 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Peoples R China
[2] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[3] 4Paradigm Inc, Beijing 100000, Peoples R China
关键词
Long-tailed learning; Semi-supervised learning; Pseudo-label distribution; Logit transformation;
D O I
10.1007/s10994-022-06247-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long-Tailed Semi-Supervised Learning (LTSSL) aims to learn from class-imbalanced data where only a few samples are annotated. Existing solutions typically require substantial cost to solve complex optimization problems, or class-balanced undersampling which can result in information loss. In this paper, we present the TRAS (TRAnsfer and Share) to effectively utilize long-tailed semi-supervised data. TRAS transforms the imbalanced pseudo-label distribution of a traditional SSL model via a delicate function to enhance the supervisory signals for minority classes. It then transfers the distribution to a target model such that the minority class will receive significant attention. Interestingly, TRAS shows that more balanced pseudo-label distribution can substantially benefit minority-class training, instead of seeking to generate accurate pseudo-labels as in previous works. To simplify the approach, TRAS merges the training of the traditional SSL model and the target model into a single procedure by sharing the feature extractor, where both classifiers help improve the representation learning. According to extensive experiments, TRAS delivers much higher accuracy than state-of-the-art methods in the entire set of classes as well as minority classes.
引用
收藏
页码:1725 / 1742
页数:18
相关论文
共 50 条
  • [21] Semi-supervised learning from unbalanced labeled data - An improvement
    Huang, TM
    Kecman, V
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2004, 3215 : 802 - 808
  • [22] Semi-supervised learning from unbalanced labeled data: An improvement
    Huang, Te-Ming
    Kecman, Vojislav
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2006, 10 (01) : 21 - 27
  • [23] Frequency-Aware Self-Supervised Long-Tailed Learning
    Lin, Ci-Siang
    Chen, Min-Hung
    Wang, Yu-Chiang Frank
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 963 - 972
  • [24] Rebalanced supervised contrastive learning with prototypes for long-tailed visual recognition
    Chang, Xuhui
    Zhai, Junhai
    Qiu, Shaoxin
    Sun, Zhengrong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 252
  • [25] Anchored Supervised Contrastive Learning for Long-Tailed Medical Image Regression
    Li, Zhaoying
    Xing, Zhaohu
    Liu, Hongying
    Zhu, Lei
    Wan, Liang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XV, 2025, 15045 : 3 - 18
  • [26] Semi-Supervised Learning with Data Augmentation for Tabular Data
    Fang, Junpeng
    Tang, Caizhi
    Cui, Qing
    Zhu, Feng
    Li, Longfei
    Zhou, Jun
    Zhu, Wei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3928 - 3932
  • [27] Incremental semi-supervised learning on streaming data
    Li, Yanchao
    Wang, Yongli
    Liu, Qi
    Bi, Cheng
    Jiang, Xiaohui
    Sun, Shurong
    PATTERN RECOGNITION, 2019, 88 : 383 - 396
  • [28] A Semi-Supervised Learning Algorithm for Data Classification
    Kuo, Cheng-Chien
    Shieh, Horng-Lin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (05)
  • [29] Self-Supervised Graph Learning for Long-Tailed Cognitive Diagnosis
    Wang, Shanshan
    Zeng, Zhen
    Yang, Xun
    Zhang, Xingyi
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 110 - 118
  • [30] Data heterogeneity consideration in semi-supervised learning
    Araujo, Bilza
    Zhao, Liang
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 45 : 234 - 247