DeCAB: Debiased Semi-supervised Learning for Imbalanced Open-Set Data

被引:0
|
作者
Huang, Xiaolin [1 ]
Li, Mengke [2 ]
Lu, Yang [1 ]
Wang, Hanzi [1 ]
机构
[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Xiamen, Peoples R China
[2] Guangdong Lab Artificial Intelligence & Digital E, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised Learning; Imbalanced Learning; Contrastive Learning;
D O I
10.1007/978-981-99-8546-3_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning (SSL) has received significant attention due to its ability to use limited labeled data and various unlabeled data to train models with high generalization performance. However, the assumption of a balanced class distribution in traditional SSL approaches limits a wide range of real applications, where the training data exhibits long-tailed distributions. As a consequence, the model is biased towards head classes and disregards tail classes, thereby leading to severe class-aware bias. Additionally, since the unlabeled data may contain out-of-distribution (OOD) samples without manual filtering, the model will be inclined to assign OOD samples to non-tail classes with high confidence, which further overwhelms the tail classes. To alleviate this class-aware bias, we propose an end-to-end semi-supervised method Debias Class-Aware Bias (DeCAB). DeCAB introduces positive-pair scores for contrastive learning instead of positive-negative pairs based on unreliable pseudo-labels, avoiding false negative pairs negatively impacts the feature space. At the same time, DeCAB utilizes class-aware thresholds to select more tail samples and selective sample reweighting for feature learning, preventing OOD samples from being misclassified as head classes and accelerating the convergence speed of the model. Experimental results demonstrate that DeCAB is robust in various semi-supervised benchmarks and achieves state-of-the-art performance. Our code is temporarily available at https://github.com/xlhuang132/decab.
引用
收藏
页码:104 / 119
页数:16
相关论文
共 50 条
  • [1] Exploration and Exploitation of Unlabeled Data for Open-Set Semi-supervised Learning
    Zhao, Ganlong
    Li, Guanbin
    Qin, Yipeng
    Zhang, Jinjin
    Chai, Zhenhua
    Wei, Xiaolin
    Lin, Liang
    Yu, Yizhou
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024,
  • [2] Knowledge Distillation Meets Open-Set Semi-supervised Learning
    Yang, Jing
    Zhu, Xiatian
    Bulat, Adrian
    Martinez, Brais
    Tzimiropoulos, Georgios
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024,
  • [3] Mutual Filter Teaching for Open-Set Semi-Supervised Learning
    Li, Xiaokun
    Yi, Rumeng
    Huang, Yaping
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7700 - 7708
  • [4] Open-Set Semi-Supervised Object Detection
    Liu, Yen-Cheng
    Ma, Chih-Yao
    Dai, Xiaoliang
    Tian, Junjiao
    Vajda, Peter
    He, Zijian
    Kira, Zsolt
    [J]. COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 143 - 159
  • [5] OpenMatch: Open-set Consistency Regularization for Semi-supervised Learning with Outliers
    Saito, Kuniaki
    Kim, Donghyun
    Saenko, Kate
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] OSSGAN: Open-Set Semi-Supervised Image Generation
    Katsumata, Kai
    Duc Minh Vo
    Nakayama, Hideki
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11175 - 11183
  • [7] ANEDL: Adaptive Negative Evidential Deep Learning for Open-Set Semi-supervised Learning
    Yu, Yang
    Deng, Danruo
    Liu, Furui
    Dou, Qi
    Jin, Yueming
    Chen, Guangyong
    Heng, Pheng Ann
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16587 - 16595
  • [8] On the link between generative semi-supervised learning and generative open-set recognition
    Engelbrecht, Emile-Reyn
    du Preez, Johan A.
    [J]. SCIENTIFIC AFRICAN, 2023, 22
  • [9] Knowing the unknowns: Network traffic detection with open-set semi-supervised learning
    Chen, Rui
    Luo, Lailong
    Wang, Xiaodong
    Ren, Bangbang
    Guo, Deke
    Zhu, Shi
    [J]. COMPUTER NETWORKS, 2024, 251
  • [10] Open-Set Fault Diagnosis Method for Industrial Process Based on Semi-supervised Learning
    Liu, Jiaren
    Song, Hong
    Wang, Jianguo
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT IV, 2022, 13458 : 103 - 112