A post-processing framework for class-imbalanced learning in a transductive setting

被引:0
|
作者
Jiang, Zhen [1 ]
Lu, Yu [1 ]
Zhao, Lingyun [1 ]
Zhan, Yongzhao [1 ]
Mao, Qirong [1 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Commun Engn, 301 Xuefu Rd, Zhenjiang 212013, Peoples R China
基金
中国国家自然科学基金;
关键词
Class-imbalanced learning; Post-processing; Class proportion; Compact prototype; ENSEMBLES; DATASETS; SVM;
D O I
10.1016/j.eswa.2024.123832
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional classification tasks suffer from the class-imbalanced problem, where some classes far outnumber others. To address this issue, existing class-imbalanced learning (CIL) methods either preprocess classimbalanced datasets or adapt traditional classification algorithms to the imbalanced class distribution. Inspired by the idea of transductive learning, we propose a post -processing framework called PPF for CIL. Distinct from existing CIL methods, PPF directly adjusts the predicted labels of test data to fit the imbalanced class distribution. Specifically, we relabel some test data according to their prediction probabilities so that the class proportion of test data is close to that of training data. The underlying assumption is that training and test data, drawn independently from one data space, should obey the same class distribution. Furthermore, we propose a Compact Prototype -based Nearest Neighbor (CPNN) algorithm to assist the original classifier with the adjustment. Instead of training a classifier, CPNN classifies test data according to their distances to a set of prototypes estimated on labeled data. Thus, it is computationally simple and relatively robust to class imbalance. As a general framework, PPF can be easily applied to both traditional classification and CIL algorithms. To validate the effectiveness of the proposed method, we conducted extensive experiments on a variety of classimbalanced datasets, using SVM and C4.5 as the original classifiers, respectively. Measured by F -measure, Gmean, and AUC, both PPF-SVM and PPF-C4.5 outperform 10 state-of-the-art CIL algorithms. Additionally, PPF further improved their performances when applied to 10 CIL algorithms.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] A Diversity-Based Method for Class-Imbalanced Cost-Sensitive Learning
    Dong, Shangyan
    Wu, Yongcheng
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2018), 2018, : 51 - 55
  • [42] An Integrated Class-Imbalanced Learning Scheme for Diagnosing Bearing Defects in Induction Motors
    Razavi-Far, Roozbeh
    Farajzadeh-Zanjani, Maryam
    Saif, Mehrdad
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (06) : 2758 - 2769
  • [43] A novel oversampling technique for class-imbalanced learning based on SMOTE and natural neighbors
    Li, Junnan
    Zhu, Qingsheng
    Wu, Quanwang
    Fan, Zhu
    INFORMATION SCIENCES, 2021, 565 : 438 - 455
  • [44] Performance of Machine Learning Algorithms for Class-Imbalanced Process Fault Detection Problems
    Lee, Taehyung
    Lee, Ki Bum
    Kim, Chang Ouk
    IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2016, 29 (04) : 436 - 445
  • [45] ABAE: Auxiliary Balanced AutoEncoder for class-imbalanced semi-supervised learning
    Tang, Qianying
    Wei, Xiang
    Su, Qi
    Zhang, Shunli
    PATTERN RECOGNITION LETTERS, 2024, 182 : 118 - 124
  • [46] OCI-SSL: Open Class-Imbalanced Semi-Supervised Learning With Contrastive Learning
    Zhou, Yuting
    Gao, Can
    Zhou, Jie
    Ding, Weiping
    Shen, Linlin
    Lai, Zhihui
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (06): : 1 - 14
  • [47] A generic post-processing framework for image dehazing
    Kumar, Balla Pavan
    Kumar, Arvind
    Pandey, Rajoo
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (06) : 3183 - 3191
  • [48] A generic post-processing framework for image dehazing
    Balla Pavan Kumar
    Arvind Kumar
    Rajoo Pandey
    Signal, Image and Video Processing, 2023, 17 : 3183 - 3191
  • [49] An Effective Intrusion Detection Model for Class-imbalanced Learning Based on SMOTE and Attention Mechanism
    Jiao, Xubin
    Li, Jinguo
    2021 18TH INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY AND TRUST (PST), 2021,
  • [50] 3LPR: A three-stage label propagation and reassignment framework for class-imbalanced semi-supervised learning
    Kong, Xiangyuan
    Wei, Xiang
    Liu, Xiaoyu
    Wang, Jingjie
    Lu, Siyang
    Xing, Weiwei
    Lu, Wei
    KNOWLEDGE-BASED SYSTEMS, 2022, 253