A post-processing framework for class-imbalanced learning in a transductive setting

被引:0
|
作者
Jiang, Zhen [1 ]
Lu, Yu [1 ]
Zhao, Lingyun [1 ]
Zhan, Yongzhao [1 ]
Mao, Qirong [1 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Commun Engn, 301 Xuefu Rd, Zhenjiang 212013, Peoples R China
基金
中国国家自然科学基金;
关键词
Class-imbalanced learning; Post-processing; Class proportion; Compact prototype; ENSEMBLES; DATASETS; SVM;
D O I
10.1016/j.eswa.2024.123832
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional classification tasks suffer from the class-imbalanced problem, where some classes far outnumber others. To address this issue, existing class-imbalanced learning (CIL) methods either preprocess classimbalanced datasets or adapt traditional classification algorithms to the imbalanced class distribution. Inspired by the idea of transductive learning, we propose a post -processing framework called PPF for CIL. Distinct from existing CIL methods, PPF directly adjusts the predicted labels of test data to fit the imbalanced class distribution. Specifically, we relabel some test data according to their prediction probabilities so that the class proportion of test data is close to that of training data. The underlying assumption is that training and test data, drawn independently from one data space, should obey the same class distribution. Furthermore, we propose a Compact Prototype -based Nearest Neighbor (CPNN) algorithm to assist the original classifier with the adjustment. Instead of training a classifier, CPNN classifies test data according to their distances to a set of prototypes estimated on labeled data. Thus, it is computationally simple and relatively robust to class imbalance. As a general framework, PPF can be easily applied to both traditional classification and CIL algorithms. To validate the effectiveness of the proposed method, we conducted extensive experiments on a variety of classimbalanced datasets, using SVM and C4.5 as the original classifiers, respectively. Measured by F -measure, Gmean, and AUC, both PPF-SVM and PPF-C4.5 outperform 10 state-of-the-art CIL algorithms. Additionally, PPF further improved their performances when applied to 10 CIL algorithms.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] A genetic algorithm-based approach for class-imbalanced learning
    Dong, Shangyan
    Wu, Yongcheng
    THIRD INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2018, 10828
  • [22] A deep multimodal generative and fusion framework for class-imbalanced multimodal data
    Qing Li
    Guanyuan Yu
    Jun Wang
    Yuehao Liu
    Multimedia Tools and Applications, 2020, 79 : 25023 - 25050
  • [23] Learning sample representativeness for class-imbalanced multi-label classification
    Zhang, Yu
    Cao, Sichen
    Mi, Siya
    Bian, Yali
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (02)
  • [24] On Supervised Class-Imbalanced Learning: An Updated Perspective and Some Key Challenges
    Das S.
    Mullick S.S.
    Zelinka I.
    IEEE Transactions on Artificial Intelligence, 2022, 3 (06): : 973 - 993
  • [25] Weed recognition using deep learning techniques on class-imbalanced imagery
    Hasan, A. S. M. Mahmudul
    Sohel, Ferdous
    Diepeveen, Dean
    Laga, Hamid
    Jones, Michael G. K.
    CROP & PASTURE SCIENCE, 2023, 74 (06): : 628 - 644
  • [26] Multitask Semi-Supervised Learning for Class-Imbalanced Discourse Classification
    Spangher, Alexander
    May, Jonathan
    Shiang, Sz-rung
    Deng, Lingjia
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 498 - 517
  • [27] Class-imbalanced complementary-label learning via weighted loss
    Wei, Meng
    Zhou, Yong
    Li, Zhongnian
    Xu, Xinzheng
    NEURAL NETWORKS, 2023, 166 : 555 - 565
  • [28] Large-Scale Learning with Structural Kernels for Class-Imbalanced Datasets
    Severyn, Aliaksei
    Moschitti, Alessandro
    ETERNAL SYSTEMS, 2012, 255 : 34 - 41
  • [29] An adaptive fault diagnosis framework under class-imbalanced conditions based on contrastive augmented deep reinforcement learning
    Zhao, Qin
    Ding, Yu
    Lu, Chen
    Wang, Chao
    Ma, Liang
    Tao, Laifa
    Ma, Jian
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 234
  • [30] Tuning model parameters in class-imbalanced learning with precision-recall curve
    Fu, Guang-Hui
    Yi, Lun-Zhao
    Pan, Jianxin
    BIOMETRICAL JOURNAL, 2019, 61 (03) : 652 - 664