A post-processing framework for class-imbalanced learning in a transductive setting

Cited by: 0

Authors:

Jiang, Zhen [1]

Lu, Yu [1]

Zhao, Lingyun [1]

Zhan, Yongzhao [1]

Mao, Qirong [1]

Affiliations:

[1] Jiangsu Univ, Sch Comp Sci & Commun Engn, 301 Xuefu Rd, Zhenjiang 212013, Peoples R China

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2024 / Volume 249

Funding:

National Natural Science Foundation of China;

Keywords:

Class-imbalanced learning; Post-processing; Class proportion; Compact prototype; ENSEMBLES; DATASETS; SVM;

DOI:

10.1016/j.eswa.2024.123832

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Traditional classification tasks suffer from the class-imbalance problem, where some classes far outnumber others. To address this issue, existing class-imbalanced learning (CIL) methods either preprocess class-imbalanced datasets or adapt traditional classification algorithms to the imbalanced class distribution. Inspired by the idea of transductive learning, we propose a post-processing framework called PPF for CIL. Distinct from existing CIL methods, PPF directly adjusts the predicted labels of test data to fit the imbalanced class distribution. Specifically, we relabel some test data according to their prediction probabilities so that the class proportion of test data is close to that of training data. The underlying assumption is that training and test data, drawn independently from one data space, should obey the same class distribution. Furthermore, we propose a Compact Prototype-based Nearest Neighbor (CPNN) algorithm to assist the original classifier with the adjustment. Instead of training a classifier, CPNN classifies test data according to their distances to a set of prototypes estimated on labeled data. Thus, it is computationally simple and relatively robust to class imbalance. As a general framework, PPF can be easily applied to both traditional classification and CIL algorithms. To validate the effectiveness of the proposed method, we conducted extensive experiments on a variety of class-imbalanced datasets, using SVM and C4.5 as the original classifiers, respectively. Measured by F-measure, G-mean, and AUC, both PPF-SVM and PPF-C4.5 outperform 10 state-of-the-art CIL algorithms. Additionally, PPF further improved the performance of 10 CIL algorithms when applied to them.
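
The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' PPF or CPNN implementation: the abstract does not specify the relabeling rule or how compact prototypes are estimated. The sketch assumes a binary task, relabels by ranking test items on their predicted positive-class probability, and uses plain class means as stand-in prototypes. The names `match_class_proportion`, `class_mean_prototypes`, and `nearest_prototype` are hypothetical.

```python
from math import dist


def match_class_proportion(pos_probs, train_pos_ratio):
    """Relabel binary predictions so the positive share matches training data.

    Assumption: the k most probable items become positive, where
    k = round(train_pos_ratio * n). This is a simplification, not the
    paper's exact relabeling rule.
    """
    n = len(pos_probs)
    k = round(train_pos_ratio * n)
    # Rank test items from most to least likely positive.
    order = sorted(range(n), key=lambda i: pos_probs[i], reverse=True)
    labels = [0] * n
    for i in order[:k]:
        labels[i] = 1
    return labels


def class_mean_prototypes(X, y):
    """Estimate one prototype per class as the mean of its labeled points.

    Assumption: class means stand in for the paper's compact prototypes.
    """
    sums, counts = {}, {}
    for x, label in zip(X, y):
        if label not in sums:
            sums[label] = [0.0] * len(x)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], x)]
        counts[label] += 1
    return {c: [s / counts[c] for s in sums[c]] for c in sums}


def nearest_prototype(x, prototypes):
    """Return the class whose prototype is closest to x (Euclidean distance)."""
    return min(prototypes, key=lambda c: dist(x, prototypes[c]))
```

For example, with predicted probabilities `[0.9, 0.6, 0.55, 0.4, 0.3, 0.2, 0.1, 0.05]` and a training positive ratio of 0.25, a 0.5 threshold would label three items positive, whereas `match_class_proportion` keeps only the two most confident ones, so the test proportion equals the training proportion.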

Pages: 17
