Evolutionary simultaneous under and oversampling of instances for dealing with class-imbalance datasets in multilabel problems

被引:2
|
作者
Garcia-Pedrajas, Nicolas [1 ]
Cuevas-Munoz, Jose M. [1 ]
de Haro-Garcia, Aida [1 ]
机构
[1] Univ Cordoba, Campus Rabanales, Cordoba 14011, Spain
关键词
Multilabel classification; Class-imbalanced datasets; Evolutionary algorithms; LABEL; CLASSIFICATION; ENSEMBLE; SMOTE; COST;
D O I
10.1016/j.asoc.2024.111618
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multilabel classification has recently attracted great attention from the data mining research community. Multilabel classification is concerned with learning where each instance can be associated with multiple classes (or labels). Class -imbalance problems appear in any classification task when the class distribution of the instances is very different. In multilabel classification, this problem is ubiquitous, as a large percentage of labels suffer from a class-imbalanced distribution. The adaptation of single -label methods to deal with the class -imbalance problem in multilabel learning is problematic as many of their basic concepts are not easily transferred. In this paper, we propose the use of evolutionary computation to simultaneously oversample the minority class and undersample the majority class for multilabel problems. Letting the algorithm autonomously select the instances to undersample and oversample allows us to extend these two successful paradigms to the multilabel task. An extensive comparison setup of 35 datasets shows the advantages of using this approach to deal with class -imbalance datasets for multilabel problems compared with previously published methods as well as the basic classification algorithms with the original datasets.
引用
收藏
页数:21
相关论文
共 34 条
  • [1] An Oversampling Method for Class Imbalance Problems on Large Datasets
    Rodriguez-Torres, Fredy
    Martinez-Trinidad, Jose F.
    Carrasco-Ochoa, Jesus A.
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [2] Generating Counterfactual Instances for Explainable Class-Imbalance Learning
    Chen, Zhi
    Duan, Jiang
    Kang, Li
    Xu, Hongyan
    Chen, Rui
    Qiu, Guoping
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (03) : 1130 - 1144
  • [3] Partial random under/oversampling for multilabel problems
    Garcia-Pedrajas, Nicolas
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 302
  • [4] Measuring the class-imbalance extent of multi-class problems
    Ortigosa-Hernandez, Jonathan
    Inza, Inaki
    Lozano, Jose A.
    [J]. PATTERN RECOGNITION LETTERS, 2017, 98 : 32 - 38
  • [5] Multi-fairness Under Class-Imbalance
    Roy, Arjun
    Iosifidis, Vasileios
    Ntoutsi, Eirini
    [J]. DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 286 - 301
  • [6] On the Performance of Oversampling Techniques for Class Imbalance Problems
    Kong, Jiawen
    Rios, Thiago
    Kowalczyk, Wojtek
    Menzel, Stefan
    Back, Thomas
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II, 2020, 12085 : 84 - 96
  • [7] Exploratory under-sampling for class-imbalance learning
    Liu, Xu-Ying
    Wu, Jianxin
    Zhou, Zhi-Hua
    [J]. ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 965 - 969
  • [8] Novel resampling algorithms with maximal cliques for class-imbalance problems
    [J]. Chen, Li-fang (hblg_clf@163.com), 2025, 199
  • [9] An Ensemble Learning Approach with Gradient Resampling for Class-Imbalance Problems
    Zhao, Hongke
    Zhao, Chuang
    Zhang, Xi
    Liu, Nanlin
    Zhu, Hengshu
    Liu, Qi
    Xiong, Hui
    [J]. INFORMS JOURNAL ON COMPUTING, 2023, 35 (04) : 747 - 763
  • [10] On Chance Performance in High-Dimensional Class-Imbalance Problems
    Udu, Amadi Gabriel
    Lecchini-Visintini, Andrea
    Dong, Hongbiao
    [J]. 2024 UKACC 14TH INTERNATIONAL CONFERENCE ON CONTROL, CONTROL, 2024, : 254 - 255