Boosting for learning multiple classes with imbalanced class distribution

被引:192
|
作者
Sun, Yanmin [1 ]
Kamel, Mohamed S.
Wang, Yang
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] Software Syst Ltd, Pattern Discovery, Waterloo, ON, Canada
关键词
D O I
10.1109/icdm.2006.29
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Classification of data with imbalanced class distribution has posed a significant drawback of the performance attainable by most standard classifier learning algorithms, which assume a relatively balanced class distribution and equal misclassification costs. This learning difficulty attracts a lot of research interests. Most efforts concentrate on bi-class problems. However bi-class is not the only scenario where the class imbalance problem prevails. Reported solutions for bi-class applications are not applicable to multi-class problems. In this paper we develop a cost-sensitive boosting algorithm to improve the classification performance of imbalanced data involving multiple classes. One barrier of applying the cost-sensitive boosting algorithm to the imbalanced data is that the cost matrix is often unavailable for a problem domain. To solve this problem, we apply Genetic Algorithm to search the optimum cost setup of each class. Empirical tests show that the proposed cost-sensitive boosting algorithm improves the classification performances of imbalanced data sets significantly.
引用
收藏
页码:592 / 602
页数:11
相关论文
共 50 条
  • [41] Addressing the Imbalanced Class Distribution in Fatty Liver Detection in CT Images Using Transfer Learning
    Hailat, Hamzeh
    Mustafa, Ahmad M.
    Najadat, Hassan
    Alomari, Mamoon
    Freihat, Sofyan
    2024 15TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS, ICICS 2024, 2024,
  • [42] Boosting Generalized Few-Shot Learning by Scattering Intra-class Distribution
    Yu, Yunlong
    Jin, Lisha
    Li, Yingming
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 438 - 453
  • [43] Boosting the oversampling methods based on differential evolution strategies for imbalanced learning
    Korkmaz, Sedat
    Sahman, Mehmet Akif
    Cinar, Ahmet Cevahir
    Kaya, Ersin
    APPLIED SOFT COMPUTING, 2021, 112
  • [44] Class association rule mining with multiple imbalanced attributes
    Zhang, Huaifeng
    Zhao, Yanchang
    Cao, Longbing
    Zhang, Chengqi
    AI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4830 : 827 - 831
  • [45] Distribution-Sensitive Learning for Imbalanced Datasets
    Song, Yale
    Morency, Louis-Philippe
    Davis, Randall
    2013 10TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), 2013,
  • [46] Minority Class Oriented Active Learning for Imbalanced Datasets
    Aggarwal, Umang
    Popescu, Adrian
    Hudelot, Celine
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9920 - 9927
  • [47] Active Learning with Imbalanced Multiple Noisy Labeling
    Zhang, Jing
    Wu, Xindong
    Sheng, Victor S.
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (05) : 1081 - 1093
  • [48] Dynamic Centroid Insertion and Adjustment for Data Sets with Multiple Imbalanced Classes
    Silva, Evandro J. R.
    Zanchettin, Cleber
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 766 - 778
  • [49] On Learning Deep Models with Imbalanced Data Distribution
    Majumdar, Puspita
    Singh, Richa
    Vatsa, Mayank
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15720 - 15721
  • [50] Class Rectification Hard Mining for Imbalanced Deep Learning
    Dong, Qi
    Gong, Shaogang
    Zhu, Xiatian
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1869 - 1878