Boosting for learning multiple classes with imbalanced class distribution

Cited by: 192
Authors
Sun, Yanmin [1]
Kamel, Mohamed S.
Wang, Yang
Affiliations
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] Software Syst Ltd, Pattern Discovery, Waterloo, ON, Canada
Source
ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | 2006
DOI
10.1109/icdm.2006.29
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Imbalanced class distributions significantly limit the performance attainable by most standard classifier learning algorithms, which assume a relatively balanced class distribution and equal misclassification costs. This learning difficulty has attracted considerable research interest. Most efforts concentrate on bi-class problems. However, the bi-class case is not the only scenario in which the class imbalance problem arises, and solutions reported for bi-class applications are not applicable to multi-class problems. In this paper we develop a cost-sensitive boosting algorithm to improve classification performance on imbalanced data involving multiple classes. One barrier to applying the cost-sensitive boosting algorithm to imbalanced data is that the cost matrix is often unavailable for a problem domain. To solve this problem, we apply a genetic algorithm to search for the optimum cost setup of each class. Empirical tests show that the proposed cost-sensitive boosting algorithm significantly improves classification performance on imbalanced data sets.
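The abstract does not give the update rule, so the following is only a minimal sketch of how per-class costs can be folded into an AdaBoost-style reweighting step for multi-class data. It is a generic illustration, not necessarily the algorithm proposed in the paper. The function name `cost_sensitive_round`, the per-class cost dictionary, and the toy labels are all assumptions made for this example; in the paper's setting the cost values would come from the genetic-algorithm search rather than being fixed by hand.

```python
import math


def cost_sensitive_round(weights, y_true, y_pred, class_cost):
    """One boosting round with per-class costs in the weight update.

    Hypothetical helper for illustration; returns (alpha, new_weights).
    """
    triples = list(zip(weights, y_true, y_pred))
    # Cost-weighted mass of correctly and incorrectly classified samples.
    correct = sum(class_cost[y] * w for w, y, p in triples if y == p)
    wrong = sum(class_cost[y] * w for w, y, p in triples if y != p)
    if wrong == 0 or correct <= wrong:
        raise ValueError("weak learner is perfect or too weak for this round")
    alpha = 0.5 * math.log(correct / wrong)

    # Misclassified samples are up-weighted, correct ones down-weighted,
    # and every sample is scaled by the cost of its true class.
    raw = [class_cost[y] * w * math.exp(alpha if y != p else -alpha)
           for w, y, p in triples]
    total = sum(raw)
    return alpha, [r / total for r in raw]


# Toy data (assumed): class "a" is the majority, "b" and "c" are rare.
y_true = ["a", "a", "a", "a", "b", "c"]
y_pred = ["a", "a", "a", "a", "a", "c"]  # the single "b" sample is missed
weights = [1 / 6] * 6
cost = {"a": 1.0, "b": 3.0, "c": 3.0}    # assumed costs, higher for rare classes

alpha, new_weights = cost_sensitive_round(weights, y_true, y_pred, cost)
```

In this toy run the misclassified minority sample ends up with the largest weight, and the correctly classified minority sample still outweighs each majority sample because of its higher class cost.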
Pages: 592-602
Page count: 11
Related papers
50 items in total
  • [31] Boundary Focal Loss for Class Imbalanced Learning
    Lin, Weizhong
    Wu, Peng
    Xiao, Xuan
    2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021), 2021
  • [32] Defying Imbalanced Forgetting in Class Incremental Learning
    Xu, Shixiong
    Meng, Gaofeng
    Nie, Xing
    Ni, Bolin
    Fan, Bin
    Xiang, Shiming
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024 : 16211 - 16219
  • [33] Fairness-aware Class Imbalanced Learning
    Subramanian, Shivashankar
    Rahimi, Afshin
    Baldwin, Timothy
    Cohn, Trevor
    Frermann, Lea
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021 : 2045 - 2051
  • [34] Adapting MultiBoost Ensemble for Class Imbalanced Learning
    Mustafa, Ghulam
    Niu, Zhendong
    Chen, Jie
    2015 IEEE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS (CYBCONF), 2015 : 12 - 17
  • [35] Adjusting Decision Boundary for Class Imbalanced Learning
    Kim, Byungju
    Kim, Junmo
    IEEE ACCESS, 2020, 8 : 81674 - 81685
  • [36] Addressing the issue of digital mapping of soil classes with imbalanced class observations
    Sharififar, Amin
    Sarmadian, Fereydoon
    Malone, Brendan P.
    Minasny, Budiman
    GEODERMA, 2019, 350 : 84 - 92
  • [37] Parallel classifiers ensemble with hierarchical machine learning for imbalanced classes
    Zhang, Yun
    Luo, Bing
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008 : 94 - 99
  • [38] Boosting methods for multi-class imbalanced data classification: an experimental review
    Jafar Tanha
    Yousef Abdi
    Negin Samadi
    Nazila Razzaghi
    Mohammad Asadpour
    Journal of Big Data, 7
  • [39] Boosting methods for multi-class imbalanced data classification: an experimental review
    Tanha, Jafar
    Abdi, Yousef
    Samadi, Negin
    Razzaghi, Nazila
    Asadpour, Mohammad
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [40] Improving Classification Performance for the Minority Class in Highly Imbalanced Dataset using Boosting
    Abouelenien, Mohamed
    Yuan, Xiaohui
    Duraisamy, Prakash
    Yuan, Xiaojing
    2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION & NETWORKING TECHNOLOGIES (ICCCNT), 2012