Boosting for learning multiple classes with imbalanced class distribution

Cited by: 192
Authors
Sun, Yanmin [1]
Kamel, Mohamed S.
Wang, Yang
Affiliations
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] Software Syst Ltd, Pattern Discovery, Waterloo, ON, Canada
Source
ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | 2006
DOI
10.1109/icdm.2006.29
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
An imbalanced class distribution significantly limits the performance attainable by most standard classifier learning algorithms, which assume a relatively balanced class distribution and equal misclassification costs. This learning difficulty has attracted considerable research interest. Most efforts concentrate on bi-class problems. However, the bi-class case is not the only scenario in which the class imbalance problem arises, and reported solutions for bi-class applications are not applicable to multi-class problems. In this paper, we develop a cost-sensitive boosting algorithm to improve classification performance on imbalanced data involving multiple classes. One barrier to applying the cost-sensitive boosting algorithm to imbalanced data is that the cost matrix is often unavailable for a problem domain. To solve this problem, we apply a genetic algorithm to search for the optimal cost setup of each class. Empirical tests show that the proposed cost-sensitive boosting algorithm significantly improves classification performance on imbalanced data sets.
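The abstract does not spell out the boosting update, so the following is only a minimal sketch of how a per-class cost could enter one round of multi-class boosting. The function name `cost_sensitive_round`, the placement of the cost inside both the learner weight and the sample-weight update, and the example cost values are assumptions made for illustration; they are not taken from the paper. In the paper's setting the per-class costs would come from the genetic algorithm search rather than being fixed by hand.

```python
import math


def cost_sensitive_round(weights, y_true, y_pred, class_costs):
    """One illustrative reweighting round with per-class costs.

    Hypothetical helper, not the paper's published algorithm.
    weights:     current sample weights (should sum to 1)
    y_true:      true class label of each sample
    y_pred:      label predicted by the current weak learner
    class_costs: dict mapping class label -> misclassification cost
    Returns (alpha, new_weights).
    """
    costs = [class_costs[y] for y in y_true]
    correct = [t == p for t, p in zip(y_true, y_pred)]

    # Cost-weighted mass of correctly and incorrectly classified samples.
    right = sum(c * w for c, w, ok in zip(costs, weights, correct) if ok)
    wrong = sum(c * w for c, w, ok in zip(costs, weights, correct) if not ok)
    if right <= 0 or wrong <= 0:
        raise ValueError("weak learner is perfect or useless; stop boosting")

    # Learner weight: positive only if the cost-weighted correct mass
    # exceeds the cost-weighted error mass.
    alpha = 0.5 * math.log(right / wrong)

    # Shrink correctly classified samples, grow misclassified ones,
    # scaling each by the cost of its true class (assumed form).
    raw = [
        c * w * math.exp(-alpha if ok else alpha)
        for c, w, ok in zip(costs, weights, correct)
    ]
    total = sum(raw)
    return alpha, [r / total for r in raw]


# Toy data: class 0 is the majority, classes 1 and 2 are rare.
y_true = [0, 0, 0, 0, 1, 2]
y_pred = [0, 0, 0, 1, 0, 2]
weights = [1.0 / len(y_true)] * len(y_true)
class_costs = {0: 1.0, 1: 3.0, 2: 3.0}  # assumed costs, chosen by hand

alpha, new_weights = cost_sensitive_round(weights, y_true, y_pred, class_costs)
print(alpha, new_weights)
```

With these assumed costs, the misclassified rare-class sample (index 4) ends up with more weight than the misclassified majority-class sample (index 3), which is the intended effect of attaching higher costs to rare classes.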
Pages: 592-602
Number of pages: 11
Related papers
50 in total
  • [21] Plankton Detection with Adversarial Learning and a Densely Connected Deep Learning Model for Class Imbalanced Distribution
    Li, Yan
    Guo, Jiahong
    Guo, Xiaomin
    Hu, Zhiqiang
    Tian, Yu
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2021, 9 (06)
  • [22] Imbalanced Label Distribution Learning
    Zhao, Xingyu
    An, Yuexuan
    Xu, Ning
    Wang, Jing
    Geng, Xin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 11336 - 11344
  • [23] An Analysis of Several Machine Learning Algorithms for Imbalanced Classes
    Datta, Soma
    Arputharaj, Anuprabha
    2018 5TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI), 2018, : 22 - 27
  • [24] Lifelong learning on evolving graphs under the constraints of imbalanced classes and new classes
    Galke, Lukas
    Vagliano, Iacopo
    Franke, Benedikt
    Zielke, Tobias
    Hoffmann, Marcel
    Scherp, Ansgar
    NEURAL NETWORKS, 2023, 164 : 156 - 176
  • [25] SCD: Sampling-based Class Distribution for Imbalanced Semi-Supervised Learning
    Qiu, Haomiao
    Liu, Haixing
    Zhang, Chi
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 567 - 572
  • [26] Improvement in Boosting Method by Using RUSTBoost Technique for Class Imbalanced Data
    Kumar, Ashutosh
    Bharti, Roshan
    Gupta, Deepak
    Saha, Anish Kumar
    RECENT DEVELOPMENTS IN MACHINE LEARNING AND DATA ANALYTICS, 2019, 740 : 51 - 66
  • [27] Boosting one-class transfer learning for multiple view uncertain data
    Liu, Bo
    Cao, Fan
    Zhao, Shilei
    Xiao, Yanshan
    INFORMATION SCIENCES, 2025, 692
  • [28] Loss Factors for Learning Boosting Ensembles from Imbalanced Data
    Soleymani, Roghayeh
    Granger, Eric
    Fumera, Giorgio
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 204 - 209
  • [29] CDMAD: Class-Distribution-Mismatch-Aware Debiasing for Class-Imbalanced Semi-Supervised Learning
    Lee, Hyuck
    Kim, Heeyoung
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23891 - 23900
  • [30] Hierarchical classification for imbalanced multiple classes in machine vision inspection
    Luo, Bing
    Zhang, Yun
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS, 2007, : 536 - +