Boosting for learning multiple classes with imbalanced class distribution

被引:192
|
作者
Sun, Yanmin [1 ]
Kamel, Mohamed S.
Wang, Yang
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] Software Syst Ltd, Pattern Discovery, Waterloo, ON, Canada
关键词
D O I
10.1109/icdm.2006.29
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Classification of data with imbalanced class distribution has posed a significant drawback of the performance attainable by most standard classifier learning algorithms, which assume a relatively balanced class distribution and equal misclassification costs. This learning difficulty attracts a lot of research interests. Most efforts concentrate on bi-class problems. However bi-class is not the only scenario where the class imbalance problem prevails. Reported solutions for bi-class applications are not applicable to multi-class problems. In this paper we develop a cost-sensitive boosting algorithm to improve the classification performance of imbalanced data involving multiple classes. One barrier of applying the cost-sensitive boosting algorithm to the imbalanced data is that the cost matrix is often unavailable for a problem domain. To solve this problem, we apply Genetic Algorithm to search the optimum cost setup of each class. Empirical tests show that the proposed cost-sensitive boosting algorithm improves the classification performances of imbalanced data sets significantly.
引用
收藏
页码:592 / 602
页数:11
相关论文
共 50 条
  • [1] The Influence of Multiple Classes on Learning from Imbalanced Data Streams
    Lipska, Agnieszka
    Stefanowski, Jerzy
    FOURTH INTERNATIONAL WORKSHOP ON LEARNING WITH IMBALANCED DOMAINS: THEORY AND APPLICATIONS, VOL 183, 2022, 183 : 187 - 198
  • [2] Noise Detection in Imbalanced Classes Using Adaptive Boosting
    Saglam, Fatih
    Cengiz, Mehmet Ali
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 449 - 452
  • [3] Multi-class Boosting for Imbalanced Data
    Fernandez-Baldera, Antonio
    Buenaposada, Jose M.
    Baumela, Luis
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 57 - 64
  • [4] Boosting weighted ELM for imbalanced learning
    Li, Kuan
    Kong, Xiangfei
    Lu, Zhi
    Liu Wenyin
    Yin, Jianping
    NEUROCOMPUTING, 2014, 128 : 15 - 21
  • [5] Fairness-Aware Class Imbalanced Learning on Multiple Subgroups
    Tarzanagh, Davoud Ataee
    Hou, Bojian
    Tong, Boning
    Long, Qi
    Shen, Li
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2123 - 2133
  • [6] Online Active Learning with Imbalanced Classes
    Ferdowsi, Zahra
    Ghani, Rayid
    Settimi, Raffaella
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 1043 - 1048
  • [7] Learning Ensembles in the Presence of Imbalanced Classes
    Saadallah, Amal
    Piatkowski, Nico
    Finkeldey, Felix
    Wiederkehr, Petra
    Morik, Katharina
    ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 866 - 873
  • [8] Evolutionary inversion of class distribution in overlapping areas for multi-class imbalanced learning
    Fernandes, Everlandio R. Q.
    de Carvalho, Andre C. P. L. F.
    INFORMATION SCIENCES, 2019, 494 : 141 - 154
  • [9] Supervised Class Distribution Learning for GANs-based Imbalanced Classification
    Cai, Zixin
    Wang, Xinyue
    Zhou, Mingjie
    Xu, Jian
    Jing, Liping
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 41 - 50
  • [10] Imbalanced Class Learning in Epigenetics
    Haque, M. Muksitul
    Skinner, Michael K.
    Holder, Lawrence B.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2014, 21 (07) : 492 - 507