Adaptive Ensemble Undersampling-Boost: A novel learning framework for imbalanced data

被引:45
|
作者
Lu, Wei [1 ]
Li, Zhe [1 ]
Chu, Jinghui [1 ]
机构
[1] Tianjin Univ, Sch Elect Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Classification; Imbalanced data sets; Real Adaboost; Voting algorithm; Adaptive decision boundary; Ensemble Undersampling; CLASSIFICATION; ALGORITHMS; SUPPORT;
D O I
10.1016/j.jss.2017.07.006
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As one of the most challenging and attractive problems in the pattern recognition and machine intelligence field, imbalanced classification has received a large amount of research attention for many years. In binary classification tasks, one class usually tends to be underrepresented when it consists of far fewer patterns than the other class, which results in undesirable classification results, especially for the minority class. Several techniques, including resampling, boosting and cost-sensitive methods have been proposed to alleviate this problem. Recently, some ensemble methods that focus on combining individual techniques to obtain better performance have been observed to present better classification performance on the minority class. In this paper, we propose a novel ensemble framework called Adaptive Ensemble Undersampling-Boost for imbalanced learning. Our proposal combines the Ensemble of Undersampling (EUS) technique, Real Adaboost, cost-sensitive weight modification, and adaptive boundary decision strategy to build a hybrid algorithm. The superiority of our method over other state-of-the-art ensemble methods is demonstrated by experiments on 18 real world data sets with various data distributions and different imbalance ratios. Given the experimental results and further analysis, our proposal is proven to be a promising alternative that can be applied to various imbalanced classification domains. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:272 / 282
页数:11
相关论文
共 50 条
  • [31] Undersampling based on generalized learning vector quantization and natural nearest neighbors for imbalanced data
    Wang, Long-Hui
    Dai, Qi
    Wang, Jia-You
    Du, Tony
    Chen, Lifang
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [32] A Novel Adaptive Undersampling Framework for Class-Imbalance Fault Detection
    Qian, Min
    Li, Yan-Fu
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2023, 72 (03) : 1003 - 1017
  • [33] Ensemble learning method based on CNN for class imbalanced data
    Xin Zhong
    Nan Wang
    [J]. The Journal of Supercomputing, 2024, 80 : 10090 - 10121
  • [34] Incremental Weighted Ensemble Broad Learning System for Imbalanced Data
    Yang, Kaixiang
    Yu, Zhiwen
    Chen, C. L. Philip
    Cao, Wenming
    You, Jane
    Wong, Hau-San
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (12) : 5809 - 5824
  • [35] Ensemble learning method based on CNN for class imbalanced data
    Zhong, Xin
    Wang, Nan
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (07): : 10090 - 10121
  • [36] imDC: an ensemble learning method for imbalanced classification with miRNA data
    Wang, C. Y.
    Hu, L. L.
    Guo, M. Z.
    Liu, X. Y.
    Zou, Q.
    [J]. GENETICS AND MOLECULAR RESEARCH, 2015, 14 (01): : 123 - 133
  • [37] A preprocessing method combined with an ensemble framework for the multiclass imbalanced data classification
    Pavan Kumar M.R.
    Jayagopal P.
    [J]. International Journal of Computers and Applications, 2022, 44 (12) : 1178 - 1185
  • [38] Spark-based ensemble learning for imbalanced data classification
    Ding J.
    Wang S.
    Jia L.
    You J.
    Jiang Y.
    [J]. International Journal of Performability Engineering, 2018, 14 (05) : 945 - 964
  • [39] Adaptive Ensemble Method Based on Spatial Characteristics for Classifying Imbalanced Data
    Wang, Lei
    Zhao, Lei
    Gui, Guan
    Zheng, Baoyu
    Huang, Ruochen
    [J]. SCIENTIFIC PROGRAMMING, 2017, 2017
  • [40] A clustering-based adaptive undersampling ensemble method for highly unbalanced data classification
    Yuan, Xiaohan
    Sun, Chuan
    Chen, Shuyu
    [J]. APPLIED SOFT COMPUTING, 2024, 159