Adaptive Ensemble Undersampling-Boost: A novel learning framework for imbalanced data

被引:45
|
作者
Lu, Wei [1 ]
Li, Zhe [1 ]
Chu, Jinghui [1 ]
机构
[1] Tianjin Univ, Sch Elect Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Classification; Imbalanced data sets; Real Adaboost; Voting algorithm; Adaptive decision boundary; Ensemble Undersampling; CLASSIFICATION; ALGORITHMS; SUPPORT;
D O I
10.1016/j.jss.2017.07.006
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As one of the most challenging and attractive problems in the pattern recognition and machine intelligence field, imbalanced classification has received a large amount of research attention for many years. In binary classification tasks, one class usually tends to be underrepresented when it consists of far fewer patterns than the other class, which results in undesirable classification results, especially for the minority class. Several techniques, including resampling, boosting and cost-sensitive methods have been proposed to alleviate this problem. Recently, some ensemble methods that focus on combining individual techniques to obtain better performance have been observed to present better classification performance on the minority class. In this paper, we propose a novel ensemble framework called Adaptive Ensemble Undersampling-Boost for imbalanced learning. Our proposal combines the Ensemble of Undersampling (EUS) technique, Real Adaboost, cost-sensitive weight modification, and adaptive boundary decision strategy to build a hybrid algorithm. The superiority of our method over other state-of-the-art ensemble methods is demonstrated by experiments on 18 real world data sets with various data distributions and different imbalance ratios. Given the experimental results and further analysis, our proposal is proven to be a promising alternative that can be applied to various imbalanced classification domains. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:272 / 282
页数:11
相关论文
共 50 条
  • [1] MUEnsemble: Multi-ratio Undersampling-Based Ensemble Framework for Imbalanced Data
    Komamizu, Takahiro
    Uehara, Risa
    Ogawa, Yasuhiro
    Toyama, Katsuhiko
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2020, PT II, 2020, 12392 : 213 - 228
  • [2] A Novel Selective Ensemble Algorithm for Imbalanced Data Classification Based on Exploratory Undersampling
    Yin, Qing-Yan
    Zhang, Jiang-She
    Zhang, Chun-Xia
    Ji, Nan-Nan
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [3] A Novel Ensemble Learning Paradigm for Medical Diagnosis With Imbalanced Data
    Liu, Na
    Li, Xiaomei
    Qi, Ershi
    Xu, Man
    Li, Ling
    Gao, Bo
    [J]. IEEE ACCESS, 2020, 8 : 171263 - 171280
  • [4] An Adaptive Sampling Ensemble Classifier for Learning from Imbalanced Data Sets
    Geiler, Ordonez Jon
    Hong, Li
    Yue-Jian, Guo
    [J]. INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 513 - 517
  • [5] A Heterogeneous Ensemble Learning Framework for Spam Detection in Social Networks with Imbalanced Data
    Zhao, Chensu
    Xin, Yang
    Li, Xuefeng
    Yang, Yixian
    Chen, Yuling
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (03):
  • [6] MESA: Boost Ensemble Imbalanced Learning with MEta-SAmpler
    Liu, Zhining
    Wei, Pengfei
    Jiang, Jing
    Cao, Wei
    Bian, Jiang
    Chang, Yi
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [7] A Novel Ensemble Framework Based on K-Means and Resampling for Imbalanced Data
    Duan, Huajuan
    Wei, Yongqing
    Liu, Peiyu
    Yin, Hongxia
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (05):
  • [8] Multicriteria Classifier Ensemble Learning for Imbalanced Data
    Wegier, Weronika
    Koziarski, Michal
    Wozniak, Micha
    Wegier, Weronika
    [J]. IEEE Access, 2022, 10 : 16807 - 16818
  • [9] Multicriteria Classifier Ensemble Learning for Imbalanced Data
    Wegier, Weronika
    Koziarski, Michal
    Wozniak, Micha
    [J]. IEEE ACCESS, 2022, 10 : 16807 - 16818
  • [10] An Improved Ensemble Learning for Imbalanced Data Classification
    Yuan, Zhengwu
    Zhao, Pu
    [J]. PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 408 - 411