Actively Balanced Bagging for Imbalanced Data

被引:8
|
作者
Blaszczynski, Jerzy [1 ]
Stefanowski, Jerzy [1 ]
机构
[1] Poznan Univ Tech, Inst Comp Sci, Piotrowo 2, PL-60965 Poznan, Poland
关键词
Class imbalance; Active learning; Bagging; Ensembles of classifiers; Neighbourhood Balanced Bagging;
D O I
10.1007/978-3-319-60438-1_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Under-sampling extensions of bagging are currently the most accurate ensembles specialized for class imbalanced data. Nevertheless, since improvements of recognition of the minority class, in this type of ensembles, are usually associated with a decrease of recognition of majority classes, we introduce a new, two phase, ensemble called Actively Balanced Bagging. The proposal is to first learn a bagging classifier and then iteratively improve it by updating its bootstraps with a limited number learning examples. The examples are selected according to an active learning strategy, which takes into account: decision margin of votes, example class distribution in the training set and/or in its neighbourhood, and prediction errors of component classifiers. Experiments with synthetic and real-world data confirm usefulness of this proposal.
引用
收藏
页码:271 / 281
页数:11
相关论文
共 50 条
  • [1] The Usefulness of Roughly Balanced Bagging for Complex and High-Dimensional Imbalanced Data
    Lango, Mateusz
    Stefanowski, Jerzy
    [J]. NEW FRONTIERS IN MINING COMPLEX PATTERNS, 2016, 9607 : 93 - 107
  • [2] Diversity Analysis on Imbalanced Data Using Neighbourhood and Roughly Balanced Bagging Ensembles
    Blaszczynski, Jerzy
    Lango, Mateusz
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2016, 2016, 9692 : 552 - 562
  • [3] Extending Bagging for Imbalanced Data
    Blaszczynski, Jerzy
    Stefanowski, Jerzy
    Idkowiak, Lukasz
    [J]. PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 269 - 278
  • [4] Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data
    Mateusz Lango
    Jerzy Stefanowski
    [J]. Journal of Intelligent Information Systems, 2018, 50 : 97 - 127
  • [5] Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data
    Lango, Mateusz
    Stefanowski, Jerzy
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (01) : 97 - 127
  • [6] Neighbourhood sampling in bagging for imbalanced data
    Blaszczynski, Jerzy
    Stefanowski, Jerzy
    [J]. NEUROCOMPUTING, 2015, 150 : 529 - 542
  • [7] Lazy bagging for classifying imbalanced data
    Zhu, Xingquan
    [J]. ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 763 - 768
  • [8] Abstaining in rule set bagging for imbalanced data
    Napierala, Krystyna
    Stefanowski, Jerzy
    [J]. LOGIC JOURNAL OF THE IGPL, 2015, 23 (03) : 421 - 430
  • [9] Online Bagging and Boosting for Imbalanced Data Streams
    Wang, Boyu
    Pineau, Joelle
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3353 - 3366
  • [10] Data Balanced Bagging Ensemble of Convolutional-LSTM Neural Networks for Time Series Data Classification with an Imbalanced Dataset
    Ward, Matthew
    Malmsten, Kevin
    Salamy, Hassan
    Min, Cheol-Hong
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,