Extending Bagging for Imbalanced Data

被引:23
|
作者
Blaszczynski, Jerzy [1 ]
Stefanowski, Jerzy [1 ]
Idkowiak, Lukasz [1 ]
机构
[1] Poznan Univ Tech, Inst Comp Sci, PL-60965 Poznan, Poland
关键词
Class distributions - Experimental comparison - Imbalanced data - Neighbourhood - Over sampling - Under-sampling;
D O I
10.1007/978-3-319-00969-8_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Various modifications of bagging for class imbalanced data are discussed. An experimental comparison of known bagging modifications shows that integrating with undersampling is more powerful than oversampling. We introduce Local-and-Over-All Balanced bagging where probability of sampling an example is tuned according to the class distribution inside its neighbourhood. Experiments indicate that this proposal is competitive to best undersampling bagging extensions.
引用
收藏
页码:269 / 278
页数:10
相关论文
共 50 条
  • [1] Neighbourhood sampling in bagging for imbalanced data
    Blaszczynski, Jerzy
    Stefanowski, Jerzy
    [J]. NEUROCOMPUTING, 2015, 150 : 529 - 542
  • [2] Actively Balanced Bagging for Imbalanced Data
    Blaszczynski, Jerzy
    Stefanowski, Jerzy
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2017, 2017, 10352 : 271 - 281
  • [3] Lazy bagging for classifying imbalanced data
    Zhu, Xingquan
    [J]. ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 763 - 768
  • [4] Abstaining in rule set bagging for imbalanced data
    Napierala, Krystyna
    Stefanowski, Jerzy
    [J]. LOGIC JOURNAL OF THE IGPL, 2015, 23 (03) : 421 - 430
  • [5] Online Bagging and Boosting for Imbalanced Data Streams
    Wang, Boyu
    Pineau, Joelle
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3353 - 3366
  • [6] Comparing Boosting and Bagging Techniques With Noisy and Imbalanced Data
    Khoshgoftaar, Taghi M.
    Van Hulse, Jason
    Napolitano, Amri
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2011, 41 (03): : 552 - 568
  • [7] Clustering Based Bagging Algorithm on Imbalanced Data Sets
    Sun, Xiao-Yan
    Zhang, Hua-Xiang
    Wang, Zhi-Chao
    [J]. INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING, 2011, 7027 : 179 - 186
  • [8] An Empirical Evaluation of Bagging with Different Algorithms on Imbalanced Data
    Liang, Guohua
    Zhang, Chengqi
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PT I, 2011, 7120 : 339 - 352
  • [9] On Properties of Undersampling Bagging and Its Extensions for Imbalanced Data
    Stefanowski, Jerzy
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 407 - 417
  • [10] Modifications of Classification Strategies in Rule Set Based Bagging for Imbalanced Data
    Napierala, Krystyna
    Stefanowski, Jerzy
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PT II, 2012, 7209 : 514 - 525