A Bootstrap-Based Iterative Selection for Ensemble Generation

被引:0
|
作者
Oliveira, Dayvid V. R. [1 ]
Porpino, Thyago N. [1 ]
Cavalcanti, George D. C. [1 ]
Ren, Tsang Ing [1 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
关键词
Ensemble Generation; Multiple Classifier Systems; Bagging; Imbalanced Datasets; SMOTE; SUPPORT VECTOR MACHINES; IMBALANCED DATA; CLASSIFICATION; DIVERSITY; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a bootstrap-based iterative method for generating classifier ensembles called Iterative Classifier Selection Bagging (ICS-Bagging). Each iteration of ICS-Bagging has two phases: i) bootstrap sampling to generate a pool of classifiers; and, ii) selection of the best classifier of the pool using a fitness function based on the ensemble accuracy and diversity. The selected classifier is added to the final ensemble. The bootstrap sampling runs on each iteration and updates the probability of sampling per class based on the class accuracy. This process is repeated until the number of classifiers in the final ensemble is reached. For the specific case of imbalanced datasets, we also propose the SMOTE-ICS-Bagging, a variation of the ICS-Bagging that runs SMOTE at the beginning of each iteration in order to reduce the class imbalance before data sampling. We compared the proposed techniques with Bagging, Random Subspace and SMOTEBagging, using 15 imbalanced datasets from KEEL. The results show the proposed techniques outperform all other techniques in accuracy. Ranking diagrams revealed that the proposed algorithms achieved the highest rankings in accuracy, outperforming SMOTEBagging, a renowned ensemble generation method for imbalanced datasets.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Bootstrap-based tolerance intervals for application to method validation
    Rebafka, Tabea
    Clemencon, Stphan
    Feinberg, Max
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2007, 89 (02) : 69 - 81
  • [42] BOOTSTRAP-BASED SVM AGGREGATION FOR CLASS IMBALANCE PROBLEMS
    Sukhanov, S.
    Merentitis, A.
    Debes, C.
    Hahn, J.
    Zoubir, A. M.
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 165 - 169
  • [43] Bootstrap-based Support of HGT Inferred by Maximum Parsimony
    Hyun Jung Park
    Guohua Jin
    Luay Nakhleh
    [J]. BMC Evolutionary Biology, 10
  • [44] Estimating the variance of a combined forecast: Bootstrap-based approach
    Hounyo, Ulrich
    Lahiri, Kajal
    [J]. JOURNAL OF ECONOMETRICS, 2023, 232 (02) : 445 - 468
  • [45] A bootstrap-based non-parametric forecast density
    Manzan, Sebastiano
    Zerom, Dawit
    [J]. INTERNATIONAL JOURNAL OF FORECASTING, 2008, 24 (03) : 535 - 550
  • [46] Analysis of heat wave effects on health by using generalized additive model and bootstrap-based model selection
    Pauli, Francesco
    Rizzi, Laura
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2008, 57 : 473 - 485
  • [47] Detecting statistically significant changes in connectedness: A bootstrap-based technique
    Greenwood-Nimmo, Matthew
    Kocenda, Evzen
    Nguyen, Viet Hoang
    [J]. ECONOMIC MODELLING, 2024, 140
  • [48] A Bootstrap-Based Approach for Improving Measurements by Retarding Potential Analyzers
    Debchoudhury, Shantanab
    Sengupta, Srijan
    Earle, Gregory
    Coley, William
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-SPACE PHYSICS, 2019, 124 (06) : 4569 - 4584
  • [49] Bootstrap-based critical values for tests of common factor restrictions
    Godfrey, LG
    Veall, MR
    [J]. ECONOMICS LETTERS, 1998, 59 (01) : 1 - 5
  • [50] BOOTSTRAP-BASED BANDWIDTH CHOICE FOR LOG-PERIODOGRAM REGRESSION
    Arteche, Josu
    Orbe, Jesus
    [J]. JOURNAL OF TIME SERIES ANALYSIS, 2009, 30 (06) : 591 - 617