A Bootstrap-Based Iterative Selection for Ensemble Generation

被引:0
|
作者
Oliveira, Dayvid V. R. [1 ]
Porpino, Thyago N. [1 ]
Cavalcanti, George D. C. [1 ]
Ren, Tsang Ing [1 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
关键词
Ensemble Generation; Multiple Classifier Systems; Bagging; Imbalanced Datasets; SMOTE; SUPPORT VECTOR MACHINES; IMBALANCED DATA; CLASSIFICATION; DIVERSITY; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a bootstrap-based iterative method for generating classifier ensembles called Iterative Classifier Selection Bagging (ICS-Bagging). Each iteration of ICS-Bagging has two phases: i) bootstrap sampling to generate a pool of classifiers; and, ii) selection of the best classifier of the pool using a fitness function based on the ensemble accuracy and diversity. The selected classifier is added to the final ensemble. The bootstrap sampling runs on each iteration and updates the probability of sampling per class based on the class accuracy. This process is repeated until the number of classifiers in the final ensemble is reached. For the specific case of imbalanced datasets, we also propose the SMOTE-ICS-Bagging, a variation of the ICS-Bagging that runs SMOTE at the beginning of each iteration in order to reduce the class imbalance before data sampling. We compared the proposed techniques with Bagging, Random Subspace and SMOTEBagging, using 15 imbalanced datasets from KEEL. The results show the proposed techniques outperform all other techniques in accuracy. Ranking diagrams revealed that the proposed algorithms achieved the highest rankings in accuracy, outperforming SMOTEBagging, a renowned ensemble generation method for imbalanced datasets.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] The Impact of Under-sampling on the Performance of Bootstrap-based Ensemble Feature Selection
    Guney, Huseyin
    Oztoprak, Huseyin
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [2] Bootstrap-based homogeneous ensemble feature selection for network intrusion detection system
    Damtew, Yeshalem Gezahegn
    Chen, Hongmei
    Din, Burhan Mohi Yu
    [J]. DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 27 - 34
  • [3] Bootstrap-based ARMA order selection
    Fenga, Livio
    Politis, Dimitris N.
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2011, 81 (07) : 799 - 814
  • [4] Bootstrap-based Selection for Instrumental Variables Model
    Wang, Wenjie
    Liu, Qingfeng
    [J]. ECONOMICS BULLETIN, 2015, 35 (03): : 1886 - +
  • [5] Bootstrap-based model selection criteria for beta regressions
    Fábio M. Bayer
    Francisco Cribari-Neto
    [J]. TEST, 2015, 24 : 776 - 795
  • [6] Bootstrap-based model selection criteria for beta regressions
    Bayer, Fabio M.
    Cribari-Neto, Francisco
    [J]. TEST, 2015, 24 (04) : 776 - 795
  • [7] THE BOOTSTRAP-BASED SELECTION CRITERIA: AN OPTIMAL CHOICE FOR MODEL SELECTION IN LINEAR REGRESSION
    Shang, Junfeng
    [J]. ADVANCES AND APPLICATIONS IN STATISTICS, 2010, 14 (02) : 173 - 189
  • [8] A bootstrap-based strategy for spectral interval selection in PLS regression
    Bras, Ligia P.
    Lopes, Marta
    Ferreira, Ana P.
    Menezes, Jose C.
    [J]. JOURNAL OF CHEMOMETRICS, 2008, 22 (11-12) : 695 - 700
  • [9] Bootstrap-based signal denoising
    Kan, HE
    Hippenstiel, RD
    Fargues, MP
    [J]. THIRTY-SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS - CONFERENCE RECORD, VOLS 1 AND 2, CONFERENCE RECORD, 2002, : 958 - 962
  • [10] An exact bootstrap-based bandwidth selection rule for kernel quantile estimators
    Liu, Xiaoyu
    Song, Yan
    Zhang, Kun
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (08) : 3699 - 3720