A Double Pruning Algorithm for Classification Ensembles

Authors
Soto, Victor [1 ]
Martinez-Munoz, Gonzalo [1 ]
Hernandez-Lobato, Daniel [1 ]
Suarez, Alberto [1 ]
Affiliation
[1] Univ Autonoma Madrid, EPS, E-28049 Madrid, Spain
Source
Keywords
ensemble pruning; instance-based pruning; ensemble learning; decision trees
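DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract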
This article introduces a double pruning algorithm that can be used to reduce the storage requirements, speed up the classification process, and improve the performance of parallel ensembles. A key element in the design of the algorithm is the estimation of the class label that the ensemble assigns to a given test instance by polling only a fraction of its classifiers. Instead of applying this form of dynamic (instance-based) pruning to the original ensemble, we propose to apply it to a subset of classifiers selected using standard ensemble pruning techniques. The pruned subensemble is built by first modifying the order in which classifiers are aggregated in the ensemble and then selecting the first classifiers in the ordered sequence. Experiments on benchmark problems illustrate the improvements that can be obtained with this technique. Specifically, using a bagging ensemble of 101 CART trees as a starting point, only the 21 trees of the pruned ordered ensemble need to be stored in memory. Depending on the classification task, on average, only 5 to 12 of these 21 classifiers are queried to compute the predictions. The generalization performance achieved by this double pruning algorithm is similar to that of pruned ordered bagging and significantly better than that of standard bagging.
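The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say which ordering heuristic is used, so the greedy reduce-error ordering in `order_and_prune` is an assumption. The stopping rule in `dynamic_vote` is also a simplified, deterministic stand-in for the estimation step: it stops polling as soon as the remaining votes can no longer change the winner. All function names are hypothetical, and classifier outputs are represented as plain lists of labels.

```python
from collections import Counter


def majority(votes):
    """Return the most voted label; ties go to the smallest label."""
    counts = Counter(votes)
    top = max(counts.values())
    return min(label for label, c in counts.items() if c == top)


def order_and_prune(predictions, labels, keep):
    """Greedy ordered aggregation (reduce-error heuristic, an assumption).

    predictions[t][i] is the label classifier t assigns to instance i
    of a selection set with true labels `labels`.
    Returns the indices of the `keep` classifiers selected, in order.
    """
    selected = []
    remaining = list(range(len(predictions)))
    while remaining and len(selected) < keep:
        def error_with(t):
            # Errors of the majority vote if classifier t were added next.
            members = selected + [t]
            return sum(
                majority([predictions[m][i] for m in members]) != y
                for i, y in enumerate(labels)
            )
        best = min(remaining, key=error_with)
        selected.append(best)
        remaining.remove(best)
    return selected


def dynamic_vote(votes):
    """Poll votes in order; stop once the leader cannot be caught.

    Returns (label, number_of_classifiers_queried).
    """
    total = len(votes)
    counts = Counter()
    for queried, vote in enumerate(votes, start=1):
        counts[vote] += 1
        ranked = counts.most_common(2)
        lead = ranked[0][1]
        runner_up = ranked[1][1] if len(ranked) > 1 else 0
        if lead - runner_up > total - queried:
            return ranked[0][0], queried
    return majority(votes), total


# With 21 classifiers, 11 identical votes already decide the outcome.
print(dynamic_vote([1] * 11 + [0] * 10))  # (1, 11)
```

With this stopping rule the prediction always matches the full vote of the pruned subensemble. A rule that stops once the outcome is merely very likely, rather than certain, would query fewer classifiers at the cost of occasional disagreement with the full vote.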
Pages: 104-113
Page count: 10