Cost Complexity-Based Pruning of Ensemble Classifiers

被引:44
|
作者
Prodromidis, Andreas L. [1 ]
Stolfo, Salvatore J. [1 ]
机构
[1] Department of Computer Science, Columbia University, New York, United States
关键词
Keywords: Classifier evaluation; Credit card fraud detection; Distributed data mining; Ensembles of classifiers; Meta-learning; Pruning;
D O I
10.1007/PL00011678
中图分类号
学科分类号
摘要
In this paper we study methods that combine multiple classification models learned over separate data sets. Numerous studies posit that such approaches provide the means to efficiently scale learning to large data sets, while also boosting the accuracy of individual classifiers. These gains, however, come at the expense of an increased demand for run-time system resources. The final ensemble meta-classifier may consist of a large collection of base classifiers that require increased memory resources while also slowing down classification throughput. Here, we describe an algorithm for pruning (i.e., discarding a subset of the available base classifiers) the ensemble meta-classifier as a means to reduce its size while preserving its accuracy and we present a technique for measuring the trade-off between predictive performance and available run-time system resources. The algorithm is independent of the method used initially when computing the meta-classifier. It is based on decision tree pruning methods and relies on the mapping of an arbitrary ensemble meta-classifier to a decision tree model. Through an extensive empirical study on meta-classifiers computed over two real data sets, we illustrate our pruning algorithm to be a robust and competitive approach to discarding classification models without degrading the overall predictive performance of the smaller ensemble computed over those that remain after pruning.
引用
收藏
页码:449 / 469
页数:20
相关论文
共 50 条
  • [1] Analysis of a complexity-based pruning scheme for classification trees
    Nobel, AB
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2002, 48 (08) : 2362 - 2368
  • [2] A LEMPEL-ZIV COMPLEXITY-BASED NEURAL NETWORK PRUNING ALGORITHM
    Ahmed, Sultan Uddin
    Shahjahan, Md.
    Murase, Kazuyuki
    [J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2011, 21 (05) : 427 - 441
  • [3] A heuristic complexity-based method for cost estimation of aerospace systems
    Banazadeh, Afshin
    Jafari, Mohammad Haji
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2013, 227 (11) : 1685 - 1700
  • [4] Pruning an ensemble of classifiers via reinforcement learning
    Partalas, Ioannis
    Tsoumakas, Grigorios
    Vlahavas, Ioannis
    [J]. NEUROCOMPUTING, 2009, 72 (7-9) : 1900 - 1909
  • [5] Complexity-based Steganalysis
    Manoharan, Sathiamoorthy
    Master, Aquib
    Speidel, Ulrich
    [J]. 2014 INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY AND ITS APPLICATIONS (ISITA), 2014, : 40 - 44
  • [6] COMPLEXITY-BASED INDUCTION
    CONKLIN, D
    WITTEN, IH
    [J]. MACHINE LEARNING, 1994, 16 (03) : 203 - 225
  • [7] Ensemble one-class classifiers based on hybrid diversity generation and pruning
    Liu, Jia-Chen
    Miao, Qi-Guang
    Cao, Ying
    Song, Jian-Feng
    Quan, Yi-Ning
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2015, 37 (02): : 386 - 393
  • [8] Online pruning of base classifiers for Dynamic Ensemble Selection
    Oliveira, Dayvid V. R.
    Cavalcanti, George D. C.
    Sabourin, Robert
    [J]. PATTERN RECOGNITION, 2017, 72 : 44 - 58
  • [9] Complexity-Based Code Embeddings
    Folea, Rares
    Iacob, Radu
    Slusanschi, Emil
    Rebedea, Traian
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023, 2023, 14162 : 256 - 269
  • [10] Rapid cost assessment of space mission concepts through application of complexity-based cost indices
    Peterson, Craig
    Cutts, James
    Balint, Tibor
    Hall, James B.
    [J]. 2008 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2008, : 4519 - +