An overview and comprehensive comparison of ensembles for concept drift

Cited by: 48
Authors
Maior de Barros, Roberto Souto [1]
de Carvalho Santos, Silas Garrido T. [1]
Affiliation
[1] Univ Fed Pernambuco, Ctr Informat, BR-50740560 Recife, PE, Brazil
Keywords
Concept drift; Ensembles; Detectors; Large-scale comparison; Data stream; Online learning; WEIGHTED-MAJORITY; ONLINE; CLASSIFIERS;
DOI
10.1016/j.inffus.2019.03.006
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Online learning is about extracting information from large data streams which may be affected by changes in the distribution of the data, events known as concept drift. Concept drift detectors are small programs that try to detect these changes and make it possible to replace the base classifier, improving the overall accuracy. Ensembles of classifiers are also common in this application area and some of them are configurable with a drift detector. This article summarizes a large-scale comparison of six ensemble algorithms, configured with 10 different drift detectors, for learning from fully labeled data streams, using a large number of artificial datasets and two popular base learners in the area: Naive Bayes and Hoeffding Tree. In addition, the code of one of the ensembles (Leveraging Bagging) was modified to permit its configuration with any drift detector: its original implementation only uses ADWIN. The goal is to assess how good the existing ensemble algorithms configurable with detectors really are and also to verify and challenge a common belief in the area. The results of the experiments suggest that, in most datasets, the choice of ensemble algorithm has much more impact on the final accuracy than the choice of drift detector used in its configuration. They also suggest that the best auxiliary detectors to configure the ensembles, i.e. those that maximize the accuracy of the ensembles, are only marginally different from the best detectors in the same datasets in terms of their accuracies (recently reported in another article).
Pages: 213 - 244
Number of pages: 32