An overview and comprehensive comparison of ensembles for concept drift

被引:48
|
作者
Maior de Barros, Roberto Souto [1 ]
de Carvalho Santos, Silas Garrido T. [1 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, BR-50740560 Recife, PE, Brazil
关键词
Concept drift; Ensembles; Detectors; Large-scale comparison; Data stream; Online learning; WEIGHTED-MAJORITY; ONLINE; CLASSIFIERS;
D O I
10.1016/j.inffus.2019.03.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online learning is about extracting information from large data streams which may be affected by changes in the distribution of the data, events known as concept drift. Concept drift detectors are small programs that try to detect these changes and make it possible to replace the base classifier, improving the overall accuracy. Ensembles of classifiers are also common in this application area and some of them are configurable with a drift detector. This article summarizes a large-scale comparison of six ensemble algorithms, configured with 10 different drift detectors, for learning from fully labeled data streams, using a large number of artificial datasets and two popular base learners in the area: Naive Bayes and Hoeffding Tree. In addition, the code of one the ensembles (Leveraging Bagging) was modified to permit its configuration with any drift detector: its original implementation only uses ADWIN. The goal is to assess how good the existing ensemble algorithms configurable with detectors really are and also to verify and challenge a common belief in the area. The results of the experiments suggest that, in most datasets, the choice of ensemble algorithm has much more impact on the final accuracy than the choice of drift detector used in its configuration. They also suggest the best auxiliary detectors to configure the ensembles, i.e. those that maximize the accuracy of the ensembles, are only marginally different from the best detectors in the same datasets in terms of their accuracies (recently reported in another article).
引用
收藏
页码:213 / 244
页数:32
相关论文
共 50 条
  • [21] INCREMENTAL RULE-BASED LEARNERS FOR HANDLING CONCEPT DRIFT: AN OVERVIEW
    Deckert, Magdalena
    FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2013, 38 (01) : 35 - 65
  • [22] Overview of concept drift detection for industrial process soft sensor modeling
    Qiao J.-F.
    Sun Z.-J.
    Tang J.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2021, 38 (08): : 1159 - 1174
  • [23] A large-scale comparison of concept drift detectors
    Maior Barros, Roberto Souto
    Carvalho Santos, Silas Garrido T.
    INFORMATION SCIENCES, 2018, 451 : 348 - 370
  • [24] Ensembles of Long Short-Term Memory Experts for Streaming Data with Sudden Concept Drift
    Apfeld, Sabine
    Charlish, Alexander
    Ascheid, Gerd
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 716 - 723
  • [25] A comprehensive active learning method for multiclass imbalanced data streams with concept drift
    Liu, Weike
    Zhang, Hang
    Ding, Zhaoyun
    Liu, Qingbao
    Zhu, Cheng
    KNOWLEDGE-BASED SYSTEMS, 2021, 215
  • [26] NEW CONCEPT OF STATISTICAL ENSEMBLES
    Gorenstein, M. I.
    JOURNAL OF PHYSICAL STUDIES, 2009, 13 (04):
  • [27] Concept drift from 1980 to 2020: a comprehensive bibliometric analysis with future research insight
    Baburoglu, Elif Selen
    Durmusoglu, Alptekin
    Dereli, Turkay
    EVOLVING SYSTEMS, 2024, 15 (03) : 789 - 809
  • [28] Comparison based analysis of window approach for concept drift detection and adaptation
    Agrahari, Supriya
    Singh, Anil Kumar
    Applied Intelligence, 2025, 55 (01)
  • [29] Rotational Drift Spectroscopy for Magnetic Particle Ensembles
    Rueckert, Martin A.
    Vogel, Patrick
    Vilter, Anna
    Kullmann, Walter H.
    Jakob, Peter M.
    Behr, Volker C.
    IEEE TRANSACTIONS ON MAGNETICS, 2015, 51 (02) : 6500604
  • [30] Characterizing concept drift
    Webb, Geoffrey I.
    Hyde, Roy
    Cao, Hong
    Hai Long Nguyen
    Petitjean, Francois
    DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (04) : 964 - 994