An overview and comprehensive comparison of ensembles for concept drift

被引：48

作者：

Maior de Barros, Roberto Souto ^{[1
]}

de Carvalho Santos, Silas Garrido T. ^{[1
]}

机构：

[1] Univ Fed Pernambuco, Ctr Informat, BR-50740560 Recife, PE, Brazil

来源：

INFORMATION FUSION | 2019年 / 52卷

关键词：

Concept drift; Ensembles; Detectors; Large-scale comparison; Data stream; Online learning; WEIGHTED-MAJORITY; ONLINE; CLASSIFIERS;

D O I：

10.1016/j.inffus.2019.03.006

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Online learning is about extracting information from large data streams which may be affected by changes in the distribution of the data, events known as concept drift. Concept drift detectors are small programs that try to detect these changes and make it possible to replace the base classifier, improving the overall accuracy. Ensembles of classifiers are also common in this application area and some of them are configurable with a drift detector. This article summarizes a large-scale comparison of six ensemble algorithms, configured with 10 different drift detectors, for learning from fully labeled data streams, using a large number of artificial datasets and two popular base learners in the area: Naive Bayes and Hoeffding Tree. In addition, the code of one the ensembles (Leveraging Bagging) was modified to permit its configuration with any drift detector: its original implementation only uses ADWIN. The goal is to assess how good the existing ensemble algorithms configurable with detectors really are and also to verify and challenge a common belief in the area. The results of the experiments suggest that, in most datasets, the choice of ensemble algorithm has much more impact on the final accuracy than the choice of drift detector used in its configuration. They also suggest the best auxiliary detectors to configure the ensembles, i.e. those that maximize the accuracy of the ensembles, are only marginally different from the best detectors in the same datasets in terms of their accuracies (recently reported in another article).

引用

页码：213 / 244

页数：32

共 50 条

[1] Improving Diversity in Concept Drift Ensembles
Martinez Perez, Jose Luis
Palomino Marino, Laura Maria
Maior de Barros, Roberto Souto
2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
[2] Comparing Block Ensembles for Data Streams with Concept Drift
Deckert, Magdalena
Stefanowski, Jerzy
NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, 2013, 185 : 69 - 78
[3] Classifier Ensembles for Virtual Concept Drift - The DEnBoost Algorithm
Bartocha, Kamil
Podolak, Igor T.
HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PART II, 2011, 6679 : 164 - 171
[4] Ensembles of Heterogeneous Concept Drift Detectors - Experimental Study
Wozniak, Michal
Ksieniewicz, Pawel
Cyganek, Boguslaw
Walkowiak, Krzysztof
COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2016, 2016, 9842 : 538 - 549
[5] Using Diversity Ensembles with Time Limits to Handle Concept Drift
Van Camp, Robert
IEEE SOUTHEASTCON 2018, 2018,
[6] Adaptive cascade of boosted ensembles for face detection in concept drift
Susnjak, Teo
Barczak, Andre L. C.
Hawick, Ken A.
NEURAL COMPUTING & APPLICATIONS, 2012, 21 (04): : 671 - 682
[7] Using Evolving Ensembles to Deal with Concept Drift in Streaming Scenarios
Ramos, Diogo
Carneiro, Davide
Novais, Paulo
INTELLIGENT DISTRIBUTED COMPUTING XIV, 2022, 1026 : 59 - 68
[8] A First Attempt to Construct Effective Concept Drift Detector Ensembles
Wozniak, Michal
Ksieniewicz, Pawel
Kasprzak, Andrzej
Puchala, Karol
Ryba, Przemyslaw
IMAGE PROCESSING AND COMMUNICATIONS CHALLENGES 8, 2017, 525 : 27 - 34
[9] Adaptive cascade of boosted ensembles for face detection in concept drift
Teo Susnjak
Andre L. C. Barczak
Ken A. Hawick
Neural Computing and Applications, 2012, 21 : 671 - 682
[10] An online ensembles approach for handling concept drift in data streams: diversified online ensembles detection
Sidhu, Parneeta
Bhatia, M. P. S.
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) : 883 - 909

← 1 2 3 4 5 →