An online ensembles approach for handling concept drift in data streams: diversified online ensembles detection

被引:0
|
作者
Parneeta Sidhu
M. P. S. Bhatia
机构
[1] Netaji Subhas Institute of Technology,Division of CoE
关键词
Concept drift; Ensemble; Diversity; Data stream; Online learning;
D O I
暂无
中图分类号
学科分类号
摘要
Data Streams are continuous data instances arriving at a very high speed with varying underlying conceptual distribution. We present a novel online ensemble approach, Diversified online ensembles detection (DOED), for handling these drifting concepts in data streams. Our approach maintains two ensembles of weighted experts, an ensemble with low diversity and an ensemble with high diversity, which are updated as per their accuracy in classifying the new data instances. Our approach detects drifts by comparing the two accuracies: an accuracy of an ensemble on the recent examples and its accuracy since the beginning of the learning. The final prediction for an instance is the class predicted by the ensemble which gives better accuracy in classifying the recent examples. When a drift is detected by an ensemble, it is reinitialized still maintaining its diversity levels. Experimental evaluation using various artificial and real-world datasets proves that DOED provides very high accuracy in classifying new data instances, irrespective of the size of dataset, type of drift or presence of noise. We compare DOED with the other learners in terms of new performance metrics such as kappa statistic, model cost, and the evaluation time and memory requirements. Our approach proved to be highly resource effective achieving very high accuracies even in a resource constrained environment.
引用
下载
收藏
页码:883 / 909
页数:26
相关论文
共 50 条
  • [21] Online Ensemble Using Adaptive Windowing for Data Streams with Concept Drift
    Sun, Yange
    Wang, Zhihai
    Liu, Haiyang
    Du, Chao
    Yuan, Jidong
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2016,
  • [22] Adaptive cascade of boosted ensembles for face detection in concept drift
    Susnjak, Teo
    Barczak, Andre L. C.
    Hawick, Ken A.
    NEURAL COMPUTING & APPLICATIONS, 2012, 21 (04): : 671 - 682
  • [23] Improving Diversity in Concept Drift Ensembles
    Martinez Perez, Jose Luis
    Palomino Marino, Laura Maria
    Maior de Barros, Roberto Souto
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [24] Adaptive cascade of boosted ensembles for face detection in concept drift
    Teo Susnjak
    Andre L. C. Barczak
    Ken A. Hawick
    Neural Computing and Applications, 2012, 21 : 671 - 682
  • [25] Online breakage detection of multitooth tools using classifier ensembles for imbalanced data
    Bustillo, Andres
    Rodriguez, Juan J.
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2014, 45 (12) : 2590 - 2602
  • [26] Shrub Ensembles for Online Classification
    Buschjaeger, Sebastian
    Hess, Sibylle
    Morik, Katharina J.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6123 - 6131
  • [27] Dynamic Classification Ensembles for Handling Imbalanced Multiclass Drifted Data Streams
    Madkour A.H.
    Abdelkader H.M.
    Mohammed A.M.
    Information Sciences, 2024, 670
  • [28] Generalized CMAC Adaptive Ensembles for Concept-Drifting Data Streams
    Gonzalez-Serrano, Francisco J.
    Figueiras-Vidal, Anibal R.
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2669 - 2673
  • [29] Diversified SVM ensembles for large data sets
    Tsang, Ivor W.
    Kocsor, Andras
    Kwok, James T.
    MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 792 - 800
  • [30] An overview and comprehensive comparison of ensembles for concept drift
    Maior de Barros, Roberto Souto
    de Carvalho Santos, Silas Garrido T.
    INFORMATION FUSION, 2019, 52 : 213 - 244