An online ensembles approach for handling concept drift in data streams: diversified online ensembles detection

被引:0
|
作者
Parneeta Sidhu
M. P. S. Bhatia
机构
[1] Netaji Subhas Institute of Technology,Division of CoE
关键词
Concept drift; Ensemble; Diversity; Data stream; Online learning;
D O I
暂无
中图分类号
学科分类号
摘要
Data Streams are continuous data instances arriving at a very high speed with varying underlying conceptual distribution. We present a novel online ensemble approach, Diversified online ensembles detection (DOED), for handling these drifting concepts in data streams. Our approach maintains two ensembles of weighted experts, an ensemble with low diversity and an ensemble with high diversity, which are updated as per their accuracy in classifying the new data instances. Our approach detects drifts by comparing the two accuracies: an accuracy of an ensemble on the recent examples and its accuracy since the beginning of the learning. The final prediction for an instance is the class predicted by the ensemble which gives better accuracy in classifying the recent examples. When a drift is detected by an ensemble, it is reinitialized still maintaining its diversity levels. Experimental evaluation using various artificial and real-world datasets proves that DOED provides very high accuracy in classifying new data instances, irrespective of the size of dataset, type of drift or presence of noise. We compare DOED with the other learners in terms of new performance metrics such as kappa statistic, model cost, and the evaluation time and memory requirements. Our approach proved to be highly resource effective achieving very high accuracies even in a resource constrained environment.
引用
下载
收藏
页码:883 / 909
页数:26
相关论文
共 50 条
  • [1] An online ensembles approach for handling concept drift in data streams: diversified online ensembles detection
    Sidhu, Parneeta
    Bhatia, M. P. S.
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) : 883 - 909
  • [2] Comparing Block Ensembles for Data Streams with Concept Drift
    Deckert, Magdalena
    Stefanowski, Jerzy
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, 2013, 185 : 69 - 78
  • [3] Dynamic adaptation of online ensembles for drifting data streams
    Olorunnimbe, M. Kehinde
    Viktor, Herna L.
    Paquet, Eric
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (02) : 291 - 313
  • [4] Dynamic adaptation of online ensembles for drifting data streams
    M. Kehinde Olorunnimbe
    Herna L. Viktor
    Eric Paquet
    Journal of Intelligent Information Systems, 2018, 50 : 291 - 313
  • [5] Online Clustering for Novelty Detection and Concept Drift in Data Streams
    Garcia, Kemilly Dearo
    Poel, Mannes
    Kok, Joost N.
    de Carvalho, Andre C. P. L. F.
    PROGRESS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11805 : 448 - 459
  • [6] A Stable and Online Approach to Detect Concept Drift in Data Streams
    da Costa, Fausto Guzzo
    de Mello, Rodrigo Fernandes
    2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 330 - 335
  • [7] A Novel Online Ensemble Approach for Concept Drift in Data Streams
    Sidhu, Parneeta
    Bhatia, M. P. S.
    Bindal, Aditya
    2013 IEEE SECOND INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2013, : 550 - 555
  • [8] Dynamically Adjusting Diversity in Ensembles for the Classification of Data Streams with Concept Drift
    Hidalgo, Juan I. G.
    Santos, Silas G. T. C.
    Barros, Roberto S. M.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (02)
  • [9] A Theoretical Framework on the Ideal Number of Classifiers for Online Ensembles in Data Streams
    Bonab, Hamed R.
    Can, Fazli
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 2053 - 2056
  • [10] Online Feature Screening for Data Streams With Concept Drift
    Wang, Mingyuan
    Barbu, Adrian
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11693 - 11707