An online ensembles approach for handling concept drift in data streams: diversified online ensembles detection

被引:0
|
作者
Parneeta Sidhu
M. P. S. Bhatia
机构
[1] Netaji Subhas Institute of Technology,Division of CoE
关键词
Concept drift; Ensemble; Diversity; Data stream; Online learning;
D O I
暂无
中图分类号
学科分类号
摘要
Data Streams are continuous data instances arriving at a very high speed with varying underlying conceptual distribution. We present a novel online ensemble approach, Diversified online ensembles detection (DOED), for handling these drifting concepts in data streams. Our approach maintains two ensembles of weighted experts, an ensemble with low diversity and an ensemble with high diversity, which are updated as per their accuracy in classifying the new data instances. Our approach detects drifts by comparing the two accuracies: an accuracy of an ensemble on the recent examples and its accuracy since the beginning of the learning. The final prediction for an instance is the class predicted by the ensemble which gives better accuracy in classifying the recent examples. When a drift is detected by an ensemble, it is reinitialized still maintaining its diversity levels. Experimental evaluation using various artificial and real-world datasets proves that DOED provides very high accuracy in classifying new data instances, irrespective of the size of dataset, type of drift or presence of noise. We compare DOED with the other learners in terms of new performance metrics such as kappa statistic, model cost, and the evaluation time and memory requirements. Our approach proved to be highly resource effective achieving very high accuracies even in a resource constrained environment.
引用
下载
收藏
页码:883 / 909
页数:26
相关论文
共 50 条
  • [41] Classifier Ensembles for Virtual Concept Drift - The DEnBoost Algorithm
    Bartocha, Kamil
    Podolak, Igor T.
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PART II, 2011, 6679 : 164 - 171
  • [42] Ensembles of Heterogeneous Concept Drift Detectors - Experimental Study
    Wozniak, Michal
    Ksieniewicz, Pawel
    Cyganek, Boguslaw
    Walkowiak, Krzysztof
    COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2016, 2016, 9842 : 538 - 549
  • [43] Dynamically adaptive and diverse dual ensemble learning approach for handling concept drift in data streams
    Goel, Kanu
    Batra, Shalini
    COMPUTATIONAL INTELLIGENCE, 2022, 38 (02) : 463 - 505
  • [44] Towards Online Concept Drift Detection with Feature Selection for Data Stream Classification
    Hammoodi, Mahmood
    Stahl, Frederic
    Tennant, Mark
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1549 - 1550
  • [45] Anytime clustering of data streams while handling noise and concept drift
    Challa, Jagat Sesh
    Goyal, Poonam
    Kokandakar, Ajinkya
    Mantri, Dhananjay
    Verma, Pranet
    Balasubramaniam, Sundar
    Goyal, Navneet
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2022, 34 (03) : 399 - 429
  • [46] The use of time stamps in handling latency and concept drift in online learning
    Marrs, G. R.
    Black, M. M.
    Hickey, R. J.
    EVOLVING SYSTEMS, 2012, 3 (04) : 203 - 220
  • [47] A Method Aware of Concept Drift for Online Botnet Detection
    Schwengber, Bruno Henrique
    Vergutz, Andressa
    Prates, Nelson G., Jr.
    Nogueira, Michele
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [48] Concept drift detection and accelerated convergence of online learning
    Husheng Guo
    Hai Li
    Ni Sun
    Qiaoyan Ren
    Aijuan Zhang
    Wenjian Wang
    Knowledge and Information Systems, 2023, 65 : 1005 - 1043
  • [49] Concept drift detection and accelerated convergence of online learning
    Guo, Husheng
    Li, Hai
    Sun, Ni
    Ren, Qiaoyan
    Zhang, Aijuan
    Wang, Wenjian
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (03) : 1005 - 1043
  • [50] OHODIN - Online Anomaly Detection for Data Streams
    Gruhl, Christian
    Tomforde, Sven
    2021 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS COMPANION (ACSOS-C 2021), 2021, : 193 - 197