SDDM: an interpretable statistical concept drift detection method for data streams

Authors
Simona Micevska
Ahmed Awad
Sherif Sakr
Affiliations
[1] University of Tartu
[2] Nile University
Keywords
Online machine learning; Concept drift detection; Data streams analytics
DOI
Not available
Abstract
Machine learning models assume that data is drawn from a stationary distribution. In practice, however, models must make sense of fast-evolving data streams whose content changes over time. A change between the distribution of the training data seen so far and the distribution of newly arriving data is called concept drift. Detecting concept drift is essential to maintaining the accuracy and reliability of online classifiers. Reactive drift detectors monitor the performance of the underlying machine learning model. That is, to detect a drift, feedback on the classifier output has to be given to the drift detector, a setup known as prequential evaluation. In many real-life scenarios, immediate feedback on classifier output is not possible, so drift detection is delayed and loses its context. Moreover, the drift detector's output is a binary answer: drift or no drift. It is equally important to explain the source of the drift. In this paper, we present the Statistical Drift Detection Method (SDDM), which detects drifts by monitoring changes in the data distribution without needing feedback on classifier output. Moreover, the detection is quantified and the source of drift is identified. We empirically evaluate our method against the state of the art on both synthetic and real-life data sets. SDDM outperforms related approaches by producing fewer false positives and false negatives.
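To make the idea of label-free, distribution-based drift detection concrete, the following is a minimal sketch and not the SDDM algorithm itself, whose details the abstract does not give. It assumes a reference window and a current window of numeric feature rows, estimates each feature's distribution with a smoothed histogram, and scores the change with the Kullback-Leibler divergence. The function names, the bin count, and the threshold of 0.1 are all illustrative assumptions.

```python
import math
import random


def histogram(values, edges):
    """Return Laplace-smoothed bin probabilities for values over fixed edges."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        idx = 0
        # Walk right until v falls inside the bin; clamp to the last bin.
        while idx < len(counts) - 1 and v >= edges[idx + 1]:
            idx += 1
        counts[idx] += 1
    total = len(values) + len(counts)  # +1 per bin avoids zero probabilities
    return [(c + 1) / total for c in counts]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions with no zero entries."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


def feature_drift_scores(reference, current, bins=10):
    """Per-feature KL divergence of the current window from the reference."""
    scores = []
    for j in range(len(reference[0])):
        ref_col = [row[j] for row in reference]
        cur_col = [row[j] for row in current]
        lo = min(ref_col + cur_col)
        hi = max(ref_col + cur_col)
        width = (hi - lo) / bins or 1.0  # guard against a constant feature
        edges = [lo + i * width for i in range(bins + 1)]
        scores.append(kl_divergence(histogram(cur_col, edges),
                                    histogram(ref_col, edges)))
    return scores


def detect_drift(reference, current, threshold=0.1, bins=10):
    """Return (drift flag, indices of drifted features, per-feature scores).

    The threshold is a hypothetical tuning knob, not a value from the paper.
    """
    scores = feature_drift_scores(reference, current, bins)
    drifted = [j for j, s in enumerate(scores) if s > threshold]
    return bool(drifted), drifted, scores


# Synthetic demo: feature 0 is stable, feature 1 shifts its mean from 0 to 3.
rng = random.Random(0)
reference = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(2000)]
current = [[rng.gauss(0, 1), rng.gauss(3, 1)] for _ in range(2000)]
drift, drifted_features, scores = detect_drift(reference, current)
print(drift, drifted_features)
```

No class labels or classifier feedback are used anywhere in this sketch. The per-feature scores give a quantified measure of change, and the list of drifted feature indices points to where the change comes from, which mirrors the two properties the abstract emphasizes.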
Pages: 459-484
Page count: 25