Concept Drift Detection for Streaming Data

被引:0
|
作者
Wang, Heng [1 ]
Abraham, Zubin [2 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Robert Bosch LLC, Res & Technol Ctr North Amer, Gerlingen, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Common statistical prediction models often require and assume stationarity in the data. However, in many practical applications, changes in the relationship of the response and predictor variables are regularly observed over time, resulting in the deterioration of the predictive performance of these models. This paper presents Linear Four Rates (LFR), a framework for detecting these concept drifts and subsequently identifying the data points that belong to the new concept (for relearning the model). Unlike conventional concept drift detection approaches, LFR can be applied to both batch and stream data; is not limited by the distribution properties of the response variable (e.g., datasets with imbalanced labels); is independent of the underlying statistical-model; and uses user-specified parameters that are intuitively comprehensible. The performance of LFR is compared to benchmark approaches using both simulated and commonly used public datasets that span the gamut of concept drift types. The results show LFR significantly outperforms benchmark approaches in terms of recall, accuracy and delay in detection of concept drifts across datasets.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] On the reliable detection of concept drift from streaming unlabeled data
    Sethi, Tegjyot Singh
    Kantardzic, Mehmed
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 82 : 77 - 99
  • [2] Concept Drift Detection on Streaming Data with Dynamic Outlier Aggregation
    Zellner, Ludwig
    Richter, Florian
    Sontheim, Janina
    Maldonado, Andrea
    Seidl, Thomas
    [J]. PROCESS MINING WORKSHOPS, ICPM 2020 INTERNATIONAL WORKSHOPS, 2021, 406 : 206 - 217
  • [3] Concept Drift Detection on Streaming Data under Limited Labeling
    Kim, Young In
    Park, Cheong Hee
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2016, : 273 - 280
  • [4] Streaming Data Classification with Concept Drift
    Althabiti, Mashail
    Abdullah, Manal
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2019, 12 (01): : 177 - 184
  • [5] Ensemble framework for concept-drift detection in multidimensional streaming data
    Prasad K.S.N.
    Rao A.S.
    Ramana A.V.
    [J]. International Journal of Computers and Applications, 2022, 44 (12) : 1193 - 1200
  • [6] Handling adversarial concept drift in streaming data
    Sethi, Tegjyot Singh
    Kantardzic, Mehmed
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 97 : 18 - 40
  • [7] Temporal Attention for Few-Shot Concept Drift Detection in Streaming Data
    Lin, Ximing
    Chang, Longtao
    Nie, Xiushan
    Dong, Fei
    [J]. ELECTRONICS, 2024, 13 (11)
  • [8] No Free Lunch Theorem for concept drift detection in streaming data classification: A review
    Hu, Hanqing
    Kantardzic, Mehmed
    Sethi, Tegjyot S.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (02)
  • [9] An Efficient Concept Drift Detection Method for Streaming Data under Limited Labeling
    Kim, Youngin
    Park, Cheong Hee
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (10): : 2537 - 2546
  • [10] Concept drift detection with quadtree-based spatial mapping of streaming data
    Coelho, Rodrigo Amador
    Torres, Luiz Carlos Bambirra
    de Castro, Cristiano Leite
    [J]. INFORMATION SCIENCES, 2023, 625 : 578 - 592