Concept Drift Detection on Streaming Data with Dynamic Outlier Aggregation

被引:5
|
作者
Zellner, Ludwig [1 ]
Richter, Florian [1 ]
Sontheim, Janina [1 ]
Maldonado, Andrea [1 ]
Seidl, Thomas [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Munich, Germany
关键词
Concept drift detection; Local outlier factor; Micro-clusters;
D O I
10.1007/978-3-030-72693-5_16
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many processes no matter what kind are regularly changing over time, adapting themselves to external and internal circumstances. Analyzing them in a streaming context is a very demanding task. Particularly the detection and classification of significant deviations is important to be able to re-integrate these possible micro-processes. Assuming a deviation of a certain process, the significance is implicitly given when a high number of instances contain this deviation similarly. To enhance a process the integration of or preventive measures against those anomalies is of high interest for all stakeholders as the actual process core gets discovered more and more in detail. Considering various areas of application, we focus on previously neglected but potentially significant anomalies like small changes in the disease process of a virus infection that has to be discovered to develop an appropriate reaction mechanism. We concentrate on non-conforming traces of a stream on which we compute a local outlier factor. This allows us to detect relations between traces based on changing outlier scores. Hence, hereby connected traces are clusters with which we achieve the detection of concept drift. We evaluate our approach on a synthetic event log and a real-world dataset corresponding to a process representing building permit applications which emphasizes the extensive applicability.
引用
下载
收藏
页码:206 / 217
页数:12
相关论文
共 50 条
  • [41] Multivariate Outlier Detection for Forest Fire Data Aggregation Accuracy
    Alkhatib, Ahmad A. A.
    Abed-Al, Qusai
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (02): : 1071 - 1087
  • [42] Forgetful Forests: Data Structures for Machine Learning on Streaming Data under Concept Drift
    Yuan, Zhehu
    Sun, Yinqi
    Shasha, Dennis
    ALGORITHMS, 2023, 16 (06)
  • [43] Outlier detection with streaming dyadic decomposition
    Gupta, Chetan
    Grossman, Robert
    ADVANCES IN DATA MINING: THEORETICAL ASPECTS AND APPLICATIONS, PROCEEDINGS, 2007, 4597 : 77 - +
  • [44] Dynamic Road Anomaly Detection: Harnessing Smartphone Accelerometer Data with Incremental Concept Drift Detection and Classification
    Ferjani, Imen
    Alsaif, Suleiman Ali
    Sensors, 2024, 24 (24)
  • [45] Diversity measure as a new drift detection method in data streaming
    Mahdi, Osama A.
    Pardede, Eric
    Ali, Nawfal
    Cao, Jinli
    KNOWLEDGE-BASED SYSTEMS, 2020, 191
  • [46] Handling Concept Drift in Data Streams by Using Drift Detection Methods
    Patil, Malini M.
    DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2018, VOL 2, 2019, 839 : 155 - 166
  • [47] A Modified Outlier Detection Method in Dynamic Data Reconciliation
    周凌柯
    苏宏业
    褚健
    Chinese Journal of Chemical Engineering, 2005, (04) : 542 - 547
  • [48] A strategy for simultaneous dynamic data reconciliation and outlier detection
    Chen, J
    Romagnoli, JA
    COMPUTERS & CHEMICAL ENGINEERING, 1998, 22 (4-5) : 559 - 562
  • [49] A modified outlier detection method in dynamic data reconciliation
    Zhou, LK
    Su, HY
    Chu, J
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2005, 13 (04) : 542 - 547
  • [50] A grid density based framework for classifying streaming data in the presence of concept drift
    Tegjyot Singh Sethi
    Mehmed Kantardzic
    Hanquing Hu
    Journal of Intelligent Information Systems, 2016, 46 : 179 - 211