Integrated detection and localization of concept drifts in process mining with batch and stream trace clustering support

被引:4
|
作者
de Sousa, Rafael Gaspar [1 ]
Meira Neto, Antonio Carlos [1 ]
Fantinato, Marcelo [1 ]
Peres, Sarajane Marques [1 ]
Reijers, Hajo Alexander [2 ]
机构
[1] Univ Sao Paulo, Sch Arts Sci & Humanities, Sao Paulo, Brazil
[2] Univ Utrecht, Dept Informat & Comp Sci, Utrecht, Netherlands
基金
巴西圣保罗研究基金会;
关键词
Concept drift; Trace clustering; Process mining; Business processes; Data mining;
D O I
10.1016/j.datak.2023.102253
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Process mining can help organizations by extracting knowledge from event logs. However, process mining techniques often assume business processes are stationary, while actual business processes are constantly subject to change because of the complexity of organizations and their external environment. Thus, addressing process changes over time - known as concept drifts - allows for a better understanding of process behavior and can provide a competitive edge for organizations, especially in an online data stream scenario. Current approaches to handling process concept drift focus primarily on detecting and locating concept drifts, often through an integrated, albeit offline, approach. However, part of these integrated approaches rely on complex data structures related to tree-based process models, usually discovered through algorithms whose results are influenced by specific heuristic rules. Moreover, most of the proposed approaches have not been tested on public true concept drift-labeled event logs commonly used as benchmark, making comparative analysis difficult. In this article, we propose an online approach to detect and localize concept drifts in an integrated way using batch and stream trace clustering support. In our approach, cluster models provide input information for both concept drift detection and localization methods. Each cluster abstracts a behavior profile underlying the process and reveals descriptive information about the discovered concept drifts. Experiments with benchmark synthetic event logs with different control-flow changes, as well as with real-world event logs, showed that our approach, when relying on the same clustering model, is competitive in relation to baselines concept drift detection method. In addition, the experiment showed our approach is able to correctly locate the concept drifts detected and allows the analysis of such concept drifts through different process behavior profiles.
引用
收藏
页数:33
相关论文
共 50 条
  • [31] Leveraging Process Discovery with Trace Clustering and Text Mining for Intelligent Analysis of Incident Management Processes
    De Weerdt, Jochen
    vanden Broucke, Seppe K. L. M.
    Vanthienen, Jan
    Baesens, Bart
    2012 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2012,
  • [32] Novel Class Detection in Concept-Drifting Data Stream Mining Employing Decision Tree
    Farid, Dewan Md
    Rahman, Chowdhury Mofizur
    2012 7TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2012,
  • [33] Correction: Adversarial concept drift detection under poisoning attacks for robust data stream mining
    Łukasz Korycki
    Bartosz Krawczyk
    Machine Learning, 2024, 113 : 3303 - 3304
  • [34] Wafer-to-Wafer Process Fault Detection Using Data Stream Mining Techniques
    Ko, Jong Myoung
    Hong, Seong Rok
    Choi, Ja Young
    Kim, Chang Ouk
    INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2013, 14 (01) : 103 - 113
  • [35] Wafer-to-wafer process fault detection using data stream mining techniques
    Jong Myoung Ko
    Seong Rok Hong
    Ja Young Choi
    Chang Ouk Kim
    International Journal of Precision Engineering and Manufacturing, 2013, 14 : 103 - 113
  • [36] Looking for Change: A Computer Vision Approach for Concept Drift Detection in Process Mining
    Kraus, Alexander
    van der Aa, Han
    BUSINESS PROCESS MANAGEMENT, BPM 2024, 2024, 14940 : 273 - 290
  • [37] A novel clustering approach and adaptive SVM classifier for intrusion detection in WSN: A data mining concept
    Borkar, Gautam M.
    Patil, Leena H.
    Dalgade, Dilip
    Hutke, Ankush
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2019, 23 (120-135): : 120 - 135
  • [38] Concept for IT System-based Value Stream Mapping - Generation of Value Streams using Process Mining
    Klenk E.
    ZWF Zeitschrift fuer Wirtschaftlichen Fabrikbetrieb, 2019, 114 (09): : 513 - 516
  • [39] Deterministic Concept Drift Detection in Ensemble Classifier Based Data Stream Classification Process
    Abdualrhman, Mohammed Ahmed Ali
    Padma, M. C.
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2019, 11 (01) : 29 - 48
  • [40] Sustainability-focused digital value stream mapping: Concept for employing process mining for sustainability-focused value stream mapping
    Horsthofer-Rauch J.
    Vernim S.
    Reinhart G.
    ZWF Zeitschrift fuer Wirtschaftlichen Fabrikbetrieb, 2021, 116 (09): : 590 - 593