Integrated detection and localization of concept drifts in process mining with batch and stream trace clustering support

Cited by: 4
Authors
de Sousa, Rafael Gaspar [1 ]
Meira Neto, Antonio Carlos [1 ]
Fantinato, Marcelo [1 ]
Peres, Sarajane Marques [1 ]
Reijers, Hajo Alexander [2 ]
Affiliations
[1] Univ Sao Paulo, Sch Arts Sci & Humanities, Sao Paulo, Brazil
[2] Univ Utrecht, Dept Informat & Comp Sci, Utrecht, Netherlands
Funding
Sao Paulo Research Foundation, Brazil;
Keywords
Concept drift; Trace clustering; Process mining; Business processes; Data mining;
DOI
10.1016/j.datak.2023.102253
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Process mining can help organizations by extracting knowledge from event logs. However, process mining techniques often assume business processes are stationary, while actual business processes are constantly subject to change because of the complexity of organizations and their external environment. Thus, addressing process changes over time - known as concept drifts - allows for a better understanding of process behavior and can provide a competitive edge for organizations, especially in an online data stream scenario. Current approaches to handling process concept drift focus primarily on detecting and locating concept drifts, often through an integrated, albeit offline, approach. However, some of these integrated approaches rely on complex data structures related to tree-based process models, usually discovered through algorithms whose results are influenced by specific heuristic rules. Moreover, most of the proposed approaches have not been tested on the public event logs labeled with true concept drifts that are commonly used as benchmarks, making comparative analysis difficult. In this article, we propose an online approach to detect and localize concept drifts in an integrated way using batch and stream trace clustering support. In our approach, cluster models provide input information for both concept drift detection and localization methods. Each cluster abstracts a behavior profile underlying the process and reveals descriptive information about the discovered concept drifts. Experiments with benchmark synthetic event logs with different control-flow changes, as well as with real-world event logs, showed that our approach, when relying on the same clustering model, is competitive with baseline concept drift detection methods. In addition, the experiments showed that our approach is able to correctly locate the detected concept drifts and allows the analysis of such concept drifts through different process behavior profiles.
Pages: 33
Related papers
50 in total
  • [21] A Framework for Explainable Concept Drift Detection in Process Mining
    Adams, Jan Niklas
    van Zelst, Sebastiaan J.
    Quack, Lara
    Hausmann, Kathrin
    van der Aalst, Wil M. P.
    Rose, Thomas
    BUSINESS PROCESS MANAGEMENT (BPM 2021), 2021, 12875 : 400 - 416
  • [22] Comparing Concept Drift Detection with Process Mining Tools
    Omori, Nicolas Jashchenko
    Tavares, Gabriel Marques
    Ceravolo, Paolo
    Barbon, Sylvio, Jr.
    PROCEEDINGS OF THE XV BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS, SBSI 2019: Complexity on Modern Information Systems, 2019,
  • [23] Anomaly Detection in Business Process based on Data Stream Mining
    Tavares, Gabriel Marques
    Turrisi da Costa, Victor G.
    Martins, Vinicius Eiji
    Ceravolo, Paolo
    Barbon, Sylvio, Jr.
    PROCEEDINGS OF THE 14TH BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS (SBSI2018), 2018, : 120 - 127
  • [24] Process Duration Modelling and Concept Drift Detection for Business Process Mining
    Yang, Lingkai
    McClean, Sally
    Donnelly, Mark
    Burke, Kevin
    Khan, Kashaf
    2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 653 - 658
  • [25] Concept Drift Detection in Data Stream Clustering and its Application on Weather Data
    Namitha, K.
    Kumar, Santhosh G.
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2020, 11 (01) : 67 - 85
  • [26] DIAG Approach: Introducing the Cognitive Process Mining by an Ontology-Driven Approach to Diagnose and Explain Concept Drifts
    Araghi, Sina Namaki
    Fontanili, Franck
    Sarkar, Arkopaul
    Lamine, Elyes
    Karray, Mohamed-Hedi
    Benaben, Frederick
    MODELLING, 2024, 5 (01) : 85 - 98
  • [27] Adversarial concept drift detection under poisoning attacks for robust data stream mining
    Łukasz Korycki
    Bartosz Krawczyk
    Machine Learning, 2023, 112 : 4013 - 4048
  • [28] Adversarial concept drift detection under poisoning attacks for robust data stream mining
    Korycki, Lukasz
    Krawczyk, Bartosz
    MACHINE LEARNING, 2023, 112 (10) : 4013 - 4048
  • [29] Early detection of gradual concept drifts by text categorization and Support Vector Machine techniques: The TRIO algorithm
    Marseguerra, M.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2014, 129 : 1 - 9
  • [30] Visualization for enabling human-in-the-loop in trace clustering-based process mining tasks
    Neubauer, Thais Rodrigues
    Sobrinho, Glaucia Pamponet
    Fantinato, Marcelo
    Peres, Sarajane Marques
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3548 - 3556