Integrated detection and localization of concept drifts in process mining with batch and stream trace clustering support

被引:4
|
作者
de Sousa, Rafael Gaspar [1 ]
Meira Neto, Antonio Carlos [1 ]
Fantinato, Marcelo [1 ]
Peres, Sarajane Marques [1 ]
Reijers, Hajo Alexander [2 ]
机构
[1] Univ Sao Paulo, Sch Arts Sci & Humanities, Sao Paulo, Brazil
[2] Univ Utrecht, Dept Informat & Comp Sci, Utrecht, Netherlands
基金
巴西圣保罗研究基金会;
关键词
Concept drift; Trace clustering; Process mining; Business processes; Data mining;
D O I
10.1016/j.datak.2023.102253
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Process mining can help organizations by extracting knowledge from event logs. However, process mining techniques often assume business processes are stationary, while actual business processes are constantly subject to change because of the complexity of organizations and their external environment. Thus, addressing process changes over time - known as concept drifts - allows for a better understanding of process behavior and can provide a competitive edge for organizations, especially in an online data stream scenario. Current approaches to handling process concept drift focus primarily on detecting and locating concept drifts, often through an integrated, albeit offline, approach. However, part of these integrated approaches rely on complex data structures related to tree-based process models, usually discovered through algorithms whose results are influenced by specific heuristic rules. Moreover, most of the proposed approaches have not been tested on public true concept drift-labeled event logs commonly used as benchmark, making comparative analysis difficult. In this article, we propose an online approach to detect and localize concept drifts in an integrated way using batch and stream trace clustering support. In our approach, cluster models provide input information for both concept drift detection and localization methods. Each cluster abstracts a behavior profile underlying the process and reveals descriptive information about the discovered concept drifts. Experiments with benchmark synthetic event logs with different control-flow changes, as well as with real-world event logs, showed that our approach, when relying on the same clustering model, is competitive in relation to baselines concept drift detection method. In addition, the experiment showed our approach is able to correctly locate the concept drifts detected and allows the analysis of such concept drifts through different process behavior profiles.
引用
收藏
页数:33
相关论文
共 50 条
  • [1] Integrated detection and localization of concept drifts in process mining with batch and stream trace clustering support
    de Sousa, Rafael Gaspar
    Meira Neto, Antonio Carlos
    Fantinato, Marcelo
    Peres, Sarajane Marques
    Reijers, Hajo Alexander
    Data and Knowledge Engineering, 2024, 149
  • [2] Concept Drift Detection and Localization in Process Mining An Integrated and Efficient Approach Enabled by Trace Clustering
    de Sousa, Rafael Gaspar
    Peres, Sarajane Marques
    Fantinato, Marcelo
    Reijers, Hajo Alexander
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 364 - 373
  • [3] A Stream Data Mining Approach to Handle Concept Drifts in Process Discovery
    Pasquadibisceglie, Vincenzo
    Lucente, Donato
    Malerba, Donato
    FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2024, 2024, 14670 : 136 - 145
  • [4] Formal Concept Analysis for Trace Clustering in Process Mining
    Boukhetta, Salah Eddine
    Trabelsi, Marwa
    GRAPH-BASED REPRESENTATION AND REASONING, ICCS 2023, 2023, 14133 : 73 - 88
  • [5] Dealing With Concept Drifts in Process Mining
    Bose, R. P. Jagadeesh Chandra
    van der Aalst, Wil M. P.
    Zliobaite, Indre
    Pechenizkiy, Mykola
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (01) : 154 - 171
  • [6] Trace Clustering in Process Mining
    Song, Minseok
    Gunther, Christian W.
    van der Aalst, Wil M. P.
    BUSINESS PROCESS MANAGEMENT WORKSHOPS, 2009, 17 : 109 - 120
  • [7] Unraveling Process Evolution by Handling Concept Drifts in Process Mining
    Li, Tianyang
    He, Ting
    Wang, Zhongjie
    Zhang, Yufeng
    Chu, Dianhui
    2017 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC), 2017, : 442 - 449
  • [8] Combining process mining with trace clustering
    Meincheim, Alex
    Garcia, Cleiton dos Santos
    Nievola, Julio Cesar
    Scalabrin, Edson Emilio
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 498 - 505
  • [9] A Generic Framework for Trace Clustering in Process Mining
    Zandkarimi, Fareed
    Rehse, Jana-Rebecca
    Soudmand, Pouya
    Hoehle, Hartmut
    2020 2ND INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2020), 2020, : 177 - 184
  • [10] Concept drift and trace clustering: insights for managers from mining
    Ghourchian, Nikoo
    Noughabi, Elham Akhondzadeh
    MANAGEMENT DECISION, 2025,