Process discovery from event data: Relating models and logs through abstractions

被引:31
|
作者
van der Aalst, Wil M. P. [1 ]
机构
[1] Rhein Westfal TH Aachen, Proc & Data Sci PADS, Aachen, Germany
关键词
business process management; data science; process discovery; process mining; process modeling; MINING PROCESS MODELS; OF-THE-ART;
D O I
10.1002/widm.1244
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Event data are collected in logistics, manufacturing, finance, health care, customer relationship management, e-learning, e-government, and many other domains. The events found in these domains typically refer to activities executed by resources at particular times and for a particular case (i.e., process instances). Process mining techniques are able to exploit such data. In this article, we focus on process discovery. However, process mining also includes conformance checking, performance analysis, decision mining, organizational mining, predictions, recommendations, and so on. These techniques help to diagnose problems and improve processes. All process mining techniques involve both event data and process models. Therefore, a typical first step is to automatically learn a control-flow model from the event data. This is very challenging, but in recent years, many powerful discovery techniques have been developed. It is not easy to compare these techniques since they use different representations and make different assumptions. Users often need to resort to trying different algorithms in an ad-hoc manner. Developers of new techniques are often trying to solve specific instances of a more general problem. Therefore, we aim to unify existing approaches by focusing on log and model abstractions. These abstractions link observed and modeled behavior: Concrete behaviors recorded in event logs are related to possible behaviors represented by process models. Hence, such behavioral abstractions provide an interface between both of them. We discuss four discovery approaches involving three abstractions and different types of process models (Petri nets, block-structured models, and declarative models). The goal is to provide a comprehensive understanding of process discovery and show how to develop new techniques. Examples illustrate the different approaches and pointers to software are given. The discussion on abstractions and process representations is also presented to reflect on the gap between process mining literature and commercial process mining tools. This facilitates users to select an appropriate process discovery technique. Moreover, structuring the role of internal abstractions and representations helps broaden the view and facilitates the creation of new discovery approaches. This article is categorized under: Algorithmic Development > Spatial and Temporal Data Mining Application Areas > Business and Industry Technologies > Machine Learning Application Areas > Data Mining Software Tools
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Process mining using BPMN: relating event logs and process models
    Kalenkova, Anna A.
    van der Aalst, Wil M. P.
    Lomazova, Irina A.
    Rubin, Vladimir A.
    [J]. SOFTWARE AND SYSTEMS MODELING, 2017, 16 (04): : 1019 - 1048
  • [2] Process Mining Using BPMN: Relating Event Logs and Process Models
    Kalenkova, Anna A.
    van der Aalst, Wil M. P.
    Lomazova, Irina A.
    Rubin, Vladimir A.
    [J]. 19TH ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS (MODELS'16), 2016, : 123 - 123
  • [3] Process mining using BPMN: relating event logs and process models
    Anna A. Kalenkova
    Wil M. P. van der Aalst
    Irina A. Lomazova
    Vladimir A. Rubin
    [J]. Software & Systems Modeling, 2017, 16 : 1019 - 1048
  • [4] Automated Discovery of Process Models from Event Logs: Review and Benchmark
    Augusto, Adriano
    Conforti, Raffaele
    Dumas, Marlon
    La Rosa, Marcello
    Maggi, Fabrizio Maria
    Marrella, Andrea
    Mecella, Massimo
    Soo, Allar
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (04) : 686 - 705
  • [5] Automated discovery of business process simulation models from event logs
    Camargo, Manuel
    Dumas, Marlon
    Gonzalez-Rojas, Oscar
    [J]. DECISION SUPPORT SYSTEMS, 2020, 134
  • [6] Stage-based discovery of business process models from event logs
    Hoang Nguyen
    Dumas, Marlon
    ter Hofstede, Arthur H. M.
    La Rosa, Marcello
    Maggi, Fabrizio Maria
    [J]. INFORMATION SYSTEMS, 2019, 84 : 214 - 237
  • [7] Discovering Data Models from Event Logs
    Bano, Dorina
    Weske, Mathias
    [J]. CONCEPTUAL MODELING, ER 2020, 2020, 12400 : 62 - 76
  • [8] Discovery of Fuzzy DMN Decision Models from Event Logs
    Bazhenova, Ekaterina
    Haarmann, Stephan
    Ihde, Sven
    Solti, Andreas
    Weske, Mathias
    [J]. ADVANCED INFORMATION SYSTEMS ENGINEERING (CAISE 2017), 2017, 10253 : 629 - 647
  • [9] Process Discovery from Dependence-Complete Event Logs
    Song, Wei
    Jacobsen, Hans-Arno
    Ye, Chunyang
    Ma, Xiaoxing
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (05) : 714 - 727
  • [10] Process Discovery from Low-Level Event Logs
    Fazzinga, Bettina
    Flesca, Sergio
    Furfaro, Filippo
    Pontieri, Luigi
    [J]. ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2018, 2018, 10816 : 257 - 273