Prototype Selection Using Clustering and Conformance Metrics for Process Discovery

被引:4
|
作者
Sani, Mohammadreza Fani [1 ]
Boltenhagen, Mathilde [2 ]
van der Aalst, Wil [1 ]
机构
[1] Rhein Westfal TH Aachen, Aachen, Germany
[2] Univ Paris Saclay, CNRS, LSV, ENS Paris Saclay,Inria, Cachan, France
关键词
Process mining; Process discovery; Prototype selection; Trace clustering; Event log preprocessing; Quality enhancement;
D O I
10.1007/978-3-030-66498-5_21
中图分类号
F [经济];
学科分类号
02 ;
摘要
Automated process discovery algorithms aim to automatically create process models based on event data that is captured during the execution of business processes. These algorithms usually tend to use all of the event data to discover a process model. Using all (i.e., less common) behavior may lead to discover imprecise and/or complex process models that may conceal important information of processes. In this paper, we introduce a new incremental prototype selection algorithm based on the clustering of process instances to address this problem. The method iteratively computes a unique process model from a different set of selected prototypes that are representative of whole event data and stops when conformance metrics decrease. This method has been implemented using both ProM and RapidProM. We applied the proposed method on several real event datasets with state-of-the-art process discovery algorithms. Results show that using the proposed method leads to improve the general quality of discovered process models.
引用
收藏
页码:281 / 294
页数:14
相关论文
共 50 条
  • [31] Prototype Selection for Graph Embedding Using Instance Selection
    Jimenez-Guarneros, Magdiel
    Ariel Carrasco-Ochoa, Jesus
    Fco Martinez-Trinidad, Jose
    PATTERN RECOGNITION (MCPR 2015), 2015, 9116 : 84 - 92
  • [32] Designing RBFNNs Using Prototype Selection
    Cecilia Tenorio-Gonzalez, Ana
    Fco Martinez-Trinidad, Jose
    Ariel Carrasco-Ochoa, Jesus
    ADVANCES IN PATTERN RECOGNITION, 2010, 6256 : 189 - 198
  • [33] Conformance Checking Approximation Using Subset Selection and Edit Distance
    Sani, Mohammadreza Fani
    van Zelst, Sebastiaan J.
    van der Aalst, Wil M. P.
    ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2020, 2020, 12127 : 234 - 251
  • [34] Prototype Discovery Using Quality-Diversity
    Hagg, Alexander
    Asteroth, Alexander
    Baeck, Thomas
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XV, PT I, 2018, 11101 : 500 - 511
  • [35] Representation Discovery for MDPs Using Bisimulation Metrics
    Ruan, Sherry Shanshan
    Comanici, Gheorghe
    Panangaden, Prakash
    Precup, Doina
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 4202 - 4203
  • [36] Representation Discovery for MDPs Using Bisimulation Metrics
    Ruan, Sherry Shanshan
    Comanici, Gheorghe
    Panangaden, Prakash
    Precup, Doina
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3578 - 3584
  • [37] A New Component Selection Algorithm Based on Metrics and Fuzzy Clustering Analysis
    Serban, Camelia
    Vescan, Andreea
    Pop, Horia F.
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2009, 5572 : 621 - 628
  • [38] Discovery visualization using fast clustering
    Ribarsky, W
    Katz, J
    Jiang, F
    Holland, A
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 1999, 19 (05) : 32 - 39
  • [39] Discovery of contextual factors using clustering
    Bhaskaran, Subhashini Sailesh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (34) : 80889 - 80918
  • [40] Discovery visualization using fast clustering
    Ribarsky, William
    Katz, Jochen
    Jiang, Frank
    Holland, Aubrey
    IEEE Computer Graphics and Applications, 19 (05): : 32 - 39