Process Mining over Unordered Event Streams

被引:8
|
作者
Awad, Ahmed [1 ,2 ]
Weidlich, Matthias [3 ]
Sakr, Sherif [2 ]
机构
[1] Cairo Univ, Giza, Egypt
[2] Univ Tartu, Tartu, Estonia
[3] Humboldt Univ, Berlin, Germany
关键词
D O I
10.1109/ICPM49681.2020.00022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Process mining is no longer limited to the one-off analysis of static event logs extracted from a single enterprise system. Rather, process mining may strive for immediate insights based on streams of events that are continuously generated by diverse information systems. This requires online algorithms that, instead of keeping the whole history of event data, work incrementally and update analysis results upon the arrival of new events. While such online algorithms have been proposed for several process mining tasks, from discovery through conformance checking to time prediction, they all assume that an event stream is ordered, meaning that the order of event generation coincides with their arrival at the analysis engine. Yet, once events are emitted by independent, distributed systems, this assumption may not hold true, which compromises analysis accuracy. In this paper, we provide the first contribution towards handling unordered event streams in process mining. Specifically, we formalize the notion of out-of-order arrival of events, where an online analysis algorithm needs to process events in an order different from their generation. Using directly-follows graphs as a basic model for many process mining tasks, we provide two approaches to handle such unorderedness, either through buffering or speculative processing. Our experiments with synthetic and real-life event data show that these techniques help mitigate the accuracy loss induced by unordered streams.
引用
收藏
页码:81 / 88
页数:8
相关论文
共 50 条
  • [31] Event detection over twitter social media streams
    Zhou, Xiangmin
    Chen, Lei
    [J]. VLDB JOURNAL, 2014, 23 (03): : 381 - 400
  • [32] Incremental causal network construction over event streams
    Acharya, Saurav
    Lee, Byung Suk
    [J]. INFORMATION SCIENCES, 2014, 261 : 32 - 51
  • [33] Probabilistic timing join over uncertain event streams
    Mok, Aloysius K.
    Woo, Honguk
    Lee, Chan-Gun
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2006, : 17 - +
  • [34] Differentially Private Event Sequences over Infinite Streams
    Kellaris, Georgios
    Papadopoulos, Stavros
    Xiao, Xiaokui
    Papadias, Dimitris
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (12): : 1155 - 1166
  • [35] Unordered tree mining with applications to phylogeny
    Shasha, D
    Wang, JTL
    Zhang, S
    [J]. 20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 708 - 719
  • [36] Scalable Contrast Pattern Mining over Data Streams
    Alipourchavary, Elaheh
    Erfani, Sarah M.
    Leckie, Christopher
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2842 - 2846
  • [37] Mining multidimensional sequential patterns over data streams
    Raissi, Chedy
    Plantevit, Marc
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2008, 5182 : 263 - 272
  • [38] Association Rules Mining over Data Streams: Review
    Tan, Jun
    [J]. ADVANCES IN CIVIL ENGINEERING II, PTS 1-4, 2013, 256-259 : 2890 - 2893
  • [39] To Share, or not to Share Online Event Trend Aggregation Over Bursty Event Streams
    Poppe, Olga
    Lei, Chuan
    Ma, Lei
    Rozet, Allison
    Rundensteiner, Elke A.
    [J]. SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 1452 - 1464
  • [40] PErrCas: Process Error Cascade Mining in Trace Streams
    Wimbauer, Anna
    Richter, Florian
    Seidl, Thomas
    [J]. PROCESS MINING WORKSHOPS, ICPM 2021, 2022, 433 : 224 - 236