Process Mining over Unordered Event Streams

被引:8
|
作者
Awad, Ahmed [1 ,2 ]
Weidlich, Matthias [3 ]
Sakr, Sherif [2 ]
机构
[1] Cairo Univ, Giza, Egypt
[2] Univ Tartu, Tartu, Estonia
[3] Humboldt Univ, Berlin, Germany
关键词
D O I
10.1109/ICPM49681.2020.00022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Process mining is no longer limited to the one-off analysis of static event logs extracted from a single enterprise system. Rather, process mining may strive for immediate insights based on streams of events that are continuously generated by diverse information systems. This requires online algorithms that, instead of keeping the whole history of event data, work incrementally and update analysis results upon the arrival of new events. While such online algorithms have been proposed for several process mining tasks, from discovery through conformance checking to time prediction, they all assume that an event stream is ordered, meaning that the order of event generation coincides with their arrival at the analysis engine. Yet, once events are emitted by independent, distributed systems, this assumption may not hold true, which compromises analysis accuracy. In this paper, we provide the first contribution towards handling unordered event streams in process mining. Specifically, we formalize the notion of out-of-order arrival of events, where an online analysis algorithm needs to process events in an order different from their generation. Using directly-follows graphs as a basic model for many process mining tasks, we provide two approaches to handle such unorderedness, either through buffering or speculative processing. Our experiments with synthetic and real-life event data show that these techniques help mitigate the accuracy loss induced by unordered streams.
引用
收藏
页码:81 / 88
页数:8
相关论文
共 50 条
  • [1] Event Mining over Distributed Text Streams
    Martinez, John Calvo
    [J]. WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 745 - 746
  • [2] Differentially private frequent episode mining over event streams
    Qin, Jiawen
    Wang, Jinyan
    Li, Qiyu
    Fang, Shijian
    Li, Xianxian
    Lei, Lei
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110
  • [3] Complex-Event Mining Over Centralized and Distributed Data Streams
    Garofalakis, Minos
    [J]. 2022 4TH INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2022), 2022, : XV - XV
  • [4] Event mining in multimedia streams
    Xie, Lexing
    Sundaram, Hari
    Campbell, Murray
    [J]. PROCEEDINGS OF THE IEEE, 2008, 96 (04) : 623 - 647
  • [5] Frequent episode mining within the latest time windows over event streams
    Lin, Shukuan
    Qiao, Jianzhong
    Wang, Ya
    [J]. APPLIED INTELLIGENCE, 2014, 40 (01) : 13 - 28
  • [6] Frequent episode mining within the latest time windows over event streams
    Shukuan Lin
    Jianzhong Qiao
    Ya Wang
    [J]. Applied Intelligence, 2014, 40 : 13 - 28
  • [7] Efficient Process Discovery From Event Streams Using Sequential Pattern Mining
    Hassani, Marwan
    Siccha, Sergio
    Richter, Florian
    Seidl, Thomas
    [J]. 2015 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2015, : 1366 - 1373
  • [8] A Scalable Complex Event Analytical System with Incremental Episode Mining over Data Streams
    Tseng, Jerry C. C.
    Gu, Jia-Yuan
    Tseng, Vincent S.
    Wang, P. F.
    Chen, Ching-Yu
    Li, Chu-Feng
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 648 - 655
  • [9] Event evolution model for cybersecurity event mining in tweet streams
    Liu, Xiuwen
    Fu, Jianming
    Chen, Yanjiao
    [J]. INFORMATION SCIENCES, 2020, 524 : 254 - 276
  • [10] Efficient Episode Mining of Dynamic Event Streams
    Patnaik, Debprakash
    Laxman, Srivatsan
    Chandramouli, Badrish
    Ramakrishnan, Naren
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 605 - 614