Frequent pattern mining-based log file partition for process mining

Cited by: 3
Authors
Bantay, Laszlo [1]
Abonyi, Janos [1]
Affiliations
[1] Univ Pannonia, ELKH PE Complex Syst Monitoring Res Grp, Egyet U 10, H-8200 Veszprem, Hungary
Keywords
Frequent itemset mining; Frequent sequential pattern mining; Process mining; Log file pre-processing;
DOI
10.1016/j.engappai.2023.106221
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Process mining, a technique for exploring models based on event sequences, is growing in popularity in the process industry. Process mining algorithms assume that the processed log files contain events generated by a single unknown process; when this assumption is not met, the resulting models can be extremely complex and inaccurate. To address this issue, this article proposes a frequent pattern mining-based method for log file partitioning, which allows parallel processes to be explored. The key idea is that frequent pattern mining can identify grouped events and generate sub-logs of overlapping sub-processes. Thanks to this pre-processing of the log files, more compact and interpretable process models can be identified. We developed a set of goal-oriented metrics to evaluate the complexity of process mining problems and the resulting models. The applicability and effectiveness of the method are demonstrated in the analysis of process alarms of an industrial plant. The results confirm that the proposed method enables the discovery of targeted sub-process models by partitioning the log file using frequent pattern mining, and that the effectiveness of the method increases with the number of parallel processes stored in the same log file. We recommend applying the method whenever the logged events have no clear start and end, so that the log file may describe several different processes.
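The partitioning idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: it assumes fixed, non-overlapping time windows as transactions, counts itemsets by brute force, and projects the log onto each maximal frequent itemset. The names `frequent_itemsets`, `maximal`, `partition_log`, the `window`, `min_support` and `max_size` parameters, and the toy log are all hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=3):
    """Brute-force count of itemsets (size 2..max_size); keep the frequent ones."""
    counts = Counter()
    for items in transactions:
        unique = sorted(set(items))
        for size in range(2, max_size + 1):
            counts.update(combinations(unique, size))
    return {frozenset(s) for s, c in counts.items() if c >= min_support}


def maximal(itemsets):
    """Keep only itemsets that are not a proper subset of another itemset."""
    return [s for s in itemsets if not any(s < other for other in itemsets)]


def partition_log(log, window, min_support, max_size=3):
    """Split a (timestamp, event) log into one sub-log per maximal frequent itemset.

    Assumption: events falling in the same fixed time window form a transaction.
    Sub-logs may overlap, because one event type can belong to several itemsets.
    """
    buckets = {}
    for ts, event in log:
        buckets.setdefault(ts // window, []).append(event)
    groups = maximal(frequent_itemsets(buckets.values(), min_support, max_size))
    return {
        tuple(sorted(g)): [(ts, e) for ts, e in log if e in g]
        for g in groups
    }


# Toy log: two interleaved processes, A-B-C and X-Y, written to the same file.
log = [
    (0, "A"), (1, "X"), (2, "B"), (3, "C"),
    (10, "X"), (11, "Y"),
    (20, "A"), (21, "B"), (22, "Y"), (23, "C"),
    (30, "X"), (31, "Y"), (32, "A"), (33, "B"), (34, "C"),
    (40, "X"), (41, "Y"),
]

sub_logs = partition_log(log, window=10, min_support=3)
for group, events in sorted(sub_logs.items()):
    print(group, [e for _, e in events])
```

On this toy log the sketch separates the A-B-C events from the X-Y events, and each sub-log could then be handed to a process discovery algorithm on its own. A real application would more likely use an Apriori or FP-growth implementation and a sequence-aware pattern miner in place of the brute-force counting shown here.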
Pages: 11