A Deep Learning Approach for Repairing Missing Activity Labels in Event Logs for Process Mining

被引:5
|
作者
Lu, Yang [1 ]
Chen, Qifan [1 ]
Poon, Simon K. [1 ]
机构
[1] Univ Sydney, Sch Comp Sci, Sydney, NSW 2006, Australia
关键词
process mining; business process management; incomplete event logs; data quality; data management; PROCESS MODELS; DISCOVERY; ACCURATE;
D O I
10.3390/info13050234
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Process mining is a relatively new subject that builds a bridge between traditional process modeling and data mining. Process discovery is one of the most critical parts of process mining, which aims at discovering process models automatically from event logs. Like other data mining techniques, the performance of existing process discovery algorithms can be affected when there are missing activity labels in event logs. In this paper, we assume that the control-flow information in event logs could be useful in repairing missing activity labels. We propose an LSTM-based prediction model, which takes both the prefix and suffix sequences of the events with missing activity labels as input to predict missing activity labels. Additional attributes of event logs are also utilized to improve the performance. Our evaluation of several publicly available datasets shows that the proposed method performed consistently better than existing methods in terms of repairing missing activity labels in event logs.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Configurable Process Mining: Semantic Variability in Event Logs
    Khannat, Aicha
    Sbai, Hanae
    Kjiri, Laila
    [J]. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS 2021), VOL 1, 2021, : 768 - 775
  • [22] Optimal Process Mining for Large and Complex Event Logs
    Prodel, Martin
    Augusto, Vincent
    Jouaneton, Baptiste
    Lamarsalle, Ludovic
    Xie, Xiaolan
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2018, 15 (03) : 1309 - 1325
  • [23] Mining variable fragments from process event logs
    Asef Pourmasoumi
    Mohsen Kahani
    Ebrahim Bagheri
    [J]. Information Systems Frontiers, 2017, 19 : 1423 - 1443
  • [24] Process Mining of Event Logs from Horde Helpdesk
    Dolak, Radim
    Botlik, Josef
    [J]. SMART TECHNOLOGIES AND INNOVATION FOR A SUSTAINABLE FUTURE, 2019, : 303 - 309
  • [25] Explainable Deep Learning Approach for Multilabel Classification of Antimicrobial Resistance With Missing Labels
    Tharmakulasingam, Mukunthan
    Gardner, Brian
    La Ragione, Roberto
    Fernando, Anil
    [J]. IEEE ACCESS, 2022, 10 : 113073 - 113085
  • [26] Generating Synthetic Sensor Event Logs for Process Mining
    Zisgen, Yorck
    Janssen, Dominik
    Koschmider, Agnes
    [J]. INTELLIGENT INFORMATION SYSTEMS (CAISE FORUM 2022), 2022, 452 : 130 - 137
  • [27] Mining Business Process Stages from Event Logs
    Hoang Nguyen
    Dumas, Marlon
    ter Hofstede, Arthur H. M.
    La Rosa, Marcello
    Maggi, Fabrizio Maria
    [J]. ADVANCED INFORMATION SYSTEMS ENGINEERING (CAISE 2017), 2017, 10253 : 577 - 594
  • [28] Mining variable fragments from process event logs
    Pourmasoumi, Asef
    Kahani, Mohsen
    Bagheri, Ebrahim
    [J]. INFORMATION SYSTEMS FRONTIERS, 2017, 19 (06) : 1423 - 1443
  • [29] An optimization-based process mining approach for explainable classification of timed event logs
    De Oliveira, Hugo
    Augusto, Vincent
    Jouaneton, Baptiste
    Lamarsalle, Ludovic
    Prodel, Martin
    Xie, Xiaolan
    [J]. 2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 43 - 48
  • [30] Process Mining Reloaded: Event Structures as a Unified Representation of Process Models and Event Logs
    Dumas, Marlon
    Garcia-Banuelos, Luciano
    [J]. APPLICATION AND THEORY OF PETRI NETS AND CONCURRENCY, 2015, 9115 : 33 - 48