Autoencoders for improving quality of process event logs

被引:36
|
作者
Hoang Thi Cam Nguyen [1 ]
Lee, Suhwan [2 ]
Kim, Jongchan [2 ]
Ko, Jonghyeon [2 ]
Comuzzi, Marco [2 ]
机构
[1] Trusting Social, Ho Chi Minh City, Vietnam
[2] Ulsan Natl Inst Sci & Technol, Ulsan, South Korea
关键词
Autoencoder; Event log; Business process management; Event log cleaning; Event log reconstruction; Event log quality; IMPUTATION;
D O I
10.1016/j.eswa.2019.04.052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Low quality of business process event logs, as determined by anomalous and missing values, is often unavoidable in practical contexts. The output of process analysis that uses event logs with missing and anomalous values is also likely to be of low quality, thus decreasing the quality of any decisions based on it While previous work has focused on reconstructing missing events in an event log or removing anomalous traces, in this paper we focus on detecting anomalous values and reconstructing missing values at the level of attributes in event logs. We propose methods based on autoencoders, which are a class of neural networks that can reconstruct their own input and are particularly suitable to learn a model of the complex relationships among attribute values in an event log. These methods do not rely on any a-priori knowledge about the business process that generated an event log and are evaluated using real world and artificially-generated event logs. The paper also discusses a qualitative analysis of the impact of event log cleaning and reconstruction on the output of process discovery. The proposed approach shows remarkable performance regarding activity labels and timestamps in artificial event logs. The performance in the case of real world event logs, in particular timestamp anomaly detection, is lower, which may be due to high variability of attribute values in the chosen event logs. Process models discovered from reconstructed event logs are characterised by lower variability of allowed behaviour and, therefore, are more usable in practice. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:132 / 147
页数:16
相关论文
共 50 条
  • [1] An ontology-based method for improving the quality of process event logs using database bin logs
    Ghalibafan, Shokoufeh
    Behkamal, Behshid
    Kahani, Mohsen
    Allahbakhsh, Mohammad
    [J]. International Journal of Metadata, Semantics and Ontologies, 2020, 14 (04): : 279 - 289
  • [2] Unsupervised Anomaly Detection in Noisy Business Process Event Logs Using Denoising Autoencoders
    Nolle, Timo
    Seeliger, Alexander
    Muehlhaeuser, Max
    [J]. DISCOVERY SCIENCE, (DS 2016), 2016, 9956 : 442 - 456
  • [3] Improving Documentation by Repairing Event Logs
    Rogge-Solti, Andreas
    Mans, Ronny S.
    van der Aalst, Wil M. P.
    Weske, Mathias
    [J]. PRACTICE OF ENTERPRISE MODELING, POEM 2013, 2013, 165 : 129 - 144
  • [4] Assessing and improving measurability of process performance indicators based on quality of logs
    Cappiello, Cinzia
    Comuzzi, Marco
    Plebani, Pierluigi
    Fim, Matheus
    [J]. INFORMATION SYSTEMS, 2022, 103
  • [5] Process mining with real world financial loan applications: Improving inference on incomplete event logs
    Moreira, Catarina
    Haven, Emmanuel
    Sozzo, Sandro
    Wichert, Andreas
    [J]. PLOS ONE, 2018, 13 (12):
  • [6] A generic import framework for process event logs
    Gunther, Christian W.
    van der Aalst, Wil M. P.
    [J]. BUSINESS PROCESS MANAGEMENT WORKSHOPS, 2006, 4103 : 81 - 92
  • [7] Auditing Between Event Logs and Process Trees
    Li, Hongxia
    Hou, Haixia
    Du, Yuyue
    Liu, Zhi
    [J]. DIGITAL TV AND MULTIMEDIA COMMUNICATION, 2019, 1009 : 227 - 237
  • [8] Sampling business process event logs with guarantees
    Su, Xuan
    Liu, Cong
    Zhang, Shuaipeng
    Zeng, Qingtian
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (13):
  • [9] Optimal process mining of timed event logs
    De Oliveira, Hugo
    Augusto, Vincent
    Jouaneton, Baptiste
    Lamarsalle, Ludovic
    Prodel, Martin
    Xie, Xiaolan
    [J]. INFORMATION SCIENCES, 2020, 528 : 58 - 78
  • [10] Mining Process Performance from Event Logs
    Adriansyah, Arya
    Buijs, Joos C. A. M.
    [J]. BUSINESS PROCESS MANAGEMENT WORKSHOPS (BPM), 2013, 132 : 217 - 218