Bagging Recurrent Event Imputation for Repair of Imperfect Event Log With Missing Categorical Events

被引:4
|
作者
Sim, Sunghyun [1 ]
Bae, Hyerim [2 ]
Liu, Ling [3 ]
机构
[1] Pusan Natl Univ, Inst Intelligent Logist Big Data, Busan, South Korea
[2] Pusan Natl Univ, Dept Ind Engn, 30 Jan Jeon Dong, Busan 609753, South Korea
[3] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
基金
新加坡国家研究基金会;
关键词
Process mining; event log quality; missing event imputation; event chain; IMPACT; VALUES; MODELS; MICE;
D O I
10.1109/TSC.2021.3118381
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In most computing services, imperfect event logs with missing events are generated for a variety of reasons. Because missing events in imperfect event logs adversely affect the results of process mining analysis, it is essential to handle them effectively. Most existing process mining studies focus on methodologies for generation of good process models, very few methodologies, in fact, have been developed to deal with missing events. To the best of our knowledge, there is a lack of high performance methods for restoration of missing events in actual event log data. In this paper, we propose a new categorical event imputation method that can restore missing categorical events by learning the structural features between observed events in the event log. We evaluated the proposed method by way of comparative experiments with previous studies using six real datasets, and the results demonstrate that the restoration performance was greatly improved and that thereby, our proposed method can significantly improve both the quality of event logs (specifically by restoring missing events in imperfect event logs) and the overall quality of process mining analysis.
引用
收藏
页码:108 / 121
页数:14
相关论文
共 50 条
  • [21] Sensitivity Analysis of Missing Data and Imputation Techniques in Flight Safety Event Detection
    Hayachiguti, Elton Shinji Okuma
    Arra, Aditya
    Bhanpato, Jirat
    Gautier, Raphael
    Kirby, Michelle
    Mavris, Dimitri N.
    AIAA AVIATION FORUM AND ASCEND 2024, 2024,
  • [22] Imputation of Missing Data for Time-to-Event Endpoints Using Retrieved Dropouts
    Wang, Shuai
    Frederich, Robert
    Mancuso, James P.
    THERAPEUTIC INNOVATION & REGULATORY SCIENCE, 2024, 58 (01) : 114 - 126
  • [23] An Event Log Repair Method Based on Masked Transformer Model
    Wu, Ping
    Fang, Xianwen
    Fang, Huan
    Gong, Ziyou
    Kan, Daoyu
    APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [24] A Bayesian Framework for Event Prediction in Clinical Trials with Recurrent Event Endpoints and Terminal Events
    Ren, Yangfan
    Schloemer, Patrick
    Wang, Ming-Dauh
    STATISTICS IN BIOPHARMACEUTICAL RESEARCH, 2025,
  • [25] Imputing Missing Events in Continuous-Time Event Streams
    Mei, Hongyuan
    Qin, Guanghui
    Eisner, Jason
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [26] Shared frailty models for recurrent events and a terminal event
    Liu, L
    Wolfe, RA
    Huang, XL
    BIOMETRICS, 2004, 60 (03) : 747 - 756
  • [27] Income and recurrent events after a coronary event in women
    Laszlo, Krisztina D.
    Janszky, Imre
    Ahnve, Staffan
    EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2008, 23 (10) : 669 - 680
  • [28] A Bayesian joint model of recurrent events and a terminal event
    Li, Zheng
    Chinchilli, Vernon M.
    Wang, Ming
    BIOMETRICAL JOURNAL, 2019, 61 (01) : 187 - 202
  • [29] Tests for multivariate recurrent events in the presence of a terminal event
    Chen, BSE
    Cook, RJ
    BIOSTATISTICS, 2004, 5 (01) : 129 - 143
  • [30] Income and recurrent events after a coronary event in women
    Krisztina D. László
    Imre Janszky
    Staffan Ahnve
    European Journal of Epidemiology, 2008, 23 : 669 - 680