Can recurrent neural networks learn process model structure?

被引:6
|
作者
Peeperkorn, Jari [1 ]
Broucke, Seppe vanden [1 ,2 ]
De Weerdt, Jochen [1 ]
机构
[1] Katholieke Univ Leuven, Res Ctr Informat Syst Engn LIRIS, Leuven, Belgium
[2] Univ Ghent, Dept Business Informat & Operat Management, Ghent, Belgium
关键词
Process mining; Predictive process analytics; LSTM; Fitness; Precision; Generalization;
D O I
10.1007/s10844-022-00765-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Various methods using machine and deep learning have been proposed to tackle different tasks in predictive process monitoring, forecasting for an ongoing case e.g. the most likely next event or suffix, its remaining time, or an outcome-related variable. Recurrent neural networks (RNNs), and more specifically long short-term memory nets (LSTMs), stand out in terms of popularity. In this work, we investigate the capabilities of such an LSTM to actually learn the underlying process model structure of an event log. We introduce an evaluation framework that combines variant-based resampling and custom metrics for fitness, precision and generalization. We evaluate 4 hypotheses concerning the learning capabilities of LSTMs, the effect of overfitting countermeasures, the level of incompleteness in the training set and the level of parallelism in the underlying process model. We confirm that LSTMs can struggle to learn process model structure, even with simplistic process data and in a very lenient setup. Taking the correct anti-overfitting measures can alleviate the problem. However these measures did not present themselves to be optimal when selecting hyperparameters purely on predicting accuracy. We also found that decreasing the amount of information seen by the LSTM during training, causes a sharp drop in generalization and precision scores. In our experiments, we could not identify a relationship between the extent of parallelism in the model and the generalization capability, but they do indicate that the process' complexity might have impact.
引用
收藏
页码:27 / 51
页数:25
相关论文
共 50 条
  • [31] Learning of Process Representations Using Recurrent Neural Networks
    Seeliger, Alexander
    Luettgen, Stefan
    Nolle, Timo
    Muehlhaeuser, Max
    ADVANCED INFORMATION SYSTEMS ENGINEERING (CAISE 2021), 2021, 12751 : 109 - 124
  • [32] Classifying Process Instances Using Recurrent Neural Networks
    Hinkka, Markku
    Lehto, Teemu
    Heljanko, Keijo
    Jung, Alexander
    BUSINESS PROCESS MANAGEMENT WORKSHOPS, BPM 2018 INTERNATIONAL WORKSHOPS, 2019, 342 : 313 - 324
  • [33] Prediction Model Using Recurrent Neural Networks
    Jahan, Israt
    Sajal, Sayeed Z.
    Nygard, Kendall E.
    2019 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY (EIT), 2019, : 390 - 395
  • [34] MODEL OF A DC MOTOR IN RECURRENT NEURAL NETWORKS
    Orlovskiy, I. A.
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2006, 1 : 151 - 159
  • [35] Recurrent Neural Networks for Word Alignment Model
    Tamura, Akihiro
    Watanabe, Taro
    Sumita, Eiichiro
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1470 - 1480
  • [36] Recurrent Neural Networks for Local Model Prediction
    Cherif, Aymen
    Bone, Romuald
    ADVANCES IN COGNITIVE NEURODYNAMICS (II), 2011, : 621 - 628
  • [37] DYNAMICS OF COMPARTMENTAL MODEL RECURRENT NEURAL NETWORKS
    BRESSLOFF, PC
    PHYSICAL REVIEW E, 1994, 50 (03): : 2308 - 2319
  • [38] Neural Dynamics Discovery via Gaussian Process Recurrent Neural Networks
    She, Qi
    Wu, Anqi
    35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 454 - 464
  • [39] Visualisation and 'Diagnostic Classifiers' Reveal how Recurrent and Recursive Neural Networks Process Hierarchical Structure
    Hupkes, Dieuwke
    Veldhoen, Sara
    Zuidema, Willem
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 61 : 907 - 926
  • [40] Recurrent Neural Networks with Multi-Branch Structure
    Yamashita, Takashi
    Mabu, Shingo
    Hirasawa, Kotaro
    Furuzuki, Takayuki
    ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2008, 91 (09) : 37 - 44