Validation set sampling strategies for predictive process monitoring

被引:2
|
作者
Peeperkorn, Jari [1 ]
vanden Broucke, Seppe [1 ,2 ]
De Weerdt, Jochen [1 ]
机构
[1] Katholieke Univ Leuven, Res Ctr Informat Syst Engn LIRIS, Leuven, Belgium
[2] Univ Ghent, Dept Business Informat & Operat Management, Ghent, Belgium
基金
欧盟地平线“2020”;
关键词
Process mining; Predictive process monitoring; LSTM; Generalization; Validation set; Log completeness;
D O I
10.1016/j.is.2023.102330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous studies investigating the efficacy of long short-term memory (LSTM) recurrent neural networks in predictive process monitoring and their ability to capture the underlying process structure have raised concerns about their limited ability to generalize to unseen behavior. Event logs often fail to capture the full spectrum of behavior permitted by the underlying processes. To overcome these challenges, this study introduces innovative validation set sampling strategies based on control-flow variant-based resampling. These strategies have undergone extensive evaluation to assess their impact on hyperparameter selection and early stopping, resulting in notable enhancements to the generalization capabilities of trained LSTM models. In addition, this study expands the experimental framework to enable accurate interpretation of underlying process models and provide valuable insights. By conducting experiments with event logs representing process models of varying complexities, this research elucidates the effectiveness of the proposed validation strategies. Furthermore, the extended framework facilitates investigations into the influence of event log completeness on the learning quality of predictive process models. The novel validation set sampling strategies proposed in this study facilitate the development of more effective and reliable predictive process models, ultimately bolstering generalization capabilities and improving the understanding of underlying process dynamics.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Metrology Sampling Strategies for Process Monitoring Applications
    Vincent, Tyrone L.
    Stirton, James Broc
    Poolla, Kameshwar
    IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2011, 24 (04) : 489 - 498
  • [2] A review of some sampling and aggregation strategies for basic statistical process monitoring
    Zwetsloot, Inez M.
    Woodall, William H.
    JOURNAL OF QUALITY TECHNOLOGY, 2021, 53 (01) : 1 - 16
  • [3] Event Log Sampling for Predictive Monitoring
    Sani, Mohammadreza Fani
    Vazifehdoostirani, Mozhgan
    Park, Gyunam
    Pegoraro, Marco
    van Zelst, Sebastiaan J.
    van der Aalst, Wil M. P.
    PROCESS MINING WORKSHOPS, ICPM 2021, 2022, 433 : 154 - 166
  • [4] OPTIMAL SAMPLING STRATEGIES FOR VALIDATION STUDIES
    OSBURN, HG
    GREENER, JM
    JOURNAL OF APPLIED PSYCHOLOGY, 1978, 63 (05) : 602 - 608
  • [5] A Control Chart Using Quartile Pair Ranked Set Sampling for Monitoring the Process Mean
    Muhammad Tayyab
    Muhammad Noor-ul-Amin
    Muhammad Hanif
    Journal of Statistical Theory and Practice, 2020, 14
  • [6] A Control Chart Using Quartile Pair Ranked Set Sampling for Monitoring the Process Mean
    Tayyab, Muhammad
    Noor-ul-Amin, Muhammad
    Hanif, Muhammad
    JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2020, 14 (01)
  • [7] Process validation and monitoring
    不详
    AGRO FOOD INDUSTRY HI-TECH, 1998, 9 (03): : 48 - 48
  • [8] EFFECTIVENESS OF SAMPLING STRATEGIES FOR INTERTIDAL MONITORING
    HAWKINS, SJ
    HARTNOLL, RG
    WILLIAMS, GA
    AZZOPARDI, PJ
    BURROWS, MT
    ELLARD, FM
    WATER SCIENCE AND TECHNOLOGY, 1986, 18 (4-5) : 63 - 72
  • [9] Novel predictive estimators using ranked set sampling
    Bhushan, Shashi
    Kumar, Anoop
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (03):
  • [10] PREDICTIVE ESTIMATION OF POPULATION MEAN IN RANKED SET SAMPLING
    Ahmed, Shakeel
    Shabbir, Javid
    Gupta, Sat
    REVSTAT-STATISTICAL JOURNAL, 2019, 17 (04) : 551 - 562