Validation set sampling strategies for predictive process monitoring

Cited by: 2
Authors
Peeperkorn, Jari [1]
vanden Broucke, Seppe [1, 2]
De Weerdt, Jochen [1]
Affiliations
[1] Katholieke Univ Leuven, Res Ctr Informat Syst Engn LIRIS, Leuven, Belgium
[2] Univ Ghent, Dept Business Informat & Operat Management, Ghent, Belgium
Funding
EU Horizon 2020
Keywords
Process mining; Predictive process monitoring; LSTM; Generalization; Validation set; Log completeness;
DOI
10.1016/j.is.2023.102330
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Previous studies investigating the efficacy of long short-term memory (LSTM) recurrent neural networks in predictive process monitoring and their ability to capture the underlying process structure have raised concerns about their limited ability to generalize to unseen behavior. Event logs often fail to capture the full spectrum of behavior permitted by the underlying processes. To overcome these challenges, this study introduces innovative validation set sampling strategies based on control-flow variant-based resampling. These strategies have undergone extensive evaluation to assess their impact on hyperparameter selection and early stopping, resulting in notable enhancements to the generalization capabilities of trained LSTM models. In addition, this study expands the experimental framework to enable accurate interpretation of underlying process models and provide valuable insights. By conducting experiments with event logs representing process models of varying complexities, this research elucidates the effectiveness of the proposed validation strategies. Furthermore, the extended framework facilitates investigations into the influence of event log completeness on the learning quality of predictive process models. The novel validation set sampling strategies proposed in this study facilitate the development of more effective and reliable predictive process models, ultimately bolstering generalization capabilities and improving the understanding of underlying process dynamics.
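The abstract names control-flow variant-based resampling but does not spell out the procedure. The sketch below is one plausible reading, not the authors' implementation: it assumes a variant is the ordered sequence of activity labels in a trace, and it holds out whole variants so that every validation trace shows behavior absent from the training set. The function name `variant_based_split` and the parameters `val_fraction` and `seed` are hypothetical.

```python
import random
from collections import defaultdict


def variant_based_split(traces, val_fraction=0.2, seed=0):
    """Split trace indices so no control-flow variant occurs in both sets.

    Assumption: a variant is the ordered tuple of activity labels of a trace.
    This illustrates the general idea only, not the paper's exact strategy.
    """
    # Group trace indices by their variant.
    by_variant = defaultdict(list)
    for idx, trace in enumerate(traces):
        by_variant[tuple(trace)].append(idx)

    if len(by_variant) < 2:
        raise ValueError("need at least two distinct variants to split")

    # Sort first so the shuffle is reproducible for a given seed.
    variants = sorted(by_variant)
    random.Random(seed).shuffle(variants)

    # Hold out at least one variant, but never all of them.
    n_val = max(1, round(val_fraction * len(variants)))
    n_val = min(n_val, len(variants) - 1)

    val_idx = sorted(i for v in variants[:n_val] for i in by_variant[v])
    train_idx = sorted(i for v in variants[n_val:] for i in by_variant[v])
    return train_idx, val_idx


# Toy event log: each trace is a list of activity labels.
log = [
    ["a", "b", "c"],
    ["a", "b", "c"],
    ["a", "c", "b"],
    ["a", "b", "d"],
    ["a", "d"],
]
train_idx, val_idx = variant_based_split(log, val_fraction=0.25, seed=1)
train_variants = {tuple(log[i]) for i in train_idx}
val_variants = {tuple(log[i]) for i in val_idx}
print("shared variants:", train_variants & val_variants)
```

With a split of this kind, validation loss used for early stopping or hyperparameter selection measures performance on unseen variants rather than on repeats of training behavior, which is the generalization concern the abstract raises.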
Pages: 23