Deep reinforcement learning for data-efficient weakly supervised business process anomaly detection

被引:2
|
作者
Elaziz, Eman Abd [1 ]
Fathalla, Radwa [1 ]
Shaheen, Mohamed [1 ]
机构
[1] Arab Acad Sci Technol & Maritime Transport, Coll Comp & Informat Technol, Alexandria, Egypt
关键词
Business process; Deep reinforcement learning; Weakly supervised learning; Variational autoencoder; Long short-term memory; Self-attention; Transformers; Imbalanced data; Process mining; Process discovery; Conformance checking; Balanced accuracy;
D O I
10.1186/s40537-023-00708-5
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The detection of anomalous behavior in business process data is a crucial task for preventing failures that may jeopardize the performance of any organization. Supervised learning techniques are impracticable because of the difficulties of gathering huge amounts of labeled business process anomaly data. For this reason, unsupervised learning techniques and semi-supervised learning approaches trained on entirely labeled normal data have dominated this domain for a long time. However, these methods do not work well because of the absence of prior knowledge of true anomalies. In this study, we propose a deep weakly supervised reinforcement learning-based approach to identify anomalies in business processes by leveraging limited labeled anomaly data. The proposed approach is intended to use a small collection of labeled anomalous data while exploring a huge set of unlabeled data to find new classes of anomalies that are outside the scope of the labeled anomalous data. We created a unique reward function that combined the supervisory signal supplied by a variational autoencoder trained on unlabeled data with the supervisory signal provided by the environment's reward. To further reduce data deficiency, we introduced a sampling method to allow the effective exploration of the unlabeled data and to address the imbalanced data problem, which is a common problem in the anomaly detection field. This approach depends on the proximity between the data samples in the latent space of the variational autoencoder. Furthermore, to efficiently model the sequential nature of business process data and to handle the long-term dependences, we used a long short-term memory network combined with a self-attention mechanism to develop the agent of our reinforcement learning model. Multiple scenarios were used to test the proposed approach on real-world and synthetic datasets. The findings revealed that the proposed approach outperformed five competing approaches by efficiently using the few available anomalous examples.
引用
收藏
页数:35
相关论文
共 50 条
  • [41] A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures
    Wei Ma
    Yongmin Liu
    [J]. Science China Physics, Mechanics & Astronomy, 2020, 63
  • [42] A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures
    Wei Ma
    Yongmin Liu
    [J]. Science China(Physics,Mechanics & Astronomy), 2020, (08) : 27 - 34
  • [43] BINet: Multivariate Business Process Anomaly Detection Using Deep Learning
    Nolle, Timo
    Seeliger, Alexander
    Muhlhauser, Max
    [J]. BUSINESS PROCESS MANAGEMENT (BPM 2018), 2018, 11080 : 271 - 287
  • [44] Data-Efficient Automatic Model Selection in Unsupervised Anomaly Detection
    Gudur, Gautham Krishna
    Raaghul, R.
    Adithya, K.
    Vasudevan, Shrihari
    [J]. 2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1443 - 1448
  • [45] Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
    Yue, Yang
    Kang, Bingyi
    Xu, Zhongwen
    Huang, Gao
    Yan, Shuicheng
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 11069 - 11077
  • [46] WAKE: A Weakly Supervised Business Process Anomaly Detection Framework via a Pre-Trained Autoencoder
    Guan, Wei
    Cao, Jian
    Zhao, Haiyan
    Gu, Yang
    Qian, Shiyou
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (06) : 2745 - 2758
  • [47] Masked and Inverse Dynamics Modeling for Data-Efficient Reinforcement Learning
    Lee, Young Jae
    Kim, Jaehoon
    Park, Young Joon
    Kwak, Mingu
    Kim, Seoung Bum
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [48] Data-Efficient Deep Reinforcement Learning-Based Optimal Generation Control in DC Microgrids
    Fan, Zhen
    Zhang, Wei
    Liu, Wenxin
    [J]. IEEE SYSTEMS JOURNAL, 2024, 18 (01): : 426 - 437
  • [49] A data-efficient goal-directed deep reinforcement learning method for robot visuomotor skill
    Jiang, Rong
    Wang, Zhipeng
    He, Bin
    Zhou, Yanmin
    Li, Gang
    Zhu, Zhongpan
    [J]. NEUROCOMPUTING, 2021, 462 (462) : 389 - 401
  • [50] Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control
    Kamthe, Sanket
    Deisenroth, Marc Peter
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84