Deep reinforcement learning for data-efficient weakly supervised business process anomaly detection

Cited by: 2
Authors
Elaziz, Eman Abd [1]
Fathalla, Radwa [1]
Shaheen, Mohamed [1]
Affiliations
[1] Arab Acad Sci Technol & Maritime Transport, Coll Comp & Informat Technol, Alexandria, Egypt
Keywords
Business process; Deep reinforcement learning; Weakly supervised learning; Variational autoencoder; Long short-term memory; Self-attention; Transformers; Imbalanced data; Process mining; Process discovery; Conformance checking; Balanced accuracy
DOI
10.1186/s40537-023-00708-5
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Detecting anomalous behavior in business process data is crucial for preventing failures that may jeopardize an organization's performance. Supervised learning techniques are impractical because large amounts of labeled business process anomaly data are difficult to gather. For this reason, unsupervised techniques and semi-supervised approaches trained entirely on labeled normal data have long dominated this domain. However, these methods perform poorly because they lack prior knowledge of true anomalies. In this study, we propose a deep weakly supervised reinforcement learning-based approach that identifies anomalies in business processes by leveraging limited labeled anomaly data. The approach uses a small collection of labeled anomalous data while exploring a large set of unlabeled data to find new classes of anomalies outside the scope of the labeled anomalous data. We designed a reward function that combines the supervisory signal supplied by a variational autoencoder trained on unlabeled data with the supervisory signal provided by the environment's reward. To further mitigate data deficiency, we introduced a sampling method that enables effective exploration of the unlabeled data and addresses the imbalanced data problem, which is common in anomaly detection. This sampling method relies on the proximity between data samples in the latent space of the variational autoencoder. Furthermore, to efficiently model the sequential nature of business process data and handle long-term dependencies, we used a long short-term memory network combined with a self-attention mechanism to build the agent of our reinforcement learning model. The proposed approach was tested in multiple scenarios on real-world and synthetic datasets. The findings revealed that the proposed approach outperformed five competing approaches by efficiently using the few available anomalous examples.
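The abstract describes a reward that combines an environment signal with a signal from a variational autoencoder, and a sampler driven by latent-space proximity, but gives no formulas. The following is a minimal sketch of how such a combination might look, not the authors' implementation. Everything in it is an assumption made for illustration: the additive form, the `weight` parameter, the reward values of +1, 0 and -1, min-max scaling of reconstruction errors, Euclidean distance, and all function names.

```python
import math


def normalize_errors(errors):
    """Min-max scale VAE reconstruction errors to [0, 1] (assumed scaling)."""
    lo, hi = min(errors), max(errors)
    if hi == lo:
        return [0.0 for _ in errors]
    return [(e - lo) / (hi - lo) for e in errors]


def extrinsic_reward(action, is_labeled_anomaly):
    """Hypothetical environment reward.

    action: 1 = flag the sample as anomalous, 0 = treat it as normal.
    The reward values below are illustrative, not taken from the paper.
    """
    if is_labeled_anomaly:
        return 1.0 if action == 1 else -1.0
    # Unlabeled sample: no ground truth is available.
    return 0.0 if action == 0 else -1.0


def combined_reward(action, is_labeled_anomaly, norm_error, weight=1.0):
    """Add the environment reward to a VAE-based signal (assumed additive)."""
    return extrinsic_reward(action, is_labeled_anomaly) + weight * norm_error


def nearest_in_latent(current, candidates):
    """Index of the candidate latent vector closest to `current`."""
    return min(
        range(len(candidates)),
        key=lambda i: math.dist(current, candidates[i]),
    )
```

In this sketch a sample with a high reconstruction error raises the reward even when it is unlabeled, which is one way an agent could be nudged toward anomalies outside the labeled set.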
Pages: 35
Related papers
50 results in total
  • [1] Deep reinforcement learning for data-efficient weakly supervised business process anomaly detection
    Eman Abd Elaziz
    Radwa Fathalla
    Mohamed Shaheen
    [J]. Journal of Big Data, 10
  • [2] Spiking Reinforcement Learning for Weakly-Supervised Anomaly Detection
    Jin, Ao
    Wu, Zhichao
    Zhu, Li
    Xia, Qianchen
    Yang, Xin
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT V, 2024, 14451 : 175 - 187
  • [3] Data-Efficient Deep Reinforcement Learning with Symmetric Consistency
    Zhang, Xianchao
    Yang, Wentao
    Zhang, Xiaotong
    Liu, Han
    Wang, Guanglu
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2430 - 2436
  • [4] A Data-Efficient Training Method for Deep Reinforcement Learning
    Feng, Wenhui
    Han, Chongzhao
    Lian, Feng
    Liu, Xia
    [J]. ELECTRONICS, 2022, 11 (24)
  • [5] Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly Data
    Pang, Guansong
    van den Hengel, Anton
    Shen, Chunhua
    Cao, Longbing
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1298 - 1308
  • [6] Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
    Frauenknecht, Bernd
    Ehlgen, Tobias
    Trimpe, Sebastian
    [J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 894 - 901
  • [7] A Data-Efficient Method of Deep Reinforcement Learning for Chinese Chess
    Xu, Changming
    Ding, Hengfeng
    Zhang, Xuejian
    Wang, Cong
    Yang, Hongji
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 687 - 693
  • [8] Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
    Maulana, Muhammad Rizki
    Lee, Wee Sun
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 122 - 138
  • [9] DeMis: Data-Efficient Misinformation Detection Using Reinforcement Learning
    Kawintiranon, Kornraphop
    Singh, Lisa
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II, 2023, 13714 : 224 - 240
  • [10] Data-Efficient Hierarchical Reinforcement Learning
    Nachum, Ofir
    Gu, Shixiang
    Lee, Honglak
    Levine, Sergey
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31