Pre-Processing Event Logs by Chaotic Filtering Approaches Based on the Direct Following Relationship

被引:0
|
作者
Lv, Tengzi [1 ]
Gong, Xiugang [1 ]
Gong, Na [1 ]
Li, Kaiyu [1 ]
机构
[1] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo 255049, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 16期
关键词
process mining; chaotic activities; pre-processing; process model; direct following relationship; event log;
D O I
10.3390/app14166994
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Process discovery aims to discover process models from event logs to describe actual business processes. The quality of event logs has an impact on the quality of process models, so preprocessing methods can be used to improve the quality of event logs. Chaotic activities may exist in real business scenarios, and the occurrence of chaotic activities is independent of other activities in the process and can occur at any location in the event log at any frequency. Therefore, chaotic activities seriously affect the model quality of process discovery. Filtering chaotic activities in event logs can effectively improve the quality of event logs and thus improve the quality of process models. The traditional chaotic activity filtering algorithm makes it difficult to balance accuracy and time performance. Therefore, a direct method for filtering chaotic activities is proposed in this paper. By analyzing the relationship between activities, chaotic activities are identified in the log according to the characteristics of chaotic activities and the direct following relationship of activities as the judgment condition, and the filtering of chaotic activities in the event log is realized. In addition, this paper proposes an indirect chaotic activity filtering method, which identifies and filters chaotic activities in the log by analyzing the influence of the existence of different activities on the overall chaos degree of the log. The proposed method is compared with the traditional chaotic activity filtering method on several simulation/real data sets, and the accuracy and running time between the multi-group event logs and the process models generated before and after chaotic activity filtering are analyzed, further verifying the effectiveness and feasibility of the proposed method. By summarizing the experimental results, it is found that the accuracy of the proposed chaotic activity filtering methods is greater than that of the frequency-based filtering method and is close to that of the entropy-based chaotic activity filtering methods. Moreover, compared with other filtering methods used in the experiment, the chaotic activity filtering method proposed in this paper can improve the efficiency by 23.4% on average for simulation logs, and by 84.25% on average for real event logs. It is concluded that compared with other filtering methods, the proposed chaotic activity filtering methods have higher accuracy and can effectively improve the time performance of chaotic activity filtering. Therefore, the chaotic activity filtering method proposed in this paper can balance the accuracy and time performance, and can ensure the integrity of the filtered event log to a certain extent.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Pre-processing approaches for collaborative filtering based on hierarchical clustering
    de Aguiar Neto, Fernando S.
    da Costa, Arthur F.
    Manzato, Marcelo G.
    Campello, Ricardo J. G. B.
    [J]. INFORMATION SCIENCES, 2020, 534 : 172 - 191
  • [2] Multidimensional filtering approaches for pre-processing thermal images
    Maria del C. Valdes
    Minoru Inamura
    J. D. R. Valera
    Yao Lu
    [J]. Multidimensional Systems and Signal Processing, 2006, 17 : 299 - 325
  • [3] Event Logs Pre-processing for Configurable Process Discovery: Ontology-Based Approach
    Khannat, Aicha
    Sbai, Hanae
    Kjiri, Laila
    [J]. 2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 139 - 144
  • [4] Multidimensional filtering approaches for pre-processing thermal images
    del C. Valdes, Maria
    Inamura, Minoru
    Valera, J. D. R.
    Lu, Yao
    [J]. MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2006, 17 (04) : 299 - 325
  • [5] Speaker recognition based on pre-processing approaches
    Samia Abd El-Moneim
    El-Sayed M. EL-Rabaie
    M. A. Nassar
    Moawad I. Dessouky
    Nabil A. Ismail
    Adel S. El-Fishawy
    Fathi E. Abd El-Samie
    [J]. International Journal of Speech Technology, 2020, 23 : 435 - 442
  • [6] Speaker recognition based on pre-processing approaches
    Abd El-Moneim, Samia
    El-Rabaie, El-Sayed Mahmoud
    Nassar, M. A.
    Dessouky, Moawad, I
    Ismail, Nabil A.
    El-Fishawy, Adel S.
    Abd El-Samie, Fathi E.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02) : 435 - 442
  • [7] Declarative Process Mining: Reducing Discovered Models Complexity by Pre-Processing Event Logs
    Piccoli Richetti, Pedro H.
    Baiao, Fernanda Araujo
    Santoro, Flavia Maria
    [J]. BUSINESS PROCESS MANAGEMENT, BPM 2014, 2014, 8659 : 400 - 407
  • [8] Adaptive Volterra Filtering Algorithm Based on Lattice Pre-processing
    Zhang Xiu-mei
    Zhao Zhi-jin
    [J]. PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4049 - 4052
  • [9] Pre-Processing of Query Logs in Web Usage Mining
    Abdullah, Norhaiza Ya
    Husin, Husna Sarirah
    Ramadhani, Herny
    Nadarajan, Shanmuga Vivekanada
    [J]. INDUSTRIAL ENGINEERING AND MANAGEMENT SYSTEMS, 2012, 11 (01): : 82 - 86
  • [10] Video pre-processing with JND-based Gaussian filtering of superpixels
    Ding, Lei
    Li, Ge
    Wang, Ronggang
    Wang, Wenmin
    [J]. VISUAL INFORMATION PROCESSING AND COMMUNICATION VI, 2015, 9410