Utilizing domain knowledge in data-driven process discovery: A literature review

被引:29
|
作者
Schuster, Daniel [1 ,2 ]
van Zelst, Sebastiaan J. [1 ,2 ]
van der Aalsta, Wil M. P. [1 ,2 ]
机构
[1] Fraunhofer FIT, Proc Min Res Grp, D-53757 St Augustin, North Rhine Wes, Germany
[2] Rhein Westfal TH Aachen, Chair Proc & Data Sci, Ahornstr 55, D-52074 Aachen, North Rhine Wes, Germany
关键词
Process mining; Process discovery; Process models; Human-in-the-loop; Hybrid intelligence; PROCESS MODELS; NETS;
D O I
10.1016/j.compind.2022.103612
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Process mining aims to improve operational processes in a data-driven manner. To this end, process mining offers methods and techniques for systematically analyzing event data. These data are generated during the execution of processes and stored in organizations' information systems. Process discovery, a key discipline in process mining, comprises techniques used to (automatically) learn a process model from event data. However, existing algorithms typically provide low-quality models from real-life event data due to data-quality issues and incompletely captured process behavior. Automated filtering of event data is valuable in obtaining better process models. At the same time, it is often too rigorous, i.e., it also removes valuable and correct data. In many cases, prior knowledge about the process under investigation can be additionally used for process discovery besides event data. Therefore, a new family of discovery algorithms has been developed that utilizes domain knowledge about the process in addition to event data. To organize this research, we present a literature review of process discovery approaches exploiting domain knowledge. We define a taxonomy that systematically classifies and compares existing approaches. Finally, we identify remaining challenges for future work. (C) 2022 The Author(s). Published by Elsevier B.V.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Integrating Domain Knowledge in Data-Driven Earth Observation with Process Convolutions
    Svendsen, Daniel Heestermans
    Piles, Maria
    Munoz-Mari, Jordi
    Luengo, David
    Martino, Luca
    Camps-Valls, Gustau
    IEEE Transactions on Geoscience and Remote Sensing, 2022, 60
  • [2] Integrating Domain Knowledge in Data-Driven Earth Observation With Process Convolutions
    Heestermans Svendsen, Daniel
    Piles, Maria
    Munoz-Mari, Jordi
    Luengo, David
    Martino, Luca
    Camps-Valls, Gustau
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] Data-Driven Domain Discovery for Structured Datasets
    Ota, Masayo
    Mueller, Heiko
    Freire, Juliana
    Srivastava, Divesh
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (07): : 953 - 965
  • [4] Paleontology Knowledge Graph for Data-Driven Discovery
    Yiying Deng
    Sicun Song
    Junxuan Fan
    Mao Luo
    Le Yao
    Shaochun Dong
    Yukun Shi
    Linna Zhang
    Yue Wang
    Haipeng Xu
    Huiqing Xu
    Yingying Zhao
    Zhaohui Pan
    Zhangshuai Hou
    Xiaoming Li
    Boheng Shen
    Xinran Chen
    Shuhan Zhang
    Xuejin Wu
    Lida Xing
    Qingqing Liang
    Enze Wang
    Journal of Earth Science, 2024, 35 (03) : 1024 - 1034
  • [5] Paleontology Knowledge Graph for Data-Driven Discovery
    Deng, Yiying
    Song, Sicun
    Fan, Junxuan
    Luo, Mao
    Yao, Le
    Dong, Shaochun
    Shi, Yukun
    Zhang, Linna
    Wang, Yue
    Xu, Haipeng
    Xu, Huiqing
    Zhao, Yingying
    Pan, Zhaohui
    Hou, Zhangshuai
    Li, Xiaoming
    Shen, Boheng
    Chen, Xinran
    Zhang, Shuhan
    Wu, Xuejin
    Xing, Lida
    Liang, Qingqing
    Wang, Enze
    JOURNAL OF EARTH SCIENCE, 2024, 35 (03) : 1024 - 1034
  • [6] Data-driven Discovery: A New Era of Exploiting the Literature and Data
    Ying Ding
    Kyle Stirling
    Journal of Data and Information Science, 2016, (04) : 1 - 9
  • [7] Data-driven Discovery: A New Era of Exploiting the Literature and Data
    Ying Ding
    Kyle Stirling
    JournalofDataandInformationScience, 2016, 1 (04) : 1 - 9
  • [8] A Review of Data-Driven Discovery for Dynamic Systems
    North, Joshua S.
    Wikle, Christopher K.
    Schliep, Erin M.
    INTERNATIONAL STATISTICAL REVIEW, 2023, 91 (03) : 464 - 492
  • [9] Integrative Systems Biology for Data-Driven Knowledge Discovery
    Greene, Casey S.
    Troyanskaya, Olga G.
    SEMINARS IN NEPHROLOGY, 2010, 30 (05) : 443 - 454
  • [10] Data-Driven Design in the Design Process: A Systematic Literature Review on Challenges and Opportunities
    Quinones-Gomez, Juan Carlos
    Mor, Enric
    Chacon, Jonathan
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2025, 41 (04) : 2227 - 2252