How Can Interactive Process Discovery Address Data Quality Issues in Real Business Settings? Evidence from a Case Study in Healthcare

被引:13
|
作者
Benevento, Elisabetta [1 ]
Aloini, Davide [1 ]
van der Aalst, Wil M. P. [2 ,3 ]
机构
[1] Univ Pisa, Dept Energy Syst Terr & Construction Engn, Largo Lucio Lazzarino 1, I-56122 Pisa, Italy
[2] Rhein Westfal Tech Hsch RWTH, Ahornstr 55, D-52074 Aachen, Germany
[3] Fraunhofer Inst Appl Informat Technol FIT, D-53757 St Augustin, Germany
关键词
Interactive Process Discovery; Process Mining; Data Quality; Business Process Modelling; Healthcare; PROCESS MODELS; DIGITAL TRANSFORMATION; CURRENT STATE; BIG DATA; MANAGEMENT; KNOWLEDGE; FRAMEWORK; SYSTEMS; IMPACT; MINER;
D O I
10.1016/j.jbi.2022.104083
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The focus of this paper is on how data quality can affect business process discovery in real complex environments, which is a major factor determining the success in any data-driven Business Process Management project. Many real-life event logs, especially healthcare ones, can suffer from several data quality issues, some of which cannot be solved by pre-processing or data cleaning techniques, leading to inaccurate results. We take an innovative Process Mining (PM) approach, termed Interactive Process Discovery (IPD), which combines domain knowledge with available data. This approach can overcome the limitations of noisy and incomplete event logs by putting "humans in the loop", leading to improved business process modelling. This is particularly valuable in healthcare, where physicians have a tacit domain knowledge not available in the event log, and, thus, difficult to elicit. We conducted a two-step approach based on a controlled experiment and a case study in an Italian hospital. At each step, we compared IPD with traditional PM techniques to assess the extent to which domain knowledge helps to improve the accuracy of process models. The case study tests the effectiveness of IPD to uncover knowledge-intensive processes extracted from noisy real-life event logs. The evaluation has been carried out by exploiting a real dataset of an Italian hospital, involving the medical staff. IPD can produce an accurate process model that is fully compliant with the clinical guidelines by addressing data quality issues. Accurate and reliable process models can support healthcare organizations in detecting process-related issues and in taking decisions related to capacity planning and process re-design.
引用
收藏
页数:11
相关论文
共 40 条
  • [21] Strengthening data quality and reporting from small-scale surveys in humanitarian settings: a case study from Yemen, 2011-2019
    Ogbu, Thomas Jideofor
    Guha-Sapir, Debarati
    CONFLICT AND HEALTH, 2021, 15 (01)
  • [22] How Can It Be More Real? A Case Study to Present the Authenticity of a Local Heritage District from the Perspective of Regional Spatial Morphology
    Zhao, Huanxi
    SUSTAINABILITY, 2018, 10 (06):
  • [23] How Do We Sleep - a Case Study of Sleep Duration and Quality Using Data from Oura Ring
    Koskimaki, Hell
    Kinnunen, Hannu
    Kurppa, Teemu
    Roning, Juha
    PROCEEDINGS OF THE 2018 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC'18 ADJUNCT), 2018, : 714 - 717
  • [24] Can Didactic Continuing Education Improve Clinical Decision Making and Reduce Cost of Quality? Evidence From a Case Study
    Vukovic, Mira
    Gvozdenovic, Branislav S.
    Rankovic, Milena
    McCormick, Bryan P.
    Vukovic, Danica D.
    Gvozdenovic, Biljana D.
    Kastratovic, Dragana A.
    Markovic, Srdjan Z.
    Ilic, Miodrag
    Jakovljevic, Mihajlo B.
    JOURNAL OF CONTINUING EDUCATION IN THE HEALTH PROFESSIONS, 2015, 35 (02) : 109 - 118
  • [25] What Can We Expect from Data Assimilation for Air Quality Forecast? Part II: Analysis with a Semi-Real Case
    Bessagnet, Bertrand
    Menut, Laurent
    Couvidat, Florian
    Meleux, Frederik
    Siour, Guillaume
    Mailler, Sylvain
    JOURNAL OF ATMOSPHERIC AND OCEANIC TECHNOLOGY, 2019, 36 (07) : 1433 - 1448
  • [26] Biological data extraction from imagery - How far can we go? A case study from the Mid-Atlantic Ridge
    Cuvelier, Daphne
    de Busserolles, Fanny
    Lavaud, Romain
    Floc'h, Estelle
    Fabri, Marie-Claire
    Sarradin, Pierre-Marie
    Sarrazin, Jozee
    MARINE ENVIRONMENTAL RESEARCH, 2012, 82 : 15 - 27
  • [27] How multi-sourcing can influence management control: Case study evidence from the electronic products supply chain
    O'Connor, Neale G.
    Schloetzer, Jason D.
    Romero, Jorge
    Wu, Anne
    BRITISH ACCOUNTING REVIEW, 2022, 54 (05):
  • [28] How far can reformulation participate in improving the nutritional quality of diets at population level? A modelling study using real food market data in France
    Sarda, Barthelemy
    Kesse-Guyot, Emmanuelle
    Srour, Bernard
    Deschasaux-Tanguy, Melanie
    Fialon, Morgane
    Fezeu, Leopold K.
    Galan, Pilar
    Hercberg, Serge
    Touvier, Mathilde
    Julia, Chantal
    BMJ GLOBAL HEALTH, 2024, 9 (03):
  • [29] The Adequacy of Information Systems for Supporting the Asset Quality Review Process in Banks. Evidence from an Italian Case Study
    Bruno, Elena
    Iacoviello, Giuseppina
    Lazzini, Arianna
    STRENGTHENING INFORMATION AND CONTROL SYSTEMS: THE SYNERGY BETWEEN INFORMATION TECHNOLOGY AND ACCOUNTING MODELS, 2016, 14 : 59 - 75