Enabling semantics-aware process mining through the automatic annotation of event logs

被引:6
|
作者
Rebmann, Adrian [1 ]
van der Aa, Han [1 ]
机构
[1] Univ Mannheim, Data & Web Sci Grp, Mannheim, Germany
关键词
Process mining; Natural language processing; Semantic analysis;
D O I
10.1016/j.is.2022.102111
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Process mining is concerned with the analysis of organizational processes based on event data recorded during their execution. Foundational process mining techniques analyze such data in an abstract manner, without taking the meaning of these events or their payload into consideration. By contrast, other techniques may exploit specific kinds of information contained in event data, such as resources in organizational mining and business objects in object-centric analysis, to gain more specific insights into an organization's operations. However, the information required for such analyses is typically not readily available. Rather, the meaning of events is often captured in an ad hoc manner, commonly through unstructured textual attributes, such as an event's label, or in unclearly named attributes. In this work, we address this gap by proposing an approach for the automatic annotation of semantic components in event logs. To achieve this, we combine the analysis of textual attribute values, based on a state-of-the-art language model, with novel attribute classification and component categorization techniques. In this manner, our approach first identifies up to eight semantic components per event, revealing information on the actions, business objects, and resources recorded in an event log. Afterwards, our approach further categorizes the identified actions and actors, allowing for a more in-depth analysis of key process perspectives. We demonstrate our approach's efficacy through an evaluation using a broad range of event logs and highlight its usefulness through four application scenarios enabled by our approach. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Dynamic Access Control to Semantics-Aware Streamed Process Logs
    Leida, Marcello
    Ceravolo, Paolo
    Damiani, Ernesto
    Asal, Rasool
    Colombo, Maurizio
    [J]. JOURNAL ON DATA SEMANTICS, 2019, 8 (03) : 203 - 218
  • [2] Generating event logs from non-process-aware systems enabling business process mining
    Perez-Castillo, Ricardo
    Weber, Barbara
    Pinggera, Jakob
    Zugal, Stefan
    Garcia-Rodriguez de Guzman, Ignacio
    Piattini, Mario
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2011, 5 (03) : 301 - 335
  • [3] Semantics-aware mechanisms for control-flow anonymization in process mining
    Fahrenkrog-Petersen, Stephan A.
    Kabierski, Martin
    van der Aa, Han
    Weidlich, Matthias
    [J]. INFORMATION SYSTEMS, 2023, 114
  • [4] SaCoFa: Semantics-aware Control-flow Anonymization for Process Mining
    Fahrenkog-Petersen, Stephan A.
    Kabierski, Martin
    Roesel, Fabian
    van der Aa, Han
    Weidlich, Matthias
    [J]. 2021 3RD INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2021), 2021, : 72 - 79
  • [5] Toward semantics-aware annotation and retrieval of spatial data
    Cristiano Fugazza
    [J]. Earth Science Informatics, 2011, 4 : 225 - 239
  • [6] Toward semantics-aware annotation and retrieval of spatial data
    Fugazza, Cristiano
    [J]. EARTH SCIENCE INFORMATICS, 2011, 4 (04) : 225 - 239
  • [7] A Process Framework for Semantics-Aware Tourism Information Systems
    Daramola, Olawande J.
    [J]. CURRENT TRENDS IN WEB ENGINEERING, 2010, 6385s : 521 - 532
  • [8] SARRE: Semantics-Aware Rule Recommendation and Enforcement for Event Paths on Android
    Li, Yongbo
    Yao, Fan
    Lan, Tian
    Venkataramani, Guru
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (12) : 2748 - 2762
  • [9] Protecting location privacy through semantics-aware obfuscation techniques
    Damiani, Maria Luisa
    Bertino, Elisa
    Silvestri, Claudio
    [J]. TRUST MANAGEMENT II, 2008, 263 : 231 - +
  • [10] Optimal process mining of timed event logs
    De Oliveira, Hugo
    Augusto, Vincent
    Jouaneton, Baptiste
    Lamarsalle, Ludovic
    Prodel, Martin
    Xie, Xiaolan
    [J]. INFORMATION SCIENCES, 2020, 528 : 58 - 78