On the application of sequential pattern mining primitives to process discovery: Overview, outlook and opportunity identification

被引:9
|
作者
Hassani, Marwan [1 ]
van Zelst, Sebastiaan J. [2 ,3 ]
van der Aalst, Wil M. P. [2 ,3 ]
机构
[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] FIT, Fraunhofer Inst Appl Informat Technol, St Augustin, Germany
[3] Rhein Westfal TH Aachen, Proc & Data Sci Grp, Aachen, Germany
关键词
data streams; distributed sequential pattern mining; process mining; sequential pattern mining; MODELS;
D O I
10.1002/widm.1315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequential pattern mining (SPM) is a well-studied theme in data mining, in which one aims to discover common sequences of item sets in a large corpus of temporal itemset data. Due to the sequential nature of data streams, supporting SPM in streaming environments is commonly studied in the area of data stream mining as well. On the other hand, stream-based process discovery (PD), originating from the field of process mining, focusses on learning process models on the basis of online event data. In particular, the main goal of the models discovered is to describe the underlying generating process in an end-to-end fashion. As both SPM and PD use data that are comparable in nature, that is, both involve time-stamped instances, one expects that techniques from the SPM domain are (partly) transferable to the PD domain. However, thus far, little work has been done in the intersection of the two fields. In this focus article, we therefore study the possible application of SPM techniques in the context of PD. We provide an overview of the two fields, covering their commonalities and differences, highlight the challenges of applying them, and, present an outlook and several avenues for future work. This article is categorized under: Algorithmic Development > Spatial and Temporal Data Mining Fundamental Concepts of Data and Knowledge > Key Design Issues in Data Mining Fundamental Concepts of Data and Knowledge > Big Data Mining
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Identification of hot regions in protein-protein interactions by sequential pattern mining
    Hsu, Chen-Ming
    Chen, Chien-Yu
    Liu, Baw-Jhiune
    Huang, Chih-Chang
    Laio, Min-Hung
    Lin, Chien-Chieh
    Wu, Tzung-Lin
    BMC BIOINFORMATICS, 2007, 8 (Suppl 5)
  • [22] Exploring the collective process of classroom dialogue using sequential pattern mining technique
    Song, Yu
    Cheng, Bo
    Zhu, Jia
    Hu, Xiaoyong
    INTERNATIONAL JOURNAL OF EDUCATIONAL RESEARCH, 2022, 115
  • [23] Enhancing medical evidence discovery through Interactive Pattern Recognition and Process Mining
    Traver, V.
    Martinez-Romero, A.
    Bayo, J. L.
    Sala, P.
    Carvalho, P.
    Henriques, J.
    Ruano, M. G.
    Bianchi, A.
    Fernandez-Llatas, C.
    2016 GLOBAL MEDICAL ENGINEERING PHYSICS EXCHANGES/PAN AMERICAN HEALTH CARE EXCHANGES (GMEPE/PAHCE), 2016,
  • [24] Application of sequential pattern mining algorithm in alarm information processing of power system
    Fan, X.-H. (xihuifan2002@163.com), 2005, Automation of Electric Power Systems Press (29):
  • [25] Online sequential pattern mining and association discovery by advanced artificial intelligence and machine learning techniques
    Shian-Chang Huang
    Chei-Chang Chiou
    Jui-Te Chiang
    Cheng-Feng Wu
    Soft Computing, 2020, 24 : 8021 - 8039
  • [26] Online sequential pattern mining and association discovery by advanced artificial intelligence and machine learning techniques
    Huang, Shian-Chang
    Chiou, Chei-Chang
    Chiang, Jui-Te
    Wu, Cheng-Feng
    SOFT COMPUTING, 2020, 24 (11) : 8021 - 8039
  • [27] Application of syntactic methods of pattern recognition for data mining and knowledge discovery in medicine
    Ogiela, MR
    Tadeusiewicz, R
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY II, 2000, 4057 : 308 - 318
  • [28] Analyzing Sequence Pattern Variants in Sequential Pattern Mining and Its Application to Electronic Medical Record Systems
    Le, Hieu Hanh
    Yamada, Tatsuhiro
    Honda, Yuichi
    Kayahara, Masaaki
    Kushima, Muneo
    Araki, Kenji
    Yokota, Haruo
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, 2019, 11707 : 393 - 408
  • [29] An efficient method of web sequential pattern mining based on session filter and transaction identification
    Zhu, Jingjun
    Wu, Haiyan
    Gao, Guozhu
    Journal of Networks, 2010, 5 (09) : 1017 - 1024
  • [30] Dynamic customer preference analysis for product portfolio identification using sequential pattern mining
    Yu, Li
    Zhang, Zaifang
    Shen, Jin
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2017, 117 (02) : 365 - 381