On the application of sequential pattern mining primitives to process discovery: Overview, outlook and opportunity identification

被引:9
|
作者
Hassani, Marwan [1 ]
van Zelst, Sebastiaan J. [2 ,3 ]
van der Aalst, Wil M. P. [2 ,3 ]
机构
[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] FIT, Fraunhofer Inst Appl Informat Technol, St Augustin, Germany
[3] Rhein Westfal TH Aachen, Proc & Data Sci Grp, Aachen, Germany
关键词
data streams; distributed sequential pattern mining; process mining; sequential pattern mining; MODELS;
D O I
10.1002/widm.1315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequential pattern mining (SPM) is a well-studied theme in data mining, in which one aims to discover common sequences of item sets in a large corpus of temporal itemset data. Due to the sequential nature of data streams, supporting SPM in streaming environments is commonly studied in the area of data stream mining as well. On the other hand, stream-based process discovery (PD), originating from the field of process mining, focusses on learning process models on the basis of online event data. In particular, the main goal of the models discovered is to describe the underlying generating process in an end-to-end fashion. As both SPM and PD use data that are comparable in nature, that is, both involve time-stamped instances, one expects that techniques from the SPM domain are (partly) transferable to the PD domain. However, thus far, little work has been done in the intersection of the two fields. In this focus article, we therefore study the possible application of SPM techniques in the context of PD. We provide an overview of the two fields, covering their commonalities and differences, highlight the challenges of applying them, and, present an outlook and several avenues for future work. This article is categorized under: Algorithmic Development > Spatial and Temporal Data Mining Fundamental Concepts of Data and Knowledge > Key Design Issues in Data Mining Fundamental Concepts of Data and Knowledge > Big Data Mining
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Knowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey
    Shivaprasad, G.
    Subbareddy, N. V.
    Acharya, U. Dinesh
    INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN SCIENCE AND TECHNOLOGY (ICM2ST-10), 2010, 1324 : 319 - 323
  • [42] CISP-Growth: A Contiguous Item Sequential Pattern mining algorithm with application level IO patterns
    Zhang Jing-Liang
    Zhang Jun-Wei
    Zhang Jian-Gang
    Han Xiao-Ming
    Xu Lu
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, PROCEEDINGS, 2009, : 119 - 126
  • [43] B-mine: Frequent Pattern Mining and Its Application to Knowledge Discovery from Social Networks
    Jiang, Fan
    Leung, Carson K.
    Zhang, Hao
    WEB TECHNOLOGIES AND APPLICATIONS, PT I, 2016, 9931 : 316 - 328
  • [44] Application of data mining and process knowledge discovery in sheet metal assembly dimensional variation diagnosis
    Lian, J
    Lai, XM
    Lin, ZQ
    Yao, FS
    JOURNAL OF MATERIALS PROCESSING TECHNOLOGY, 2002, 129 (1-3) : 315 - 320
  • [46] Supervised sequential pattern mining of event sequences in sport to identify important patterns of play: An application to rugby union
    Bunker, Rory
    Fujii, Keisuke
    Hanada, Hiroyuki
    Takeuchi, Ichiro
    PLOS ONE, 2021, 16 (09):
  • [47] TapTree: Process-Tree Based Host Behavior Modeling and Threat Detection Framework via Sequential Pattern Mining
    Mamun, Mohammad
    Buffett, Scott
    INFORMATION AND COMMUNICATIONS SECURITY, ICICS 2022, 2022, 13407 : 546 - 565
  • [48] Sequential Pattern Mining to Predict Medical In-Hospital Mortality from Administrative Data: Application to Acute Coronary Syndrome
    Pinaire, Jessica
    Chabert, Etienne
    Aze, Jerome
    Bringay, Sandra
    Landais, Paul
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [49] A local and global statistics pattern analysis method and its application to process fault identification
    Zhang, Hanyuan
    Tian, Xuemin
    Deng, Xiaogang
    Cai, Lianfang
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2015, 23 (11) : 1782 - 1792
  • [50] Extending Process Discovery with Model Complexity Optimization and Cyclic States Identification: Application to Healthcare Processes
    Elkhovskaya, Liubov O.
    Kshenin, Alexander D.
    Balakhontceva, Marina A.
    Ionov, Mikhail V.
    Kovalchuk, Sergey V.
    ALGORITHMS, 2023, 16 (01)