On the application of sequential pattern mining primitives to process discovery: Overview, outlook and opportunity identification

被引:9
|
作者
Hassani, Marwan [1 ]
van Zelst, Sebastiaan J. [2 ,3 ]
van der Aalst, Wil M. P. [2 ,3 ]
机构
[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] FIT, Fraunhofer Inst Appl Informat Technol, St Augustin, Germany
[3] Rhein Westfal TH Aachen, Proc & Data Sci Grp, Aachen, Germany
关键词
data streams; distributed sequential pattern mining; process mining; sequential pattern mining; MODELS;
D O I
10.1002/widm.1315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequential pattern mining (SPM) is a well-studied theme in data mining, in which one aims to discover common sequences of item sets in a large corpus of temporal itemset data. Due to the sequential nature of data streams, supporting SPM in streaming environments is commonly studied in the area of data stream mining as well. On the other hand, stream-based process discovery (PD), originating from the field of process mining, focusses on learning process models on the basis of online event data. In particular, the main goal of the models discovered is to describe the underlying generating process in an end-to-end fashion. As both SPM and PD use data that are comparable in nature, that is, both involve time-stamped instances, one expects that techniques from the SPM domain are (partly) transferable to the PD domain. However, thus far, little work has been done in the intersection of the two fields. In this focus article, we therefore study the possible application of SPM techniques in the context of PD. We provide an overview of the two fields, covering their commonalities and differences, highlight the challenges of applying them, and, present an outlook and several avenues for future work. This article is categorized under: Algorithmic Development > Spatial and Temporal Data Mining Fundamental Concepts of Data and Knowledge > Key Design Issues in Data Mining Fundamental Concepts of Data and Knowledge > Big Data Mining
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Sequential Pattern Mining Method for Analysis of Programming Learning History Based on the Learning Process
    Nakamura, Shoichi
    Nozaki, Kaname
    Morimoto, Yasuhiko
    Miyadera, Youzou
    2014 INTERNATIONAL CONFERENCE ON EDUCATION TECHNOLOGIES AND COMPUTERS (ICETC), 2014, : 55 - 60
  • [32] Weighted Sequential Pattern Mining Algorithm Research based on Well Completion Business Process
    Du, Ruishan
    Shang, Fuhua
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 5226 - 5231
  • [33] Discovery of deep order-preserving submatrix in DNA microarray data based on sequential pattern mining
    Liu, Zhiwen
    Xue, Yun
    Li, Meihang
    Ma, Bo
    Zhang, Meizhen
    Chen, Xin
    Hu, Xiaohui
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 17 (03) : 217 - 237
  • [34] Pattern Discovery and Rule Mining of Drivers' Perception and Operation During Lane Changing Process
    Long Y.
    Huang J.-L.
    Zhao X.-H.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2021, 21 (03): : 237 - 246
  • [35] Sequential Pattern Mining and it's Application in the Analysis of Library Readers Lending Historical Data
    Chen Geng
    Zhu Yuquan
    Han Zhigeng
    Chen Shenglei
    ELECTRONIC INFORMATION AND ELECTRICAL ENGINEERING, 2012, 19 : 691 - 695
  • [36] An overview of emerging pattern mining in supervised descriptive rule discovery: taxonomy, empirical study, trends, and prospects
    Garcia-Vico, A. M.
    Carmona, C. J.
    Martin, D.
    Garcia-Borroto, M.
    del Jesus, M. J.
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (01)
  • [37] An application of data mining and knowledge discovery process in the field of natural gas exploration
    Acar, Mehmet Akif
    Tolun, Mehmet R.
    Elbasi, Ersin
    2014 IEEE 8th International Conference on Application of Information and Communication Technologies (AICT), 2014, : 252 - 257
  • [38] Application of FP_Growth Algorithm of Sequential Pattern Mining on Container Maintenance Components Association
    Lingxizhu
    Yufeiguo
    Jingyiwang
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 1026 - 1031
  • [39] Learning Process Analysis Based on Sequential Pattern Mining and Lag Sequential Analysis in a Web-based Inquiry Science Environment
    Lin, Fang-Chun
    Chen, Chih-Ming
    Wang, Wen-Fang
    2017 6TH IIAI INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS (IIAI-AAI), 2017, : 655 - 660
  • [40] Application of microRNA Database Mining in Biomarker Discovery and Identification of Therapeutic Targets for Complex Disease
    Major, Jennifer L.
    Bagchi, Rushita A.
    Pires da Silva, Julie
    METHODS AND PROTOCOLS, 2021, 4 (01) : 1 - 11