Accelerated Frequent Closed Sequential Pattern Mining for uncertain data

被引:7
|
作者
You, Tao [1 ]
Sun, Yue [1 ]
Zhang, Ying [1 ]
Chen, Jinchao [1 ]
Zhang, Peng [1 ]
Yang, Mei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 610072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Uncertain database; Frequent closed sequences; Possible world semantics; SEQUENCES; ALGORITHM; ITEMSETS;
D O I
10.1016/j.eswa.2022.117254
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data uncertainty has been taken into a consideration for mining and discovery of its hidden knowledge in a variety of applications. Due to the fact that the nature of closed sequences is closely related to possible world, more recent studies on uncertain closed sequential data mining has usually been challenged by the explosive possible worlds, which is exponential to the number of uncertain sequences in the database. Although basic Probabilistic Frequent Closed Sequences Mining (PFCSM-FF) strategy can solve this problem preliminarily, the inclusion-exclusion rules and closure checking methods used in PFCSM-FF makes mining algorithm very inefficient. And on this basis, another two improvements, PFCSM-CF and PFCSM-CC algorithms, are designed to reduce the search space and simplify the candidate sequence database, which significantly compress the computational scale. Substantial experiments on the real and synthetic datasets have demonstrated the efficiency improvement on the proposed PFCSM-CC and PFCSM-CF methods. Besides, the high usability of the proposed PFCSM-CC algorithm is further demonstrated according to the similarity of the time spent on existing probabilistic frequent sequence mining algorithm.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Finding frequent trajectories by clustering and sequential pattern mining
    Shaw, Arthur A.
    Gopalan, N. P.
    JOURNAL OF TRAFFIC AND TRANSPORTATION ENGINEERING-ENGLISH EDITION, 2014, 1 (06) : 393 - 403
  • [42] Similarity analysis of frequent sequential activity pattern mining
    Shou, Zhenyu
    Di, Xuan
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 96 : 122 - 143
  • [43] Mining frequent closed patterns in microarray data
    Cong, G
    Tan, KL
    Tung, AKH
    Pan, F
    FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 363 - 366
  • [44] Mining frequent closed itemsets for large data
    Fu, HG
    Nguifo, EM
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA'04), 2004, : 328 - 335
  • [45] An Efficient Parallel Method for Mining Frequent Closed Sequential Patterns
    Bao Huynh
    Bay Vo
    Snasel, Vaclav
    IEEE ACCESS, 2017, 5 : 17392 - 17402
  • [46] Probabilistic Frequent Pattern Growth for Itemset Mining in Uncertain Databases
    Bernecker, Thomas
    Kriegel, Hans-Peter
    Renz, Matthias
    Verhein, Florian
    Zuefle, Andreas
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012, 2012, 7338 : 38 - 55
  • [47] Mining of Frequent Itemsets from Streams of Uncertain Data
    Leung, Carson Kai-Sang
    Hao, Boyu
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 1663 - 1670
  • [48] Mining frequent patterns from univariate uncertain data
    Liu, Ying-Ho
    DATA & KNOWLEDGE ENGINEERING, 2012, 71 (01) : 47 - 68
  • [49] Closed sequential pattern mining in high dimensional sequences
    Han, Meng
    Wang, Zhihai
    Yuan, Jidong
    Journal of Software, 2013, 8 (06) : 1368 - 1373
  • [50] A closed sequential pattern mining algorithm in time order
    Fu, Yu
    Yu, Yan-Hua
    Song, Mei-Na
    Zhan, Xiao-Su
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2013, 36 (04): : 19 - 22