An Expanded Prefix Tree-based Mining Algorithm for Sequential Pattern Maintenance with Deletions

Citations: 0
Authors
Hoang Thi Hong Van [1]
Vo Thi Ngoc Chau [2]
Nguyen Hua Phung [2]
Affiliations
[1] Ton Duc Thang Univ, Fac Informat Technol, Ctr Appl Informat Technol, Ho Chi Minh City, Vietnam
[2] Ho Chi Minh City Univ Technol, Fac Comp Sci & Engn, Dept Comp Sci, Dept Informat Syst, Ho Chi Minh City, Vietnam
Keywords
sequential pattern mining; sequence deletion; incremental mining; expanded prefix tree
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Sequential pattern mining is an important data mining task that discovers sequential patterns, and the relationships they reveal, in many real-world applications. In practice, sequence databases change over time along with the business they support, and some sequences may need to be deleted. Keeping the discovered sequential patterns synchronized with the database from which they were mined then requires revisiting the mining task, which raises many challenges. Because the number of deleted sequences is often smaller than the size of the entire database, re-mining the updated database from scratch can be costly, as sequential pattern mining is computationally expensive. This paper presents an efficient incremental solution to sequential pattern mining with sequence deletions. Unlike existing works, we propose an expanded prefix tree, which extends the existing prefix tree with additional structures that capture the information needed by the incremental mining process. Based on this tree, we propose an incremental sequential pattern mining algorithm, SPMD, that finds the complete set of sequential patterns without re-scanning the original database when a number of sequences are deleted. Experimental results on benchmark databases confirm that SPMD requires less running time than re-mining from scratch with the PrefixSpan algorithm.
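The idea of maintaining patterns under deletions without re-scanning the database can be illustrated with a small sketch. This is not the SPMD algorithm or the expanded prefix tree from the paper, whose details the abstract does not give. It is a simplified stand-in that assumes sequences of single items, an absolute minimum support count, and that each mined pattern keeps the set of ids of the sequences that support it. The names `mine`, `delete_sequences`, and `is_subsequence` are hypothetical.

```python
from typing import Dict, Set, Tuple

Sequence = Tuple[str, ...]


def is_subsequence(pattern: Sequence, seq: Sequence) -> bool:
    """True if the items of `pattern` occur in `seq` in the same order."""
    it = iter(seq)
    return all(item in it for item in pattern)


def mine(db: Dict[int, Sequence], min_sup: int) -> Dict[Sequence, Set[int]]:
    """Naive prefix-growth mining: map each frequent pattern to its supporting ids."""
    items = sorted({x for s in db.values() for x in s})
    result: Dict[Sequence, Set[int]] = {}

    def grow(prefix: Sequence, sids: Set[int]) -> None:
        # Extend the current prefix by one item; only sequences that already
        # support the prefix can support the extension.
        for item in items:
            cand = prefix + (item,)
            support = {sid for sid in sids if is_subsequence(cand, db[sid])}
            if len(support) >= min_sup:
                result[cand] = support
                grow(cand, support)

    grow((), set(db))
    return result


def delete_sequences(
    patterns: Dict[Sequence, Set[int]], deleted: Set[int], min_sup: int
) -> Dict[Sequence, Set[int]]:
    """Update supports after deletions using only the stored ids (no database scan)."""
    updated: Dict[Sequence, Set[int]] = {}
    for pattern, sids in patterns.items():
        remaining = sids - deleted
        if len(remaining) >= min_sup:
            updated[pattern] = remaining
    return updated


# Hypothetical toy database: sequence id -> sequence of items.
db = {1: ("a", "b", "c"), 2: ("a", "c"), 3: ("a", "b"), 4: ("b", "c")}
patterns = mine(db, min_sup=2)
updated = delete_sequences(patterns, deleted={3}, min_sup=2)
remined = mine({sid: s for sid, s in db.items() if sid != 3}, min_sup=2)
```

With an absolute support count, a deletion can only lower a pattern's support, so subtracting the deleted ids and pruning is enough in this sketch. With a relative threshold (a fraction of the database size), the count needed falls as the database shrinks, so previously infrequent patterns may become frequent. Handling that case requires keeping extra information about such patterns, which this sketch does not do.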
Pages: 11-16
Page count: 6
Related papers
50 in total
  • [1] Prefix tree-based algorithm for mining frequent patterns over data streams
    Yang, Junrui
    Huang, Wei
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2010, 38 (07): 107 - 110
  • [2] Mining Sequential Rules Based on Prefix-Tree
    Thien-Trang Van
    Bay Vo
    Bac Le
    [J]. NEW CHALLENGES FOR INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2011, 351 : 147 - +
  • [3] A frequent biological pattern mining algorithm using prefix tree
    Yao, Xuecun
    [J]. Journal of Computational Information Systems, 2011, 7 (13): 4980 - 4988
  • [4] An efficient tree-based algorithm for mining sequential patterns with multiple minimum supports
    Hu, Ya-Han
    Wu, Fan
    Liao, Yi-Jiun
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2013, 86 (05) : 1224 - 1238
  • [5] Prefix and Suffix Sequential Pattern Mining
    Singh, Rina
    Graves, Jeffrey A.
    Talbert, Douglas A.
    Eberle, William
    [J]. ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS (ICDM 2018), 2018, 10933 : 309 - 324
  • [6] A sequential pattern mining algorithm based on improved FP-tree
    Sui, Yi
    Shao, FengJing
    Sun, RenCheng
    Wang, JinLong
    [J]. PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008 : 440 - +
  • [7] An Algorithm of Mining Sequential Pattern with Wildcards Based on Index-Tree
    Wang, Le
    Wang, Shui
    Liu, Sheng-Lan
    Wang, Hui-Bing
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (03): 554 - 565
  • [8] Efficient frequent pattern mining based on Linear Prefix tree
    Pyun, Gwangbum
    Yun, Unil
    Ryu, Keun Ho
    [J]. KNOWLEDGE-BASED SYSTEMS, 2014, 55 : 125 - 139
  • [9] Performance Analysis of Tree-Based Approaches for Pattern Mining
    Borah, Anindita
    Nath, Bhabesh
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, 2019, 711 : 435 - 448
  • [10] An efficient frequent pattern mining algorithm using a highly compressed prefix tree
    Zhu, Xiaolin
    Liu, Yongguo
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 : S153 - S173