An Expanded Prefix Tree-based Mining Algorithm for Sequential Pattern Maintenance with Deletions

Authors
Hoang Thi Hong Van [1]
Vo Thi Ngoc Chau [2]
Nguyen Hua Phung [2]
Affiliations
[1] Ton Duc Thang Univ, Fac Informat Technol, Ctr Appl Informat Technol, Ho Chi Minh City, Vietnam
[2] Ho Chi Minh City Univ Technol, Fac Comp Sci & Engn, Dept Comp Sci, Dept Informat Syst, Ho Chi Minh City, Vietnam
Keywords
sequential pattern mining; sequence deletion; incremental mining; expanded prefix tree
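DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812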
Abstract
Sequential pattern mining is an important task for discovering sequential patterns, along with the relationships they reveal, in many real-world applications. In practice, sequence databases keep changing over time along with the business they support, and some sequences may need to be deleted from the database. Keeping the discovered sequential patterns synchronized with the database from which they were mined means revisiting the mining task, which poses many challenges. Because the number of deleted sequences is often small compared with the size of the entire database, re-mining the updated database from scratch can incur a high cost, since sequential pattern mining is computationally expensive. This paper aims at an efficient incremental solution to sequential pattern mining with sequence deletions. Unlike existing works, we propose an expanded prefix tree that extends the existing prefix tree with additional structures capturing the information needed by the incremental mining process. Based on this tree, we propose an incremental sequential pattern mining algorithm, SPMD, that finds the complete set of sequential patterns without re-scanning the original database when a number of sequences in the database are deleted. Experimental results on benchmark databases confirm that SPMD requires less running time than re-mining from scratch with the PrefixSpan algorithm.
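The general idea of maintaining patterns under deletions, without going back to the database, can be illustrated with a minimal Python sketch. This is not the authors' SPMD algorithm or their expanded prefix tree, whose details the abstract does not give. The names `Node` and `PrefixTreeIndex`, the per-node sets of sequence ids, the `max_len` cap, the restriction to sequences of single items, and the use of an absolute support count are all assumptions made for illustration.

```python
from itertools import combinations


class Node:
    """One pattern in the tree; `sids` holds the ids of sequences containing it."""

    def __init__(self):
        self.children = {}
        self.sids = set()


class PrefixTreeIndex:
    """Toy prefix tree over subsequence patterns (hypothetical, not SPMD)."""

    def __init__(self, max_len=3):
        self.root = Node()
        self.max_len = max_len
        self.live = set()  # ids of sequences currently in the database

    @property
    def n_sequences(self):
        return len(self.live)

    def add(self, sid, sequence):
        """Register every subsequence of `sequence` with at most max_len items."""
        self.live.add(sid)
        for k in range(1, self.max_len + 1):
            # combinations() keeps the original item order, so each tuple
            # is a genuine subsequence of `sequence`.
            for pattern in set(combinations(sequence, k)):
                node = self.root
                for item in pattern:
                    node = node.children.setdefault(item, Node())
                node.sids.add(sid)

    def delete(self, sid):
        """Remove one sequence using only the tree, not the original data."""
        self.live.remove(sid)  # raises KeyError for an unknown id
        self._discard(self.root, sid)

    def _discard(self, node, sid):
        for item in list(node.children):
            child = node.children[item]
            if sid not in child.sids:
                continue  # no longer pattern below can be supported by sid
            child.sids.discard(sid)
            self._discard(child, sid)
            if not child.sids:
                del node.children[item]  # pattern no longer occurs anywhere

    def frequent(self, min_count):
        """Return {pattern: support} for patterns with support >= min_count."""
        found = {}
        stack = [((), self.root)]
        while stack:
            prefix, node = stack.pop()
            for item, child in node.children.items():
                if len(child.sids) >= min_count:
                    pattern = prefix + (item,)
                    found[pattern] = len(child.sids)
                    stack.append((pattern, child))
        return found


index = PrefixTreeIndex(max_len=3)
database = {1: ["a", "b", "c"], 2: ["a", "c"], 3: ["b", "c"], 4: ["a", "b"]}
for sid, seq in database.items():
    index.add(sid, seq)

before = index.frequent(min_count=2)
index.delete(4)  # the pattern (a, b) drops to support 1
after = index.frequent(min_count=2)
```

Storing the supporting sequence ids at each node is what lets `delete` update supports locally, at the cost of memory that grows quickly with `max_len`; a practical algorithm would need a far more compact structure.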
Pages: 11-16
Number of pages: 6