An Expanded Prefix Tree-based Mining Algorithm for Sequential Pattern Maintenance with Deletions

被引:0
|
作者
Hoang Thi Hong Van [1 ]
Vo Thi Ngoc Chau [2 ]
Nguyen Hua Phung [2 ]
机构
[1] Ton Duc Thang Univ, Fac Informat Technol, Ctr Appl Informat Technol, Ho Chi Minh City, Vietnam
[2] Ho Chi Minh City Univ Technol, Fac Comp Sci & Engn, Dept Comp Sci, Dept Informat Syst, Ho Chi Minh City, Vietnam
关键词
sequential pattern mining; sequence deletion; incremental mining; expanded prefix tree;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sequential pattern mining is an important mining task for discovering sequential patterns along with their insight relationships in many real-world applications. In practice, sequence databases are kept changing over the time along with their business. For some reasons, some sequences in the database are asked to be deleted from the database. In order to have a synchronization of discovered sequential patterns with the database from which they have been discovered, the sequential pattern mining task is re-considered with many challenges. As the number of deleted sequences is often smaller than the size of the entire database, re-mining from scratch the updated database might incur a high cost because sequential pattern mining is a computationally expensive task. In this paper, our work aims at an efficient incremental mining solution to the sequential pattern mining task with sequence deletions. Different from the existing works, we propose an expanded prefix tree by extending the existing prefix tree with additional structures for capturing more necessary information for the incremental mining process. Based on this tree, we propose an incremental sequential pattern mining algorithm, SPMD, for finding a complete set of sequential patterns with no re-scanning the original database, when a number of sequences in the database are deleted. Experimental results on the benchmark databases have confirmed that our SPMD algorithm outperforms the re-mining from scratch with the PrefixSpan algorithm with less running time.
引用
收藏
页码:11 / 16
页数:6
相关论文
共 50 条
  • [21] Analysis of tree-based uncertain frequent pattern mining techniques without pattern losses
    Lee, Gangin
    Yun, Unil
    Lee, Kyung-Min
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (11): : 4296 - 4318
  • [22] PREFIX-PROJECTION Global Constraint for Sequential Pattern Mining
    Kemmar, Amina
    Loudni, Samir
    Lebbah, Yahia
    Boizumault, Patrice
    Charnois, Thierry
    PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, CP 2015, 2015, 9255 : 226 - 243
  • [23] On the Sequential Pattern Mining Algorithm Based on Projection position
    Li, Taoshen
    Wang, Weina
    Chen, Qingfeng
    PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), 2013, : 460 - 463
  • [24] The Sequential Pattern Mining Algorithm MHSP Based on MH
    Wang, Jun
    Jiang, Yaqiong
    ADVANCED RESEARCH ON MECHANICAL ENGINEERING, INDUSTRY AND MANUFACTURING ENGINEERING, PTS 1 AND 2, 2011, 63-64 : 425 - +
  • [25] A Improved Sequential Pattern Mining Algorithm Based on PrefixSpan
    Xue Fei
    Shan Zheng
    Yan Li-jing
    Fan Chao
    2016 WORLD AUTOMATION CONGRESS (WAC), 2016,
  • [26] ACV constraint based sequential pattern mining algorithm
    Ye, Hong-Yun
    Ni, Zhi-Wei
    Ni, Li-Ping
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2010, 23 (06): : 802 - 808
  • [27] An Incremental Closed Frequent Itemsets Mining Algorithm Based on Shadow Prefix Tree
    Li, Yun
    Xu, Jie
    Zhang, Xiaobing
    Li, Chen
    Zhang, Yingjuan
    2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 440 - 445
  • [28] A sequential tree approach for incremental sequential pattern mining
    Boghey, Rajesh Kumar
    Singh, Shailendra
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2016, 41 (12): : 1369 - 1380
  • [29] A sequential tree approach for incremental sequential pattern mining
    Rajesh Kumar Boghey
    Shailendra Singh
    Sādhanā, 2016, 41 : 1369 - 1380
  • [30] A MINING ALGORITHM FOR FREQUENT CLOSED PATTERN ON DATA STREAM BASED ON SUB-STRUCTURE COMPRESSED IN PREFIX-TREE
    Fan Muhan
    Shao Sujie
    Rui Lanlan
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 434 - 439