A sequential pattern mining algorithm based on improved FP-tree

Cited by: 2
Authors
Sui, Yi [1]
Shao, FengJing [1]
Sun, RenCheng [1]
Wang, JinLong [2]
Affiliations
[1] Qingdao Univ, Coll Informat & Engn, Qingdao 266071, Peoples R China
[2] Qingdao Technol Univ, Sch Comp Engn, Qingdao, Peoples R China
DOI
10.1109/SNPD.2008.161
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sequential pattern mining is an important data mining problem with broad applications. Most previously developed sequential pattern mining methods need to scan the database many times. This study presents STMFP, an algorithm for sequential pattern mining based on an improved FP-tree. In the improved FP-tree structure, every node of the tree can store a set of items instead of a single item, so after a single scan of the sequential database the tree stores all the sequences. In addition, a novel mining method is proposed that combines nodes from leaf to root to help mine sequential patterns. The cost of mining sequential patterns is divided into two parts. The first is constructing the STMFP tree; this cost depends on the size of the sequential database. The second is finding the combinations of nodes, from leaf to root, along every path of the STMFP tree. Because the maximal path length is bounded by the maximal length of one transaction, and because existing common nodes help reduce the number of leaf nodes, the cost of this part must be much less than the size of the database. Compared with other methods, which need to scan the sequential database many times, the cost of our method must be less than two passes over the database. The whole mining process needs to scan the database only once.
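The abstract gives no implementation details, so the following is only a minimal sketch of the general idea it describes, not the STMFP algorithm itself: a prefix tree whose nodes hold itemsets, built in one pass over the database, and mined by combining the nodes found on each path walked from an end node up to the root. The names `Node`, `build_tree`, `path_to_root`, and `mine` are hypothetical, and the sketch assumes each itemset is matched as an atomic unit (no sub-itemset matching).

```python
from collections import Counter
from itertools import combinations


class Node:
    """Tree node holding a whole itemset rather than a single item."""

    def __init__(self, itemset=None, parent=None):
        self.itemset = itemset   # frozenset of items; None for the root
        self.parent = parent
        self.children = {}       # itemset -> child Node
        self.count = 0           # sequences passing through this node
        self.end_count = 0       # sequences ending exactly at this node


def build_tree(sequences):
    """Insert every sequence in a single pass over the database."""
    root = Node()
    for seq in sequences:
        node = root
        for items in seq:
            key = frozenset(items)
            if key not in node.children:
                node.children[key] = Node(key, node)
            node = node.children[key]
            node.count += 1
        node.end_count += 1
    return root


def path_to_root(node):
    """Walk parent links upward, then return the itemsets in sequence order."""
    path = []
    while node.parent is not None:
        path.append(node.itemset)
        node = node.parent
    return path[::-1]


def mine(root, min_support):
    """Count every ordered combination of nodes on each stored sequence's path."""
    support = Counter()
    stack = [root]
    while stack:
        node = stack.pop()
        stack.extend(node.children.values())
        if node.end_count == 0:
            continue  # no sequence ends here, so nothing to count
        path = path_to_root(node)
        # A set removes duplicate patterns so each sequence is counted once.
        patterns = {
            tuple(path[i] for i in idx)
            for r in range(1, len(path) + 1)
            for idx in combinations(range(len(path)), r)
        }
        for pattern in patterns:
            support[pattern] += node.end_count
    return {p: s for p, s in support.items() if s >= min_support}


# Toy database (made up for illustration): three sequences of itemsets.
db = [
    [{"a"}, {"b", "c"}, {"d"}],
    [{"a"}, {"b", "c"}],
    [{"a"}, {"d"}],
]
frequent = mine(build_tree(db), min_support=2)
```

In this sketch the per-path work grows exponentially with path length, which is why the bound on path length mentioned in the abstract matters; unlike the method described there, it does not exploit shared nodes to reduce that work.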
Pages: 440 / +
Page count: 2