Improved pattern tree for incremental frequent-pattern mining

被引:0
|
作者
Zhou M. [1 ]
Wang T. [1 ]
机构
[1] School of Mechanical Engineering, Tianjin University
基金
中国国家自然科学基金; 高等学校博士学科点专项科研基金;
关键词
Association rules; Data mining; Improved pattern tree; Incremental mining;
D O I
10.1007/s12209-010-0023-4
中图分类号
学科分类号
摘要
By analyzing the existing prefix-tree data structure, an improved pattern tree was introduced for processing new transactions. It firstly stored transactions in a lexicographic order tree and then restructured the tree by sorting each path in a frequency-descending order. While updating the improved pattern tree, there was no need to rescan the entire new database or reconstruct a new tree for incremental updating. A test was performed on synthetic dataset T10I4D100K with 100,000 transactions and 870 items. Experimental results show that the smaller the minimum support threshold, the faster the improved pattern tree achieves over CanTree for all datasets. As the minimum support threshold increased from 2% to 3.5%, the runtime decreased from 452.71 s to 186.26 s. Meanwhile, the runtime required by CanTree decreased from 1,367.03 s to 432.19 s. When the database was updated, the execution time of improved pattern tree consisted of construction of original improved pattern trees and reconstruction of initial tree. The experiment results showed that the runtime was saved by about 15% compared with that of CanTree. As the number of transactions increased, the runtime of improved pattern tree was about 25% shorter than that of FP-tree. The improved pattern tree also required less memory than CanTree. © Tianjin University and Springer-Verlag Berlin Heidelberg 2010.
引用
收藏
页码:129 / 134
页数:5
相关论文
共 50 条
  • [1] Improved Pattern Tree for Incremental Frequent-Pattern Mining
    周明
    王太勇
    [J]. Transactions of Tianjin University., 2010, 16 (02) - 134
  • [2] Improved Pattern Tree for Incremental Frequent-Pattern Mining
    周明
    王太勇
    [J]. Transactions of Tianjin University, 2010, (02) : 129 - 134
  • [3] CanTree: a canonical-order tree for incremental frequent-pattern mining
    Leung, Carson Kai-Sang
    Khan, Quamrul I.
    Li, Zhan
    Hoque, Tariqul
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 11 (03) : 287 - 311
  • [4] CanTree: a canonical-order tree for incremental frequent-pattern mining
    Carson Kai-Sang Leung
    Quamrul I. Khan
    Zhan Li
    Tariqul Hoque
    [J]. Knowledge and Information Systems, 2007, 11 : 287 - 311
  • [5] Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach
    Jiawei Han
    Jian Pei
    Yiwen Yin
    Runying Mao
    [J]. Data Mining and Knowledge Discovery, 2004, 8 : 53 - 87
  • [6] Mining condensed frequent-pattern bases
    Pei, J
    Dong, GZ
    Zou, W
    Han, HW
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2004, 6 (05) : 570 - 594
  • [7] Mining frequent patterns without candidate generation: A frequent-pattern tree approach
    Han, JW
    Pei, J
    Yin, YW
    Mao, RY
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) : 53 - 87
  • [8] Mining Condensed Frequent-Pattern Bases
    Jian Pei
    Guozhu Dong
    Wei Zou
    Jiawei Han
    [J]. Knowledge and Information Systems, 2004, 6 : 570 - 594
  • [9] Mining frequent patterns with incremental updating frequent pattern tree
    Zhu, Qunxiong
    Lin, Xiaoyong
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 5923 - +
  • [10] ExAnte: A processing method for frequent-pattern mining
    Bonchi, F
    Giannotti, F
    Mazzanti, A
    Pedreschi, D
    [J]. IEEE INTELLIGENT SYSTEMS, 2005, 20 (03) : 25 - 31