Efficient indexing structures for mining frequent patterns

Cited by: 7
Authors
Bin, L [1]
Ooi, BC [1]
Tan, KL [1]
Affiliation
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117543, Singapore
DOI
10.1109/ICDE.2002.994758
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we propose a variant of the signature file, called the Bit-Sliced Bloom-Filtered Signature File (BBS), as the basis for implementing filter-and-refine strategies for mining frequent patterns. In the filtering step, candidate patterns are obtained by scanning the BBS instead of the database. The resulting candidate set is a superset of the frequent patterns. In the refinement step, each algorithm prunes the false drops from the candidate set. Based on this indexing structure, we study two filtering mechanisms (single and dual filter) and two refinement mechanisms (sequential scan and probe), giving rise to four different strategies. We conducted an extensive performance study to evaluate the effectiveness of BBS, and compared the four proposed processing schemes with the traditional Apriori algorithm and the recently proposed FP-tree scheme. Our results show that BBS, as a whole, outperforms the Apriori strategy. Moreover, the scheme based on dual filter and probe refinement performs best in all cases.
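The filter-and-refine idea described in the abstract can be sketched roughly as follows. This is a minimal Python illustration under stated assumptions, not the paper's implementation: the hashing scheme (`bit_positions`), the parameters `m` (signature width) and `k` (hash functions per item), the level-wise candidate generation, and all function names are hypothetical. Only one combination is shown, a single filter followed by sequential-scan refinement; the dual filter and probe mechanisms are not reproduced here.

```python
import hashlib


def bit_positions(item, m, k):
    """Hypothetical Bloom-style hashing: k bit positions in [0, m) for one item."""
    positions = []
    for seed in range(k):
        digest = hashlib.sha256(f"{seed}:{item}".encode()).digest()
        positions.append(int.from_bytes(digest[:8], "big") % m)
    return positions


def build_slices(transactions, m, k):
    """Bit-sliced layout: slices[j] is an int whose bit t is set when
    transaction t's signature has bit j set."""
    slices = [0] * m
    for t, items in enumerate(transactions):
        for item in items:
            for j in bit_positions(item, m, k):
                slices[j] |= 1 << t
    return slices


def estimated_support(itemset, slices, n, m, k):
    """AND the slices selected by the itemset and count surviving transactions.
    Hash collisions can only add transactions, so this never undercounts."""
    acc = (1 << n) - 1  # start with all n transactions
    for item in itemset:
        for j in bit_positions(item, m, k):
            acc &= slices[j]
    return bin(acc).count("1")


def exact_support(itemset, transactions):
    """Sequential scan of the database (the refinement check)."""
    wanted = set(itemset)
    return sum(1 for t in transactions if wanted <= set(t))


def filter_candidates(slices, items, n, m, k, min_support):
    """Filtering step: grow candidates level by level using only the slices."""
    candidates = []
    level = [(i,) for i in items]
    while level:
        kept = [c for c in level
                if estimated_support(c, slices, n, m, k) >= min_support]
        candidates.extend(kept)
        # Extend each surviving itemset with lexicographically larger items.
        level = [c + (i,) for c in kept for i in items if i > c[-1]]
    return candidates


def mine(transactions, min_support, m=64, k=2):
    """Return (candidates, frequent): the filter output and the refined result."""
    n = len(transactions)
    items = sorted({i for t in transactions for i in t})
    slices = build_slices(transactions, m, k)
    candidates = filter_candidates(slices, items, n, m, k, min_support)
    # Refinement step: drop false drops by checking against the real data.
    frequent = [c for c in candidates
                if exact_support(c, transactions) >= min_support]
    return candidates, frequent


if __name__ == "__main__":
    db = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a", "b", "c"]]
    candidates, frequent = mine(db, min_support=3)
    print(len(candidates), "candidates;", len(frequent), "frequent itemsets")
```

In this sketch the estimated support can only shrink as an itemset grows, because more slices are ANDed together, so pruning on the estimate never discards a truly frequent itemset. A smaller `m` makes the index more compact but tends to produce more false drops for the refinement step to remove.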
Pages: 453-462
Number of pages: 10