Efficient indexing structures for mining frequent patterns

被引:7
|
作者
Bin, L [1 ]
Ooi, BC [1 ]
Tan, KL [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117543, Singapore
关键词
D O I
10.1109/ICDE.2002.994758
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a variant of the signature file, called Bit-Sliced Bloom-Filtered Signature File (BBS), as the basis for implementing filter-and-refine strategies for mining frequent patterns. In the filtering step, the candidate patterns are obtained by scanning BBS instead of the database. The resultant candidate set contains a superset of the frequent patterns. In the refinement phase, each algorithm refines the candidate set to prune away the false drops. Based on this indexing structure, we study two filtering (single and dual filter) and two refinement (sequential scan and probe) mechanisms, thus giving rise to four different strategies. We conducted an extensive performance study to study the effectiveness of BBS, and compared the four proposed processing schemes with the traditional Apriori algorithm and the recently proposed FP-tree scheme. Our results show that BBS, as a whole, outperforms the Apriori strategy. Moreover, one of the schemes that is based on dual filter and probe refinement performs the best in all cases.
引用
收藏
页码:453 / 462
页数:10
相关论文
共 50 条
  • [41] A Novel Efficient Mining Algorithm For Frequent Patterns On Biological Multiple Sequence
    Liu, Wei
    Chen, Ling
    [J]. FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 3697 - +
  • [42] Memory Efficient Mining of Periodic-Frequent Patterns in Transactional Databases
    Anirudh, A.
    Kiran, R. Uday
    Reddy, P. Krishna
    Kitsuregawa, Masaru
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [43] More Efficient Algorithm for Mining Frequent Patterns with Multiple Minimum Supports
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Chao, Han-Chieh
    [J]. WEB-AGE INFORMATION MANAGEMENT, PT I, 2016, 9658 : 3 - 16
  • [44] WTPMiner: Efficient mining of weighted frequent patterns based on graph traversals
    Geng, Runian
    Xu, Wenbo
    Dong, Xiangjun
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2007, 4798 : 412 - +
  • [45] Mining frequent ordered patterns
    Deng, ZH
    Ji, CR
    Zhang, M
    Tang, SW
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 150 - 154
  • [46] Mining Supplemental Frequent Patterns
    Liu, Yintian
    Liu, Yingming
    Zeng, Tao
    Xu, Kaikuo
    Tang, Rong
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 158 - +
  • [47] Frequent mining of subgraph structures
    Guo, Ping
    Wang, Xin-Ru
    Kang, Yan-Rong
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2006, 18 (04) : 513 - 521
  • [48] From path tree to frequent patterns: A framework for mining frequent patterns
    Xu, YB
    Yu, JX
    Liu, GM
    Lu, HJ
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 514 - 521
  • [49] Efficient mining of long frequent patterns from very large dense datasets
    Gopalan, RP
    Sucahyo, YG
    [J]. DESIGN AND APPLICATION OF HYBRID INTELLIGENT SYSTEMS, 2003, 104 : 652 - 661
  • [50] An efficient and effective algorithm for mining top-rank-k frequent patterns
    Quyen Huynh-Thi-Le
    Tuong Le
    Bay Vo
    Bac Le
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (01) : 156 - 164