Efficient indexing structures for mining frequent patterns

被引:7
|
作者
Bin, L [1 ]
Ooi, BC [1 ]
Tan, KL [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117543, Singapore
关键词
D O I
10.1109/ICDE.2002.994758
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a variant of the signature file, called Bit-Sliced Bloom-Filtered Signature File (BBS), as the basis for implementing filter-and-refine strategies for mining frequent patterns. In the filtering step, the candidate patterns are obtained by scanning BBS instead of the database. The resultant candidate set contains a superset of the frequent patterns. In the refinement phase, each algorithm refines the candidate set to prune away the false drops. Based on this indexing structure, we study two filtering (single and dual filter) and two refinement (sequential scan and probe) mechanisms, thus giving rise to four different strategies. We conducted an extensive performance study to study the effectiveness of BBS, and compared the four proposed processing schemes with the traditional Apriori algorithm and the recently proposed FP-tree scheme. Our results show that BBS, as a whole, outperforms the Apriori strategy. Moreover, one of the schemes that is based on dual filter and probe refinement performs the best in all cases.
引用
收藏
页码:453 / 462
页数:10
相关论文
共 50 条
  • [21] CanTree: A tree structure for efficient incremental mining of frequent patterns
    Leung, CKS
    Khan, QI
    Hoque, T
    [J]. Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 274 - 281
  • [22] Mining Frequent Independent Patterns and Frequent Correlated Patterns Synchronously
    Zhou, Zhongmei
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 552 - 556
  • [23] An Efficient Count Based Transaction Reduction Approach For Mining Frequent Patterns
    Vijayalakshmi, V.
    Pethalakshmi, A.
    [J]. GRAPH ALGORITHMS, HIGH PERFORMANCE IMPLEMENTATIONS AND ITS APPLICATIONS (ICGHIA 2014), 2015, 47 : 52 - 61
  • [24] SPaMi-FTS: An Efficient Algorithm for Mining Frequent Sequential Patterns
    Kadir Febrer-Hernandez, Jose
    Hernandez-Palancar, Jose
    Hernandez-Leon, Raudel
    Feregrino-Uribe, Claudia
    [J]. PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 470 - 477
  • [25] Towards Efficient Mining of Periodic-Frequent Patterns in Transactional Databases
    Kiran, R. Uday
    Reddy, P. Krishna
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT 2, 2010, 6262 : 194 - 208
  • [26] EFFICIENT MINING OF LOCAL FREQUENT PERIODIC PATTERNS IN TIME SERIES DATABASE
    Gu, Cheng-Kui
    Dong, Xiao-Li
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 183 - 186
  • [27] An Efficient Algorithm for Mining Frequent Closed Inter-Transaction Patterns
    Thanh-Ngo Nguyen
    Nguyen, Loan T. T.
    Vo, Bay
    Ngoc-Thanh Nguyen
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 2019 - 2024
  • [28] Efficient prime-based method for interactive mining of frequent patterns
    Nadimi-Shahraki, Mohammad H.
    Mustapha, Norwati
    Sulaiman, Md. Nasir
    Mamat, Ali
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 12654 - 12670
  • [29] An efficient algorithm for mining maximal frequent patterns over data streams
    Yang, Junrui
    Wei, Yanjun
    Zhou, Fenfen
    [J]. 2015 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS IHMSC 2015, VOL II, 2015,
  • [30] An efficient algorithm for mining top-rank-k frequent patterns
    Thu-Lan Dam
    Li, Kenli
    Fournier-Viger, Philippe
    Quang-Huy Duong
    [J]. APPLIED INTELLIGENCE, 2016, 45 (01) : 96 - 111