BIDE: Efficient mining of frequent closed sequences

被引:284
|
作者
Wang, JY [1 ]
Han, JW [1 ]
机构
[1] Univ Minnesota Twin Cities, Digital Technol Ctr, Minneapolis, MN 55455 USA
来源
20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS | 2004年
关键词
D O I
10.1109/ICDE.2004.1319986
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous studies have presented convincing arguments that a frequent pattern mining algorithm should not mine all frequent patterns but only the closed ones because the latter leads to not only more compact yet complete result set but also better efficiency. However most of the previously developed closed pattern mining algorithms work under the candidate maintenance-and-test paradigm which is inherently costly in both runtime and space usage when the support threshold is low or the patterns become long. In this paper we present, BIDE, an efficient algorithm for mining frequent closed sequences without candidate maintenance. It adopts a novel sequence closure checking scheme called BI-Directional Extension, and prunes the search space more deeply compared to the previous algorithms by using the BackScan pruning method and the Scan Skip optimization technique. A thorough performance study with both sparse and dense real-life data sets has demonstrated that BIDE significantly outperforms the previous algorithms: it consumes order(s) of magnitude less memory and can be more than an order of magnitude faster It is also linearly scalable in terms of database size.
引用
收藏
页码:79 / 90
页数:12
相关论文
共 50 条
  • [41] Efficient Data Streams Based Closed Frequent Itemsets Mining Algorithm
    Tan, Jun
    ADVANCES IN CIVIL ENGINEERING II, PTS 1-4, 2013, 256-259 : 2910 - 2913
  • [42] Efficient algorithms of mining top-k frequent closed itemsets
    Lan Yongjie
    Qiu Yong
    ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL II, 2007, : 551 - 554
  • [43] FCHUIM: Efficient Frequent and Closed High-Utility Itemsets Mining
    Wei, Tianyou
    Wang, Bin
    Zhang, Yuntian
    Hu, Keyong
    Yao, Yinfeng
    Liu, Hao
    IEEE ACCESS, 2020, 8 : 109928 - 109939
  • [44] GRG: An efficient method for association rules mining on frequent closed itemsets
    Li, L
    Zhai, DH
    Jin, F
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2003, : 854 - 859
  • [45] An efficient closed frequent itemset miner for the MOA stream mining system
    Quadrana, Massimo
    Bifet, Albert
    Gavalda, Ricard
    AI COMMUNICATIONS, 2015, 28 (01) : 143 - 158
  • [46] An Efficient Algorithm for Mining Frequent Closed Itemsets over Data Stream
    Li Guodong
    Xia Kewen
    NEW TRENDS IN MECHATRONICS AND MATERIALS ENGINEERING, 2012, 151 : 570 - 575
  • [47] Efficient Mining of Frequent Target Episodes from Complex Event Sequences
    Lin, Yu-Feng
    Jiang, Pei-Wen
    Tseng, Vincent S.
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 501 - 510
  • [48] Efficient algorithms for mining maximal frequent concatenate sequences in biological datasets
    Pan, J
    Wang, P
    Wang, W
    Shi, B
    Yang, GX
    FIFTH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - PROCEEDINGS, 2005, : 98 - 104
  • [49] Combination of dynamic bit vectors and transaction information for mining frequent closed sequences efficiently
    Minh-Thai Tran
    Bac Le
    Bay Vo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 38 : 183 - 189
  • [50] Mining frequent closed itemsets with the frequent pattern list
    Tseng, FC
    Hsu, CC
    Chen, H
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 653 - 654