BIDE: Efficient mining of frequent closed sequences

Cited by: 284
Authors
Wang, JY [1]
Han, JW [1]
Affiliations
[1] Univ Minnesota Twin Cities, Digital Technol Ctr, Minneapolis, MN 55455 USA
Source
20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS | 2004
Keywords
DOI
10.1109/ICDE.2004.1319986
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Previous studies have presented convincing arguments that a frequent pattern mining algorithm should not mine all frequent patterns but only the closed ones, because the latter leads not only to a more compact yet complete result set but also to better efficiency. However, most of the previously developed closed pattern mining algorithms work under the candidate maintenance-and-test paradigm, which is inherently costly in both runtime and space usage when the support threshold is low or the patterns become long. In this paper, we present BIDE, an efficient algorithm for mining frequent closed sequences without candidate maintenance. It adopts a novel sequence closure checking scheme called BI-Directional Extension, and prunes the search space more deeply than the previous algorithms by using the BackScan pruning method and the Scan Skip optimization technique. A thorough performance study with both sparse and dense real-life data sets has demonstrated that BIDE significantly outperforms the previous algorithms: it consumes order(s) of magnitude less memory and can be more than an order of magnitude faster. It is also linearly scalable in terms of database size.
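To make the abstract's notion of a "frequent closed sequence" concrete, here is a minimal brute-force sketch in Python. It is not the BIDE algorithm: it enumerates and stores every candidate subsequence, which is exactly the kind of candidate maintenance BIDE is designed to avoid. It assumes single-item events, a toy four-sequence database, and a minimum support of 2; the function names and the data are illustrative assumptions, not taken from the record. A frequent sequence counts as closed when no longer frequent sequence contains it with the same support.

```python
from itertools import combinations


def is_subsequence(pattern, sequence):
    """True if the items of `pattern` occur in `sequence` in order (gaps allowed)."""
    it = iter(sequence)
    return all(item in it for item in pattern)


def support(pattern, database):
    """Number of database sequences that contain `pattern` as a subsequence."""
    return sum(is_subsequence(pattern, seq) for seq in database)


def frequent_sequences(database, min_support):
    """Brute force: enumerate every subsequence of every input sequence.

    Exponential in sequence length, so only suitable for toy inputs.
    """
    candidates = set()
    for seq in database:
        for k in range(1, len(seq) + 1):
            for idx in combinations(range(len(seq)), k):
                candidates.add("".join(seq[i] for i in idx))
    frequent = {}
    for pattern in candidates:
        count = support(pattern, database)
        if count >= min_support:
            frequent[pattern] = count
    return frequent


def closed_sequences(frequent):
    """Keep patterns that have no proper super-sequence with the same support."""
    closed = {}
    for pattern, count in frequent.items():
        absorbed = any(
            len(other) > len(pattern)
            and other_count == count
            and is_subsequence(pattern, other)
            for other, other_count in frequent.items()
        )
        if not absorbed:
            closed[pattern] = count
    return closed


# Toy database (an assumption for illustration): each string is one sequence,
# each character is one item.
database = ["CAABC", "ABCB", "CABC", "ABBCA"]
frequent = frequent_sequences(database, min_support=2)
closed = closed_sequences(frequent)

for pattern, count in sorted(closed.items()):
    print(pattern, count)
```

On this toy input the closed set is much smaller than the full frequent set, yet the support of every frequent sequence can still be recovered from it, which is the "more compact yet complete" property the abstract refers to.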
Pages: 79 - 90
Number of pages: 12
Related papers
50 in total
  • [1] Efficient mining of frequent closed sequences with time constraints
    Huang G.
    Li M.
    Ren J.
    Journal of Convergence Information Technology, 2011, 6 (10) : 129 - 136
  • [2] FMaxCloHUSM: An efficient algorithm for mining frequent closed and maximal high utility sequences
    Tin Truong
    Hai Duong
    Bac Le
    Fournier-Viger, Philippe
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 85 : 1 - 20
  • [3] Frequent Closed Partial Orders Mining in Sequences
    Wang, Ye
    Jia, Yan
    Zhang, Lumin
    ADVANCES IN MECHATRONICS, AUTOMATION AND APPLIED INFORMATION TECHNOLOGIES, PTS 1 AND 2, 2014, 846-847 : 1304 - 1307
  • [4] Parallel algorithm for mining frequent closed sequences
    Ma, CX
    Li, QH
    AUTONOMOUS INTELLIGENT SYSTEMS: AGENTS AND DATA MINING, PROCEEDINGS, 2005, 3505 : 184 - 192
  • [5] Mining frequent closed structures in streaming melody sequences
    Li, HF
    Lee, SY
    Shan, MK
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1-3, 2004 : 2031 - 2034
  • [6] An efficient algorithm for mining frequent closed itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    Informatica (Slovenia), 2015, 39 (01) : 87 - 98
  • [7] An Efficient Algorithm for Mining Frequent Closed Itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2015, 39 (01) : 87 - 98
  • [8] SPADE: An Efficient Algorithm for Mining Frequent Sequences
    Mohammed J. Zaki
    Machine Learning, 2001, 42 : 31 - 60
  • [9] SPADE: An efficient algorithm for mining frequent sequences
    Zaki, MJ
    MACHINE LEARNING, 2001, 42 (1-2) : 31 - 60
  • [10] Mining Weighted Frequent Closed Episodes over Multiple Sequences
    Liao, Guoqiong
    Yang, Xiaoting
    Xie, Sihong
    Yu, Philip S.
    Wan, Changxuan
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2018, 25 (02) : 510 - 518