Efficient mining of long frequent patterns from very large dense datasets

被引:0
|
作者
Gopalan, RP [1 ]
Sucahyo, YG [1 ]
机构
[1] Curtin Univ Technol, Dept Comp, Bentley, WA 6102, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering association rules that identify relationships among sets of items in a transaction database is an important problem in Data Mining. Finding frequent itemsets has been an active research area since it is the crucial step in association rule discovery. However, efficiently mining frequent itemsets from dense datasets is still a challenging problem. In this paper, we describe a new and more efficient algorithm named CT-GIN for mining complete frequent itemsets from dense datasets. The algorithm uses a compact prefix tree for succinctly representing transaction data and an item group intersection method for efficient extraction of frequent itemsets from the tree. Performance comparisons show that our algorithm outperforms the fastest Apriori algorithm, Eclat and FP-Growth, on several widely used test data sets. CT-GIN has been extended for mining very large datasets, and we also present test results showing its scalability.
引用
收藏
页码:652 / 661
页数:10
相关论文
共 50 条
  • [21] Effectively and efficiently mining frequent patterns from dense graph streams on disk
    Braun, Peter
    Cameron, Juan J.
    Cuzzocrea, Alfredo
    Jiang, Fan
    Leung, Carson K.
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 18TH ANNUAL CONFERENCE, KES-2014, 2014, 35 : 338 - 347
  • [22] Compressed hierarchical mining of frequent closed patterns from dense data sets
    Ji, Liping
    Tan, Kian-Lee
    Tung, Anthony K. H.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (09) : 1175 - 1187
  • [23] Linear and sublinear time algorithms for mining frequent traversal path patterns from very large web logs
    Chen, ZX
    Fowler, RH
    Fu, AWC
    Wang, CY
    [J]. SEVENTH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2003, : 117 - 122
  • [24] Efficient Mining of Frequent Patterns on Uncertain Graphs
    Chen, Yifan
    Zhao, Xiang
    Lin, Xuemin
    Wang, Yang
    Guo, Deke
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (02) : 287 - 300
  • [25] Efficient indexing structures for mining frequent patterns
    Bin, L
    Ooi, BC
    Tan, KL
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 453 - 462
  • [26] Mining frequent approximate patterns in large networks
    Driss, Kaouthar
    Boulila, Wadii
    Leborgne, Aurelie
    Gancarski, Pierre
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2021, 31 (03) : 1265 - 1279
  • [27] Mining frequent δ-free patterns in large databases
    Hébert, C
    Crémilleux, B
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2005, 3735 : 124 - 136
  • [28] Efficient Algorithms for Mining Erasable Closed Patterns From Product Datasets
    Vo, Bay
    Le, Tuong
    Nguyen, Giang
    Hong, Tzung-Pei
    [J]. IEEE ACCESS, 2017, 5 : 3111 - 3120
  • [29] Efficient algorithms for mining maximal frequent concatenate sequences in biological datasets
    Pan, J
    Wang, P
    Wang, W
    Shi, B
    Yang, GX
    [J]. FIFTH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - PROCEEDINGS, 2005, : 98 - 104
  • [30] Frequent Item Set Mining of Large Datasets Using CUDA Computing
    Karthik, Peddi
    Banu, J. Saira
    [J]. SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2018, VOL 2, 2020, 1057 : 739 - 747