Efficient mining of long frequent patterns from very large dense datasets

被引:0
|
作者
Gopalan, RP [1 ]
Sucahyo, YG [1 ]
机构
[1] Curtin Univ Technol, Dept Comp, Bentley, WA 6102, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering association rules that identify relationships among sets of items in a transaction database is an important problem in Data Mining. Finding frequent itemsets has been an active research area since it is the crucial step in association rule discovery. However, efficiently mining frequent itemsets from dense datasets is still a challenging problem. In this paper, we describe a new and more efficient algorithm named CT-GIN for mining complete frequent itemsets from dense datasets. The algorithm uses a compact prefix tree for succinctly representing transaction data and an item group intersection method for efficient extraction of frequent itemsets from the tree. Performance comparisons show that our algorithm outperforms the fastest Apriori algorithm, Eclat and FP-Growth, on several widely used test data sets. CT-GIN has been extended for mining very large datasets, and we also present test results showing its scalability.
引用
收藏
页码:652 / 661
页数:10
相关论文
共 50 条
  • [1] Efficiently mining frequent patterns from dense datasets using a cluster of computers
    Sucahyo, YG
    Gopalan, RP
    Rudra, A
    [J]. AI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2903 : 233 - 244
  • [2] Efficient Algorithms for Mining Frequent Patterns from Sparse and Dense Databases
    Vu, Lan
    Alaghband, Gita
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2015, 24 (02) : 181 - 197
  • [3] Mining frequent tree-like patterns in large datasets
    Chen, TS
    Hsu, SC
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2005, 3453 : 561 - 567
  • [4] Mining frequent tree-like patterns in large datasets
    Chen, Tzung-Shi
    Hsu, Shih-Chun
    [J]. DATA & KNOWLEDGE ENGINEERING, 2007, 62 (01) : 65 - 83
  • [5] EIFDD: An efficient approach for erasable itemset mining of very dense datasets
    Giang Nguyen
    Tuong Le
    Bay Vo
    Bac Le
    [J]. Applied Intelligence, 2015, 43 : 85 - 94
  • [6] EIFDD: An efficient approach for erasable itemset mining of very dense datasets
    Giang Nguyen
    Tuong Le
    Bay Vo
    Bac Le
    [J]. APPLIED INTELLIGENCE, 2015, 43 (01) : 85 - 94
  • [7] MapFIM: Memory Aware Parallelized Frequent Itemset Mining in Very Large Datasets
    Duong, Khanh-Chuong
    Bamha, Mostafa
    Giacometti, Arnaud
    Li, Dominique
    Soulet, Arnaud
    Vrain, Christel
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT I, 2017, 10438 : 478 - 495
  • [8] HDminer: Efficient Mining of High Dimensional Frequent Closed Patterns from Dense Data
    Xu, Jianpeng
    Ji, Shufan
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 1061 - 1067
  • [9] Efficient Frequent Itemset Mining from Dense Data Streams
    Cuzzocrea, Alfredo
    Jiang, Fan
    Lee, Wookey
    Leung, Carson K.
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 593 - 601
  • [10] Efficient mining of temporal traversal patterns from very large Web logs
    Chen, ZX
    [J]. DMIN '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON DATA MINING, 2005, : 10 - 16