Fast algorithms for frequent itemset mining using FP-trees

被引:334
|
作者
Grahne, G [1 ]
Zhu, JF [1 ]
机构
[1] Concordia Univ, Dept Comp Sci, Montreal, PQ H3G 1M8, Canada
关键词
data mining; association rules;
D O I
10.1109/TKDE.2005.166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient algorithms for mining frequent itemsets are crucial for mining association rules as well as for many other data mining tasks. Methods for mining frequent itemsets have been implemented using a prefix-tree structure, known as an FP-tree, for storing compressed information about frequent itemsets. Numerous experimental results have demonstrated that these algorithms perform extremely well. In this paper, we present a novel FP-array technique that greatly reduces the need to traverse FP-trees, thus obtaining significantly improved performance for FP-tree-based algorithms. Our technique works especially well for sparse data sets. Furthermore, we present new algorithms for mining all, maximal, and closed frequent itemsets. Our algorithms use the FP-tree data structure in combination with the FP-array technique efficiently and incorporate various optimization techniques. We also present experimental results comparing our methods with existing algorithms. The results show that our methods are the fastest for many cases. Even though the algorithms consume much memory when the data sets are sparse, they are still the fastest ones when the minimum support is low. Moreover, they are always among the fastest algorithms and consume less memory than other methods when the data sets are dense.
引用
下载
收藏
页码:1347 / 1362
页数:16
相关论文
共 50 条
  • [1] Mining φ-Frequent Itemset Using FP-Tree
    李天瑞
    Railway Engineering Science, 2001, (01) : 67 - 74
  • [2] Linguistic data mining with fuzzy FP-trees
    Lin, Chun-Wei
    Hong, Tzung-Pei
    Lu, Wen-Hsiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (06) : 4560 - 4567
  • [3] Fast Algorithms for Frequent Itemset Mining from Uncertain Data
    Leung, Carson Kai-Sang
    MacKinnon, Richard Kyle
    Tanbeer, Syed K.
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 893 - 898
  • [4] Fuzzy Data Mining Based on the Compressed Fuzzy FP-trees
    Lin, Chun-Wei
    Hong, Tzung-Pei
    Lu, Wen-Hsiang
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 1068 - +
  • [5] Parallel Architecture for Implementation of Frequent Itemset Mining Using FP-Growth
    Tehreem, Amna
    Khawaja, Sajid Gul
    Akram, Muhammad Usman
    Khan, Shoab A.
    Ali, Muhammad
    2017 INTERNATIONAL CONFERENCE ON SIGNALS AND SYSTEMS (ICSIGSYS), 2017, : 92 - 98
  • [6] Mining Closed Frequent Itemset based on FP-Tree
    Li, Shengwei
    Li, Lingsheng
    Han, Chong
    2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 354 - 357
  • [7] Algorithms for frequent itemset mining: a literature review
    Chee, Chin-Hoong
    Jaafar, Jafreezal
    Aziz, Izzatdin Abdul
    Hasan, Mohd Hilmi
    Yeoh, William
    ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (04) : 2603 - 2621
  • [8] Frequent Itemset Mining Algorithms :A Literature Survey
    Jamsheela, O.
    Raju, G.
    2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 1099 - 1104
  • [9] Algorithms for frequent itemset mining: a literature review
    Chin-Hoong Chee
    Jafreezal Jaafar
    Izzatdin Abdul Aziz
    Mohd Hilmi Hasan
    William Yeoh
    Artificial Intelligence Review, 2019, 52 : 2603 - 2621
  • [10] Top-k Frequent Itemsets via Differentially Private FP-trees
    Lee, Jaewoo
    Clifton, Christopher W.
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 931 - 940