Fast algorithms for frequent itemset mining using FP-trees

被引:334
|
作者
Grahne, G [1 ]
Zhu, JF [1 ]
机构
[1] Concordia Univ, Dept Comp Sci, Montreal, PQ H3G 1M8, Canada
关键词
data mining; association rules;
D O I
10.1109/TKDE.2005.166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient algorithms for mining frequent itemsets are crucial for mining association rules as well as for many other data mining tasks. Methods for mining frequent itemsets have been implemented using a prefix-tree structure, known as an FP-tree, for storing compressed information about frequent itemsets. Numerous experimental results have demonstrated that these algorithms perform extremely well. In this paper, we present a novel FP-array technique that greatly reduces the need to traverse FP-trees, thus obtaining significantly improved performance for FP-tree-based algorithms. Our technique works especially well for sparse data sets. Furthermore, we present new algorithms for mining all, maximal, and closed frequent itemsets. Our algorithms use the FP-tree data structure in combination with the FP-array technique efficiently and incorporate various optimization techniques. We also present experimental results comparing our methods with existing algorithms. The results show that our methods are the fastest for many cases. Even though the algorithms consume much memory when the data sets are sparse, they are still the fastest ones when the minimum support is low. Moreover, they are always among the fastest algorithms and consume less memory than other methods when the data sets are dense.
引用
下载
收藏
页码:1347 / 1362
页数:16
相关论文
共 50 条
  • [31] Single Scan Polynomial Algorithms for Frequent Itemset Mining in Big Databases
    Djenouri, Youcef
    Djenouri, Djamel
    Lin, Jerry Chun-Wei
    Belhadi, Asma
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 1453 - 1460
  • [32] Frequent Itemset Mining in Big Data With Effective Single Scan Algorithms
    Djenouri, Youcef
    Djenouri, Djamel
    Lin, Jerry Chun-Wei
    Belhadi, Asma
    IEEE ACCESS, 2018, 6 : 68013 - 68026
  • [33] Frequent itemset mining using cellular learning automata
    Sohrabi, Mohammad Karim
    Roshani, Reza
    COMPUTERS IN HUMAN BEHAVIOR, 2017, 68 : 244 - 253
  • [34] Parallel frequent itemset mining using systolic arrays
    Sohrabi, Mohammad Karim
    Barforoush, Ahmad Abdollahzadeh
    KNOWLEDGE-BASED SYSTEMS, 2013, 37 : 462 - 471
  • [35] Efficiently Using Matrix in Mining Maximum Frequent Itemset
    Liu Zhen-yu
    Xu Wei-xiang
    Liu Xumin
    THIRD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING: WKDD 2010, PROCEEDINGS, 2010, : 50 - 54
  • [36] Recommendation using Frequent Itemset Mining in Big Data
    Kunjachan, Honeytta
    Hareesh, M. J.
    Sreedevi, K. M.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 561 - 566
  • [37] Improving frequent itemset mining algorithms performance using efficient implementation techniques: A benchmark solution
    Bashir, Shariq
    Shuaib, Muhammad
    Sultan, Yasir
    Baig, A. Rauf
    SECOND INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES 2006, PROCEEDINGS, 2006, : 257 - 262
  • [38] Probabilistic Support Prediction: Fast Frequent Itemset Mining in Dense Data
    Sadeequllah, Muhammad
    Rauf, Azhar
    Rehman, Saif Ur
    Alnazzawi, Noha
    IEEE ACCESS, 2024, 12 : 39330 - 39350
  • [39] A Review of the Maximal Frequent Itemset Mining Algorithms over Dynamically Changed Data
    Li, Haifeng
    Proceedings of the 2016 International Symposium on Advances in Electrical, Electronics and Computer Engineering (ISAEECE), 2016, 69 : 346 - 350
  • [40] Parallelization of Frequent Itemset Mining Methods with FP-tree: An Experiment with PrePost+ Algorithm
    Jamsheela, Olakara
    Gopalakrishna, Raju
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2021, 18 (02) : 208 - 213