Efficient mining of long frequent patterns from very large dense datasets

Citations: 0
Authors
Gopalan, RP [1]
Sucahyo, YG [1]
Affiliation
[1] Curtin Univ Technol, Dept Comp, Bentley, WA 6102, Australia
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Discovering association rules that identify relationships among sets of items in a transaction database is an important problem in data mining. Finding frequent itemsets has been an active research area because it is the crucial step in association rule discovery. However, efficiently mining frequent itemsets from dense datasets remains a challenging problem. In this paper, we describe a new and more efficient algorithm, CT-GIN, for mining the complete set of frequent itemsets from dense datasets. The algorithm uses a compact prefix tree to represent transaction data succinctly and an item group intersection method to extract frequent itemsets from the tree efficiently. Performance comparisons show that our algorithm outperforms the fastest Apriori algorithm, Eclat and FP-Growth on several widely used test datasets. CT-GIN has also been extended for mining very large datasets, and we present test results showing its scalability.
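The abstract does not give the details of CT-GIN, so the following is only a minimal sketch of the general idea of storing transactions in a prefix tree and reading itemset supports from it. It is not the authors' algorithm: the class `Node`, the functions `build_prefix_tree`, `support` and `frequent_itemsets`, and the toy data are all hypothetical, and the plain tree traversal used here for support counting stands in for the item group intersection method, which this record does not describe.

```python
from collections import Counter


class Node:
    """One prefix-tree node: an item, a transaction count, and children by item."""

    def __init__(self, item=None):
        self.item = item
        self.count = 0
        self.children = {}


def build_prefix_tree(transactions, min_support):
    """Insert each transaction, restricted to frequent items and sorted by
    descending frequency, so that shared prefixes collapse into shared nodes."""
    freq = Counter(item for t in transactions for item in set(t))
    ordered = sorted(freq.items(), key=lambda kv: (-kv[1], kv[0]))
    rank = {item: r for r, (item, c) in enumerate(ordered) if c >= min_support}
    root = Node()
    for t in transactions:
        node = root
        for item in sorted((i for i in set(t) if i in rank), key=rank.get):
            node = node.children.setdefault(item, Node(item))
            node.count += 1
    return root, rank


def support(root, itemset, rank):
    """Count the transactions that contain every item of `itemset`."""
    target = sorted(itemset, key=rank.get)

    def walk(node, idx):
        total = 0
        for child in node.children.values():
            if child.item == target[idx]:
                # Last wanted item matched: every transaction through here counts.
                total += child.count if idx == len(target) - 1 else walk(child, idx + 1)
            elif rank[child.item] < rank[target[idx]]:
                # The wanted item can still occur deeper on this path.
                total += walk(child, idx)
            # Otherwise the path has passed the wanted item's position: prune.
        return total

    return walk(root, 0)


def frequent_itemsets(root, rank, min_support):
    """Depth-first enumeration of all itemsets whose support meets the threshold."""
    items = sorted(rank, key=rank.get)
    result = {}

    def extend(prefix, start):
        for k in range(start, len(items)):
            candidate = prefix + (items[k],)
            s = support(root, candidate, rank)
            if s >= min_support:
                result[candidate] = s
                extend(candidate, k + 1)

    extend((), 0)
    return result
```

With the toy transactions `[["a","b","c"], ["a","b"], ["a","c"], ["b","c"], ["a","b","c","d"]]` and a minimum support of 3, the sketch reports the three single items with support 4 and the three pairs with support 3, and rejects `("a","b","c")`, which occurs in only two transactions. Dense datasets make many transactions share long prefixes, which is why a prefix tree can be much smaller than the raw transaction list.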
Pages: 652-661
Page count: 10