BitTableFI: An efficient mining frequent itemsets algorithm

被引:86
|
作者
Dong, Jie [1 ]
Han, Min [1 ]
机构
[1] Dalian Univ Technol, Sch Elect & Informat Engn, Dalian 116023, Peoples R China
关键词
data mining; frequent itemsets; BitTable; database compressing;
D O I
10.1016/j.knosys.2006.08.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining frequent itemsets in transaction databases, time-series databases and many other kinds of databases is an important task and has been studied popularly in data mining research. The problem of mining frequent itemsets can be solved by constructing a candidate set of itemsets first, and then, identifying those itemsets that meet the frequent itemset requirement within this candidate set. Most of the previous research mainly focuses on pruning to reduce the candidate itemsets amounts and the times of scanning databases. However, many algorithms adopt an Apriori-like candidate itemsets generation and support count approach that is the most time-wasted process. To address this issue, the paper proposes an effective algorithm named as BitTableFI. In the algorithm, a special data structure BitTable is used horizontally and vertically to compress database for quick candidate itemsets generation and support count, respectively. The algorithm can also be used in many Apriori-like algorithms to improve the performance. Experiments with both synthetic and real databases show that BitTableFI outperforms Apriori and CBAR which uses ClusterTable for quick support count. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:329 / 335
页数:7
相关论文
共 50 条
  • [41] FIAST: A Novel Algorithm for Mining Frequent Itemsets
    Duemong, Fudailah
    Preechaveerakul, Ladda
    Vanichayobon, Sirirut
    [J]. INTERNATIONAL CONFERENCE ON FUTURE COMPUTER AND COMMUNICATIONS, PROCEEDINGS, 2009, : 140 - 144
  • [42] The Algorithm of Mining Frequent Itemsets Based on MapReduce
    He, Bo
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND ENGINEERING APPLICATION, ICSCTEA 2013, 2014, 250 : 529 - 534
  • [43] Scalable algorithm for mining maximal frequent itemsets
    Li, QH
    Wang, H
    He, Y
    Jiang, SY
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 143 - 146
  • [44] A fast Algorithm for mining fuzzy frequent itemsets
    Lin, Jerry Chun-Wei
    Li, Ting
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 29 (06) : 2373 - 2379
  • [45] Improved algorithm for mining frequent closed itemsets
    Song, Wei
    Yang, Bingru
    Xu, Zhangyan
    Gao, Jing
    [J]. 2008, Science Press, 18,Shuangqing Street,Haidian, Beijing, 100085, China (45):
  • [46] An efficient approach for interactive mining of frequent itemsets
    Deng, ZH
    Li, X
    Tang, SW
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 138 - 149
  • [47] MREclat: an Algorithm for Parallel Mining Frequent Itemsets
    Zhang, Zhigang
    Ji, Genlin
    Tang, Mengmeng
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 177 - 180
  • [48] New algorithm of mining frequent closed itemsets
    School of Computer and Information Technology, Liaoning Normal University, Dalian 116029, China
    [J]. J. Southeast Univ. Engl. Ed., 2008, 3 (335-338):
  • [49] An Algorithm of Mining Frequent Itemsets in Pervasive Computing
    Teng, Shaohua
    Su, Jiangyu
    Zhang, Wei
    Fu, Xiufen
    Chen, Shuqing
    [J]. 2008 3RD INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND APPLICATIONS, VOLS 1 AND 2, 2008, : 561 - 565
  • [50] A fast parallel algorithm for frequent itemsets mining
    Souliou, Dora
    Pagourtzis, Aris
    Tsanakas, Panayiotis
    [J]. ARTIFICIAL INTELLIGENCE AND INNOVATIONS 2007: FROM THEORY TO APPLICATIONS, 2007, : 213 - +