BitTableFI: An efficient mining frequent itemsets algorithm

Cited by: 86
Authors
Dong, Jie [1]
Han, Min [1]
Affiliations
[1] Dalian Univ Technol, Sch Elect & Informat Engn, Dalian 116023, Peoples R China
Keywords
data mining; frequent itemsets; BitTable; database compressing
DOI
10.1016/j.knosys.2006.08.005
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Mining frequent itemsets in transaction databases, time-series databases and many other kinds of databases is an important task that has been widely studied in data mining research. The problem can be solved by first constructing a set of candidate itemsets and then identifying the itemsets within this candidate set that meet the frequent itemset requirement. Most previous research focuses on pruning to reduce the number of candidate itemsets and the number of database scans. However, many algorithms adopt an Apriori-like approach to candidate itemset generation and support counting, which is the most time-consuming part of the process. To address this issue, the paper proposes an effective algorithm named BitTableFI. In the algorithm, a special data structure, BitTable, is used horizontally and vertically to compress the database for quick candidate itemset generation and quick support counting, respectively. The technique can also be used in many Apriori-like algorithms to improve their performance. Experiments with both synthetic and real databases show that BitTableFI outperforms Apriori and CBAR, which uses ClusterTable for quick support counting. © 2006 Elsevier B.V. All rights reserved.
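The abstract does not give the layout of BitTable, so the following is only a minimal sketch of the general idea of counting support with vertical bit vectors, not the authors' implementation. Each item is mapped to a Python integer whose bit t is set when transaction t contains the item; the support of an itemset is then the number of set bits in the bitwise AND of its items' bit vectors. Candidate generation here is the standard Apriori join-and-prune step rather than the paper's horizontal BitTable technique, and all function names (`build_vertical_bitmaps`, `support`, `frequent_itemsets`) are hypothetical.

```python
from itertools import combinations


def build_vertical_bitmaps(transactions):
    """Map each item to an int whose bit t is set when transaction t contains it."""
    bitmaps = {}
    for tid, transaction in enumerate(transactions):
        for item in transaction:
            bitmaps[item] = bitmaps.get(item, 0) | (1 << tid)
    return bitmaps


def support(itemset, bitmaps, n_transactions):
    """AND the bit vectors of all items, then count the set bits."""
    mask = (1 << n_transactions) - 1  # start with every transaction selected
    for item in itemset:
        mask &= bitmaps.get(item, 0)
    return bin(mask).count("1")


def frequent_itemsets(transactions, min_support):
    """Level-wise (Apriori-like) search that uses bitmap support counts.

    Items must be mutually comparable (e.g. all strings) so that itemsets
    can be kept as sorted tuples.
    """
    bitmaps = build_vertical_bitmaps(transactions)
    n = len(transactions)
    current = sorted(
        (item,) for item in bitmaps
        if support((item,), bitmaps, n) >= min_support
    )
    result = {}
    while current:
        for itemset in current:
            result[itemset] = support(itemset, bitmaps, n)
        frequent = set(current)
        candidates = set()
        for a, b in combinations(current, 2):
            # Join step: merge two k-itemsets that share their first k-1 items.
            if a[:-1] == b[:-1]:
                cand = a + (b[-1],)
                # Prune step: every k-subset of the candidate must be frequent.
                if all(sub in frequent for sub in combinations(cand, len(a))):
                    candidates.add(cand)
        current = sorted(
            c for c in candidates if support(c, bitmaps, n) >= min_support
        )
    return result


if __name__ == "__main__":
    # Toy database, made up for illustration only.
    db = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
    for itemset, count in sorted(frequent_itemsets(db, min_support=3).items()):
        print(itemset, count)
```

In this sketch the database is read once to build the bit vectors, after which every support count is a handful of integer AND operations instead of another pass over the transactions.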
Pages: 329-335
Page count: 7
Related papers
50 in total
  • [1] Index-BitTableFI: An improved algorithm for mining frequent itemsets
    Song, Wei
    Yang, Bingru
    Xu, Zhangyan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2008, 21 (06) : 507 - 513
  • [2] An efficient algorithm for mining frequent closed itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    [J]. Informatica (Slovenia), 2015, 39 (01) : 87 - 98
  • [3] An Efficient Algorithm for Mining Frequent Closed Itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2015, 39 (01) : 87 - 98
  • [4] An efficient algorithm of frequent itemsets mining based on MapReduce
    Wang, Le
    Feng, Lin
    Zhang, Jing
    Liao, Pengyu
    [J]. Journal of Information and Computational Science, 2014, 11 (08) : 2809 - 2816
  • [5] GenMax: An Efficient Algorithm for Mining Maximal Frequent Itemsets
    Karam Gouda
    Mohammed J. Zaki
    [J]. Data Mining and Knowledge Discovery, 2005, 11 : 223 - 242
  • [6] An efficient algorithm for incrementally mining frequent closed itemsets
    Yen, Show-Jane
    Lee, Yue-Shi
    Wang, Chiu-Kuang
    [J]. APPLIED INTELLIGENCE, 2014, 40 (04) : 649 - 668
  • [7] An efficient algorithm for incrementally mining frequent closed itemsets
    Show-Jane Yen
    Yue-Shi Lee
    Chiu-Kuang Wang
    [J]. Applied Intelligence, 2014, 40 : 649 - 668
  • [8] An Efficient Algorithm for Mining Frequent Itemsets with Single Constraint
    Hai Duong
    Tin Truong
    Bac Le
    [J]. ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 367 - 378
  • [9] negFIN: An efficient algorithm for fast mining frequent itemsets
    Aryabarzan, Nader
    Minaei-Bidgoli, Behrouz
    Teshnehlab, Mohammad
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 105 : 129 - 143
  • [10] PNPFI: An Efficient Parallel Frequent Itemsets Mining Algorithm
    Zhang, Fang
    Zhang, Yu
    Liao, Xiaofei
    Jin, Hai
    [J]. PROCEEDINGS OF THE 2018 IEEE 22ND INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2018 : 172 - 177