Index-BitTableFI: An improved algorithm for mining frequent itemsets

被引:91
|
作者
Song, Wei [1 ]
Yang, Bingru [2 ]
Xu, Zhangyan [3 ]
机构
[1] N China Univ Technol, Coll Informat Engn, Beijing 100144, Peoples R China
[2] Univ Sci & Technol, Sch Informat Engn, Beijing 100083, Taipei, Peoples R China
[3] Guanxi Normal Univ, Dept Comp, Guilin 541004, Peoples R China
关键词
data mining; association rule; frequent itemset; BitTable; index array; subsume index;
D O I
10.1016/j.knosys.2008.03.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient algorithms for mining frequent itemsets are crucial for mining association rules as well as for many other data mining tasks. Methods for mining frequent itemsets have been implemented using a BitTable structure. BitTableFI is such a recently proposed efficient BitTable-based algorithm, which exploits BitTable both horizontally and vertically. Although making use of efficient bit wise operations, BitTableFI still may suffer from the high cost of candidate generation and test. To address this problem, a new algorithm Index-BitTableFI is proposed. Index-BitTableFI also uses BitTable horizontally and vertically. To make use of BitTable horizontally, index array and the corresponding computing method are proposed. By computing the subsume index, those itemsets that co-occurrence with representative item can be identified quickly by using breadth-first search at one time. Then, for the resulting itemsets generated through the index array, depth-first search strategy is used to generate all other frequent itemsets. Thus, the hybrid search is implemented, and the search space is reduced greatly. The advantages of the proposed methods are as follows. On the one hand, the redundant operations on intersection of tidsets and frequency-checking can be avoided greatly; On the other hand, it is proved that frequent itemsets, including representative item and having the same supports as representative item, can be identified directly by connecting the representative item with all the combinations of items in its subsume index. Thus, the cost for processing this kind of itemsets is lowered, and the efficiency is improved. Experimental results show that the proposed algorithm is efficient especially for dense datasets. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:507 / 513
页数:7
相关论文
共 50 条
  • [21] A fast Algorithm for mining fuzzy frequent itemsets
    Lin, Jerry Chun-Wei
    Li, Ting
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 29 (06) : 2373 - 2379
  • [22] An efficient algorithm for mining frequent closed itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    [J]. Informatica (Slovenia), 2015, 39 (01): : 87 - 98
  • [23] An Efficient Algorithm for Mining Frequent Closed Itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2015, 39 (01): : 87 - 98
  • [24] Improved algorithm for parallel mining collaborative frequent itemsets in multiple data streams
    Liu, Fang'ai
    Wang, Qianqian
    Wang, Xin
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3): : S6133 - S6141
  • [25] Improved algorithm for parallel mining collaborative frequent itemsets in multiple data streams
    Fang’ai Liu
    Qianqian Wang
    Xin Wang
    [J]. Cluster Computing, 2019, 22 : 6133 - 6141
  • [26] MREclat: an Algorithm for Parallel Mining Frequent Itemsets
    Zhang, Zhigang
    Ji, Genlin
    Tang, Mengmeng
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 177 - 180
  • [27] New algorithm of mining frequent closed itemsets
    School of Computer and Information Technology, Liaoning Normal University, Dalian 116029, China
    [J]. J. Southeast Univ. Engl. Ed., 2008, 3 (335-338):
  • [28] An Algorithm of Mining Frequent Itemsets in Pervasive Computing
    Teng, Shaohua
    Su, Jiangyu
    Zhang, Wei
    Fu, Xiufen
    Chen, Shuqing
    [J]. 2008 3RD INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND APPLICATIONS, VOLS 1 AND 2, 2008, : 561 - 565
  • [29] A fast parallel algorithm for frequent itemsets mining
    Souliou, Dora
    Pagourtzis, Aris
    Tsanakas, Panayiotis
    [J]. ARTIFICIAL INTELLIGENCE AND INNOVATIONS 2007: FROM THEORY TO APPLICATIONS, 2007, : 213 - +
  • [30] A parallel Apriori algorithm for frequent itemsets mining
    Ye, Yanbin
    Chiang, Chia-Chu
    [J]. FOURTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS, PROCEEDINGS, 2006, : 87 - +