Index-BitTableFI: An improved algorithm for mining frequent itemsets

被引:91
|
作者
Song, Wei [1 ]
Yang, Bingru [2 ]
Xu, Zhangyan [3 ]
机构
[1] N China Univ Technol, Coll Informat Engn, Beijing 100144, Peoples R China
[2] Univ Sci & Technol, Sch Informat Engn, Beijing 100083, Taipei, Peoples R China
[3] Guanxi Normal Univ, Dept Comp, Guilin 541004, Peoples R China
关键词
data mining; association rule; frequent itemset; BitTable; index array; subsume index;
D O I
10.1016/j.knosys.2008.03.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient algorithms for mining frequent itemsets are crucial for mining association rules as well as for many other data mining tasks. Methods for mining frequent itemsets have been implemented using a BitTable structure. BitTableFI is such a recently proposed efficient BitTable-based algorithm, which exploits BitTable both horizontally and vertically. Although making use of efficient bit wise operations, BitTableFI still may suffer from the high cost of candidate generation and test. To address this problem, a new algorithm Index-BitTableFI is proposed. Index-BitTableFI also uses BitTable horizontally and vertically. To make use of BitTable horizontally, index array and the corresponding computing method are proposed. By computing the subsume index, those itemsets that co-occurrence with representative item can be identified quickly by using breadth-first search at one time. Then, for the resulting itemsets generated through the index array, depth-first search strategy is used to generate all other frequent itemsets. Thus, the hybrid search is implemented, and the search space is reduced greatly. The advantages of the proposed methods are as follows. On the one hand, the redundant operations on intersection of tidsets and frequency-checking can be avoided greatly; On the other hand, it is proved that frequent itemsets, including representative item and having the same supports as representative item, can be identified directly by connecting the representative item with all the combinations of items in its subsume index. Thus, the cost for processing this kind of itemsets is lowered, and the efficiency is improved. Experimental results show that the proposed algorithm is efficient especially for dense datasets. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:507 / 513
页数:7
相关论文
共 50 条
  • [1] BitTableFI: An efficient mining frequent itemsets algorithm
    Dong, Jie
    Han, Min
    [J]. KNOWLEDGE-BASED SYSTEMS, 2007, 20 (04) : 329 - 335
  • [2] An Improved Algorithm for Frequent Itemsets Mining
    Jiang, Hao
    He, Xu
    [J]. 2017 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2017, : 314 - 317
  • [3] AN IMPROVED ALGORITHM FOR MINING FREQUENT WEIGHTED ITEMSETS
    Nguyen Duy Ham
    Bay Vo
    Nguyen Thi Hong Minh
    Tzung-Pei Hong
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2579 - 2584
  • [4] Improved algorithm for mining frequent closed itemsets
    Song, Wei
    Yang, Bingru
    Xu, Zhangyan
    Gao, Jing
    [J]. 2008, Science Press, 18,Shuangqing Street,Haidian, Beijing, 100085, China (45):
  • [5] An Improved MapReduce Algorithm for Mining Closed Frequent Itemsets
    Gonen, Yaron
    Gudes, Ehud
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE SCIENCE, TECHNOLOGY AND ENGINEERING (SWSTE 2016), 2016, : 77 - 83
  • [6] DFR: A new improved algorithm for mining frequent itemsets
    Chai, Sheng
    Wang, Hai-Chun
    Qiu, Ji-Fan
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 369 - 373
  • [7] An Algorithm for Mining Frequent Itemsets
    Hernandez Leon, Raudel
    Perez Suarez, Airel
    Feregrino Uribe, Claudia
    Guzman Zavaleta, Zobeida Jezabel
    [J]. 2008 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE 2008), 2008, : 236 - +
  • [8] Mining Frequent Itemsets in Association Rule Mining Using Improved SETM Algorithm
    Hanirex, D. Kerana
    Kaliyamurthie, K. P.
    [J]. ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2015, 2016, 394 : 765 - 773
  • [9] Parallel algorithm for mining frequent itemsets
    Ruan, YL
    Liu, G
    Li, QH
    [J]. Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 2118 - 2121
  • [10] An Algorithm for Mining Frequent Closed Itemsets
    Zhang Tiejun
    Yang Junrui
    Wang Xiuqin
    [J]. 2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 240 - +