Efficient mining frequent itemsets algorithms

Authors
Marghny H. Mohamed
Mohammed M. Darwieesh
Affiliations
[1] Assiut University,Faculty of Computers and Information
[2] Assiut University,Mathematics Department, Faculty of Science
Keywords
Association rule mining; Frequent; Apriori; Count table; Efficient
DOI
Not available
Abstract
Efficient algorithms for mining frequent itemsets are crucial for mining association rules as well as for many other data mining tasks. It is well known that the count table (countTable) is one of the most important facilities for exploiting the subset property to compress the transaction database into a new, smaller representation of item occurrences. One of the biggest problems with this technique is the cost of candidate generation and test processing, which are the two most important steps in finding association rules. In this paper, we develop this method further so that the costly candidate-generation-and-test processing is avoided completely. Moreover, the proposed methods compress crucial information about all itemsets, the maximal-length frequent itemsets, and the minimal-length frequent itemsets, and they avoid expensive, repeated database scans. The proposed algorithms, named CountTableFI and BinaryCountTableF, differ significantly from Apriori and from all other algorithms extended from Apriori. The idea behind them lies in the representation of the transactions: all transactions are represented as binary and decimal numbers, which makes it simple and fast to apply the subset and identical-set properties. A comprehensive performance study shows that our techniques are efficient and scalable compared with other methods.
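The binary-representation idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' CountTableFI or BinaryCountTableF implementation, which the abstract does not specify; it only shows, under stated assumptions, how encoding each transaction as an integer bitmask lets identical transactions be merged into one count-table row and lets a subset test be done with a single bitwise AND. The function names (encode, build_count_table, support) and the sample transactions are hypothetical.

```python
from collections import Counter


def encode(itemset, item_index):
    """Encode a set of items as an integer bitmask (one bit per item)."""
    mask = 0
    for item in itemset:
        mask |= 1 << item_index[item]
    return mask


def build_count_table(transactions):
    """Merge identical transactions into one row: bitmask -> occurrence count.

    Assumption: this is only an illustrative stand-in for a count table.
    """
    items = sorted({item for t in transactions for item in t})
    item_index = {item: i for i, item in enumerate(items)}
    table = Counter(encode(t, item_index) for t in transactions)
    return table, item_index


def support(itemset, table, item_index):
    """Count transactions containing itemset, using a bitwise subset test."""
    mask = encode(itemset, item_index)
    # itemset is a subset of a transaction iff (t_mask & mask) == mask
    return sum(n for t_mask, n in table.items() if t_mask & mask == mask)


# Hypothetical sample data, not taken from the paper.
transactions = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "b"},
    {"b", "c"},
    {"a", "c"},
]
table, item_index = build_count_table(transactions)
print(len(table))                             # distinct transactions
print(support({"a", "b"}, table, item_index)) # transactions containing a and b
```

Because the two identical transactions {a, b} share one row, the support computation iterates over distinct bitmasks rather than over every transaction in the database.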
Pages: 823-833
Number of pages: 10