OPTIMIZATION AND REALIZATION OF PARALLEL FREQUENT ITEM SET MINING ALGORITHM

被引:0
|
作者
Yuan, Ling [1 ]
Li, Dan [1 ]
Chen, Yuzhong [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci, Wuhan 430074, Peoples R China
关键词
Data Mining; Frequent item sets; Candidate Item Sets; Key-value pairs;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Associative data mining is the research hotspot in the field of big data, and frequent item sets mining is an important step in the analysis of associative data. This paper focuses on analyzing the frequent item sets mining algorithm based on Apriori parallel algorithm. The paper has found two shortages of Apriori parallel algorithm: one is that the key value pair are too many, another is that in the combiner stage, it occupies two much memory. Therefore, we propose an optimized algorithm. In the optimization algorithm, candidate item sets and local count information are saved in memory, greatly reducing the number of generated keys. Meanwhile, in the short length frequent item sets mining, the method of reducing the number of scanning transaction data without generating candidate item sets can improve the algorithm efficiency. We do the experiments in the Hadoop platform to testify the performance of the proposed optimized algorithm. The experiments demonstrate that the time and I/O of the optimized algorithm have been improved greatly, compared with the non-optimized algorithm.
引用
收藏
页码:546 / 551
页数:6
相关论文
共 50 条
  • [31] Parallel Frequent Patterns Mining Algorithm on GPU
    Zhou, Jiayi
    Yu, Kun-Ming
    Wu, Bin-Chang
    2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [32] Parallel algorithm for mining frequent closed sequences
    Ma, CX
    Li, QH
    AUTONOMOUS INTELLIGENT SYSTEMS: AGENTS AND DATA MINING, PROCEEDINGS, 2005, 3505 : 184 - 192
  • [33] MREclat: an Algorithm for Parallel Mining Frequent Itemsets
    Zhang, Zhigang
    Ji, Genlin
    Tang, Mengmeng
    2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 177 - 180
  • [34] Parallelized Frequent Item Set Mining Using a Tall and Skinny Matrix
    Janakiram, D. Pooja
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 8 - 13
  • [35] Algorithm of Frequent Item Sets Mining Based on Index Table
    Zhang Lin
    Yao Nanzhen
    Zhang Jianli
    MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 1076 - +
  • [36] Design and Implementation of Improved Algorithm for Frequent Item Sets Mining
    Zhang Lin
    Zhang Jianli
    PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1696 - 1698
  • [37] Frequent Item Set Mining of Large Datasets Using CUDA Computing
    Karthik, Peddi
    Banu, J. Saira
    SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2018, VOL 2, 2020, 1057 : 739 - 747
  • [38] Compressing Neural Networks by Applying Frequent Item-Set Mining
    Dou, Zi-Yi
    Huang, Shu-Jian
    Su, Yi-Fan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 696 - 704
  • [39] Data Elimination Based Technique for Mining Frequent Closed Item Set
    Ahuja, Kamlesh
    Jain, Sarika
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ICT IN BUSINESS INDUSTRY & GOVERNMENT (ICTBIG), 2016,
  • [40] Algorithm for mining frequent itemsets with item constraint based on partition
    Chen, Hui-Ping
    Zhu, Feng
    Wang, Jian-Dong
    Zhou, Xiao-Qin
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2006, 28 (07): : 1082 - 1086