OPTIMIZATION AND REALIZATION OF PARALLEL FREQUENT ITEM SET MINING ALGORITHM

Authors
Yuan, Ling [1]
Li, Dan [1]
Chen, Yuzhong [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci, Wuhan 430074, Peoples R China
Keywords
Data mining; Frequent item sets; Candidate item sets; Key-value pairs
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Association data mining is a research hotspot in the field of big data, and frequent item set mining is an important step in association analysis. This paper analyzes frequent item set mining based on the parallel Apriori algorithm and identifies two shortcomings of that algorithm: it generates too many key-value pairs, and it occupies too much memory in the combiner stage. We therefore propose an optimized algorithm in which candidate item sets and local count information are kept in memory, greatly reducing the number of generated keys. In addition, when mining short frequent item sets, the algorithm reduces the number of scans over the transaction data by not generating candidate item sets, which improves efficiency. Experiments on the Hadoop platform evaluate the performance of the optimized algorithm and show that its running time and I/O are greatly improved compared with the non-optimized algorithm.
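The two ideas in the abstract can be illustrated with a minimal single-process sketch. It is not the authors' implementation and does not use Hadoop; the function names (`naive_map`, `in_memory_map`, `reduce_counts`, `count_short_itemsets`) and the toy data are hypothetical, chosen only to show how keeping local counts in memory shrinks the number of emitted key-value pairs, and how short item sets can be counted in one scan without a candidate list.

```python
from collections import Counter
from itertools import combinations


def naive_map(transactions, candidates):
    """Emit one (candidate, 1) pair for every candidate found in every transaction."""
    for t in transactions:
        items = set(t)
        for c in candidates:
            if c <= items:
                yield c, 1


def in_memory_map(transactions, candidates):
    """Accumulate counts in a local table and emit one pair per candidate per split."""
    local = Counter()
    for t in transactions:
        items = set(t)
        for c in candidates:
            if c <= items:
                local[c] += 1
    yield from local.items()


def reduce_counts(pairs, min_support):
    """Sum the values per key and keep the item sets that reach min_support."""
    total = Counter()
    for key, value in pairs:
        total[key] += value
    return {k: v for k, v in total.items() if v >= min_support}


def count_short_itemsets(transactions, max_len=2):
    """Count all item sets of length 1..max_len in one scan, with no candidate list."""
    counts = Counter()
    for t in transactions:
        items = sorted(set(t))
        for k in range(1, max_len + 1):
            for combo in combinations(items, k):
                counts[frozenset(combo)] += 1
    return counts


# Toy input split (assumed data, for illustration only).
split = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a", "b", "c"]]
candidates = [frozenset(p) for p in combinations("abc", 2)]

naive_pairs = list(naive_map(split, candidates))
local_pairs = list(in_memory_map(split, candidates))
short_counts = count_short_itemsets(split)

print(len(naive_pairs), len(local_pairs))
```

On this toy split both mappers lead to the same reduced counts, but the in-memory version emits one pair per distinct candidate rather than one pair per occurrence. The trade-off is that the local count table must fit in the mapper's memory.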
Pages: 546-551
Number of pages: 6