Mining Algorithm for Association Rules in Big Data Based on Hadoop

Cited by: 9
Authors
Fu, Chunhua [1]
Wang, Xiaojing [1]
Zhang, Lijun [1]
Qiao, Liying [1]
Affiliations
[1] China Agr Means Prod Assoc, Beijing, Peoples R China
Keywords
Conference; Data mining; association rules; Hadoop; FP-Growth
DOI
10.1063/1.5033699
Chinese Library Classification number
O59 [Applied Physics]
Abstract
Traditional association rule mining algorithms can no longer meet the efficiency and scalability demands of mining large volumes of data. To address this, the paper takes FP-Growth as an example and parallelizes it on the Hadoop framework using the MapReduce model. On this basis, the algorithm is further improved with a transaction reduction method to raise its mining efficiency. Experiments on a Hadoop cluster evaluate both the mining results and the efficiency: they verify the parallel mining results, compare the efficiency of the serial and parallel versions, and examine how mining time varies with the number of nodes and with the data volume. The experiments show that the parallel FP-Growth implementation mines frequent itemsets accurately, with better performance and scalability. It better meets the requirements of big data mining and efficiently mines frequent itemsets and association rules from large datasets.
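The abstract does not give implementation details, so the following is only a minimal Python sketch of the general idea, not the authors' method. It simulates the item-counting pass of a MapReduce-style parallel FP-Growth in a single process and then applies one common reading of "transaction reduction": infrequent items are dropped from each transaction, and transactions left empty are discarded before any FP-tree is built. The function names (`map_count`, `reduce_count`, `reduce_transactions`), the sample shards, and the `min_support` value are all hypothetical.

```python
from collections import Counter
from itertools import chain


def map_count(shard):
    """Map step (simulated): emit (item, 1) once per transaction containing the item."""
    return [(item, 1) for txn in shard for item in set(txn)]


def reduce_count(pairs):
    """Reduce step (simulated): sum the emitted counts per item."""
    totals = Counter()
    for item, n in pairs:
        totals[item] += n
    return totals


def reduce_transactions(transactions, counts, min_support):
    """Assumed transaction reduction: keep only frequent items, ordered by
    descending support (ties broken alphabetically), and drop empty transactions."""
    frequent = {item for item, c in counts.items() if c >= min_support}
    reduced = []
    for txn in transactions:
        kept = sorted(
            (item for item in set(txn) if item in frequent),
            key=lambda item: (-counts[item], item),
        )
        if kept:
            reduced.append(kept)
    return reduced


# Hypothetical data: each inner list stands for the input split of one node.
shards = [
    [["a", "b", "c"], ["a", "b"]],
    [["a", "d"], ["e"]],
]

pairs = chain.from_iterable(map_count(shard) for shard in shards)
counts = reduce_count(pairs)

transactions = [txn for shard in shards for txn in shard]
reduced = reduce_transactions(transactions, counts, min_support=2)
print(reduced)
```

In a real Hadoop job the map and reduce functions would run on separate nodes and the reduced transactions would feed the later FP-tree construction stage; here both steps run locally only to show the data flow.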
Pages: 6