Mining Algorithm for Association Rules in Big Data Based on Hadoop

Cited by: 9
Authors
Fu, Chunhua [1]
Wang, Xiaojing [1]
Zhang, Lijun [1]
Qiao, Liying [1]
Affiliations
[1] China Agr Means Prod Assoc, Beijing, Peoples R China
Keywords
Conference; Data mining; association rules; Hadoop; FP-Growth;
DOI
10.1063/1.5033699
Chinese Library Classification
O59 [Applied Physics];
Abstract
Traditional association rule mining algorithms can no longer meet the efficiency and scalability demands of mining large volumes of data. Taking FP-Growth as an example, the algorithm is parallelized on the Hadoop framework using the MapReduce model, and then improved with a transaction reduction method to further raise mining efficiency. Experiments on a Hadoop cluster verify the parallel mining results, compare serial and parallel efficiency, and measure how mining time varies with the number of nodes and with the data volume. The results show that the parallel FP-Growth implementation mines frequent itemsets accurately, with better performance and scalability. It better meets the requirements of big data mining and can efficiently mine frequent itemsets and association rules from large datasets.
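As a rough illustration of the first pass described in the abstract, the sketch below counts item support in a map/reduce style and then prunes infrequent items from each transaction before any FP-tree would be built. It runs in a single process rather than on Hadoop, and the function names, the toy transactions, and the pruning rule itself are assumptions made for illustration; the abstract does not specify how the authors' transaction reduction method works.

```python
from collections import Counter
from itertools import chain


def map_count(transaction):
    """Map step: emit (item, 1) once per distinct item in a transaction."""
    return [(item, 1) for item in set(transaction)]


def reduce_count(pairs):
    """Reduce step: sum the counts emitted for each item."""
    totals = Counter()
    for item, n in pairs:
        totals[item] += n
    return totals


def reduce_transactions(transactions, min_support):
    """Drop infrequent items and order the rest by descending support.

    Assumed pruning rule (not taken from the paper): an item is kept
    only if it appears in at least `min_support` transactions.
    """
    counts = reduce_count(chain.from_iterable(map_count(t) for t in transactions))
    frequent = {item for item, n in counts.items() if n >= min_support}
    reduced = []
    for t in transactions:
        # Ties in support are broken alphabetically so the order is stable.
        kept = sorted((i for i in set(t) if i in frequent),
                      key=lambda i: (-counts[i], i))
        if kept:  # transactions left empty are dropped entirely
            reduced.append(kept)
    return counts, reduced


# Hypothetical toy data, not from the paper's experiments.
transactions = [
    ["bread", "milk"],
    ["bread", "diapers", "beer", "eggs"],
    ["milk", "diapers", "beer", "cola"],
    ["bread", "milk", "diapers", "beer"],
    ["bread", "milk", "diapers", "cola"],
]
counts, reduced = reduce_transactions(transactions, min_support=3)
for t in reduced:
    print(t)
```

In a real Hadoop job the map and reduce functions would run on separate nodes over partitions of the transaction file; shrinking each transaction up front is one plausible way to cut the data shuffled in later passes.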
Pages: 6