Research on Improved Apriori Algorithm based on MapReduce and HBase

被引:0
|
作者
Feng, Dongyu [1 ,2 ]
Zhu, Ligu [1 ,2 ]
Zhang, Lei [1 ,2 ]
机构
[1] Commun Univ China, Coll Comp, Beijing, Peoples R China
[2] Beijng Key Lab Big Data Secur & Protect Ind, Beijing, Peoples R China
关键词
Apriori; MapReduce; HBase; data mining; association rules;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to improve the efficiency of Apriori algorithm for mining frequent item sets, MH-Apriori algorithm was designed for big data to address the poor efficiency problem. MH-Apriori takes advantages of MapReduce and HBase together to optimize Apriori algorithm. Compared with the improved Apriori algorithm simply based on MapReduce framework, timestamp of HBase is utilized in this algorithm to avoid generating a large number of key/value pairs. It saves the pattern matching time and scans the database only once. Also, to obtain transaction marks automatically, transaction mark column is added to set list for computing support numbers. MH-Apriori was executed on Hadoop platform. The experimental results show that MH-Apriori has higher efficiency and scalability.
引用
收藏
页码:887 / 891
页数:5
相关论文
共 50 条
  • [1] Research on Improved Apriori Algorithm Based on Coding and MapReduce
    Guo, Jian
    Ren, Yong-gong
    [J]. 2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 294 - 299
  • [2] Apriori Parallel Improved Algorithm Based on MapReduce Distributed Architecture
    She Xiangyang
    Zhang Ling
    [J]. PROCEEDINGS OF 2016 SIXTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2016), 2016, : 517 - 521
  • [3] Apriori Algorithm Optimization Study Based on MapReduce
    Li Chunqing
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 1466 - 1470
  • [4] Parallel implementation of Apriori algorithm based on MapReduce
    Li N.
    Zeng L.
    He Q.
    Shi Z.
    [J]. International Journal of Networked and Distributed Computing, 2013, 1 (2) : 89 - 96
  • [5] Parallel Implementation of Apriori Algorithm Based on MapReduce
    Li, Ning
    Zeng, Li
    He, Qing
    Shi, Zhongzhi
    [J]. INTERNATIONAL JOURNAL OF NETWORKED AND DISTRIBUTED COMPUTING, 2013, 1 (02) : 89 - 96
  • [6] The Research of Improved Apriori Algorithm
    Bi Xujing
    Xu Weixiang
    [J]. PROCEEDINGS OF 2ND CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCE (LISS 2012), VOLS 1 AND 2, 2013,
  • [7] The Research of Improved Apriori Algorithm
    Liao Zhenyun
    Fu Xiufen
    Wang Yaguang
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2179 - 2184
  • [8] Frequent Itemset Mining using Improved Apriori Algorithm with MapReduce
    Tribhuvan, Seema A.
    Gavai, Nitin R.
    Vasgi, Bharti P.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [9] Research and application of improved Apriori algorithm based on matrix
    Liu, Yuan
    Lou, Yuansheng
    [J]. MECHANICAL COMPONENTS AND CONTROL ENGINEERING III, 2014, 668-669 : 1102 - 1105
  • [10] The Parallel Improved Apriori Algorithm Research Based on Spark
    Yang, Shaosong
    Xu, Guoyan
    Wang, Zhijian
    Zhou, Fachao
    [J]. 2015 NINTH INTERNATIONAL CONFERENCE ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY FCST 2015, 2015, : 353 - 358