HDFS Framework for Efficient Frequent Itemset Mining Using MapReduce

被引:0
|
作者
Kulkarni, Prajakta G. [1 ]
Khonde, Shraddha R. [1 ]
机构
[1] Modern Educ Soc Coll Engn, Dept Comp Engn, Pune, Maharashtra, India
关键词
Association rule mining; frequent item sets; mappers; reducers; load balancing; Modified Apriori Algorithm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Association rule mining is a very essential data mining technique in different fields. The enormous development of the information needs increased computational power. To address this issue, it is important to study executions of mining algorithms. To find out the frequent itemsets is an essential and vital issue in numerous information mining applications. There are many algorithms present to extract frequent itemsets like Apriori and FP-Growth. But these algorithms lack properties like parallelization, load balancing, data distribution, and fault tolerance on large clusters or big data. A Modified Apriori method is introduced here in which, the mappers and reducers will work simultaneously. This method uses three MapReduce to calculate frequent itemset. The third MapReduce is used to decompose itemsets and gives the final result. In this paper a new scheme or algorithm is proposed that will reduce the execution time for the massive database and works efficiently on number of nodes by using Modified Apriori algorithm.
引用
收藏
页码:171 / 178
页数:8
相关论文
共 50 条
  • [1] New approach in Big Data Mining for frequent itemset using mapreduce in HDFS
    Nikam, Pallavi V.
    Deshpande, Deepa S.
    [J]. 2018 3RD INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [2] A Parallel Algorithm for Approximate Frequent Itemset Mining using MapReduce
    Fumarola, Fabio
    Malerba, Donato
    [J]. 2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2014, : 335 - 342
  • [3] Frequent Itemset Mining using Improved Apriori Algorithm with MapReduce
    Tribhuvan, Seema A.
    Gavai, Nitin R.
    Vasgi, Bharti P.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [4] ParallelCharMax: An Effective Maximal Frequent Itemset Mining Algorithm Based on MapReduce Framework
    Gahar, Rania Mkhinini
    Arfaoui, Olfa
    Sassi Hidri, Minyar
    Ben Hadj-Alouane, Nejib
    [J]. 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 571 - 578
  • [5] MapReduce-based Closed Frequent Itemset Mining with Efficient Redundancy Filtering
    Wang, Su-Qi
    Yang, Yu-Bin
    Chen, Guang-Peng
    Gao, Yang
    Zhang, Yao
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 449 - 453
  • [6] SmartCache: An Optimized MapReduce Implementation of Frequent Itemset Mining
    Huang, Dachuan
    Song, Yang
    Routray, Ramani
    Qin, Feng
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2015), 2015, : 16 - 25
  • [7] Weighted Frequent Multi Partitioned Itemset Mining of Market-Basket Data using MapReduce on YARN Framework
    Bisoyi, Sudhanshu Shekhar
    Mishra, Pragnyaban
    Mishra, S. N.
    [J]. PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ICT IN BUSINESS INDUSTRY & GOVERNMENT (ICTBIG), 2016,
  • [8] An efficient frequent itemset mining algorithm
    Luo, Ke
    Zhang, Xue-Mao
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 756 - 761
  • [9] MrFIM: A MapReduce Approach for Frequent Itemset Mining in Big Data
    Rahman, Abdul
    Manjaramkar, Arati
    [J]. 2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [10] MapReduce Based Frequent Itemset Mining Algorithm on Stream Data
    Chaudhary, Hemant
    Yadav, Deepak Kumar
    Bhatnagar, Rajat
    Chandrasekhar, Uddagiri
    [J]. 2015 GLOBAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (GCCT), 2015, : 586 - 591