Implementation of an Improved Algorithm for Frequent Itemset Mining using Hadoop

Authors
Agarwal, Ruchi [1]
Singh, Sunny [1]
Vats, Satvik [1]
Affiliations
[1] Sharda Univ, Dept Comp Sci & Engn, Plot 32, 34, Knowledge Pk 3, Greater Noida, UP, India
Keywords
Frequent Item-set Mining; Big Data; MapReduce; Distributed Computational Environment
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Finding frequent item-sets in large, heterogeneous databases in minimal time is considered one of the most important data mining problems. Various algorithms have been proposed to speed up execution. Most recently proposed algorithms focus on parallelizing the workload across a large number of machines in a distributed computational environment such as the MapReduce framework. A few of them can determine the appropriate number of computing machines required, taking workload balancing and execution efficiency into account. However, they cannot determine in advance the exact number of iterations needed to find the frequent item-sets in a large dataset using iterative sampling. In this paper, we propose an improved and compact algorithm (ICA) for finding frequent item-sets in minimal time using a distributed computational environment. It can also determine the exact number of internal iterations required for any large dataset, whether the data is structured or unstructured.
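The abstract does not describe ICA itself, so the following is only a minimal, single-machine sketch of the general idea behind MapReduce-style frequent item-set counting, written in Python. It uses a plain level-wise (Apriori-like) enumeration rather than the paper's algorithm, and the function names `map_phase`, `reduce_phase`, and `frequent_itemsets` are hypothetical. Note that this sketch only discovers the number of passes by running them, which is the kind of limitation the abstract says ICA is meant to address.

```python
from collections import Counter
from itertools import combinations


def map_phase(transactions, k):
    """Emit (itemset, 1) for every k-item subset of each transaction."""
    for transaction in transactions:
        # Sort and de-duplicate so the same item-set always has the same key.
        for itemset in combinations(sorted(set(transaction)), k):
            yield itemset, 1


def reduce_phase(pairs, min_support):
    """Sum the counts per item-set and keep those meeting min_support."""
    counts = Counter()
    for itemset, count in pairs:
        counts[itemset] += count
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


def frequent_itemsets(transactions, min_support):
    """Run one map/reduce pass per item-set size until none is frequent.

    Illustrative only: this is NOT the ICA algorithm from the paper.
    """
    result = {}
    k = 1
    while True:
        frequent = reduce_phase(map_phase(transactions, k), min_support)
        if not frequent:
            break
        result.update(frequent)
        k += 1
    return result


if __name__ == "__main__":
    # Toy data invented for this example.
    baskets = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b"]]
    print(frequent_itemsets(baskets, min_support=2))
```

In a real Hadoop job the map and reduce functions would run on separate nodes and the grouping by key would be done by the framework's shuffle step; here a `Counter` stands in for that step.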
Pages: 13-18
Number of pages: 6