Distributed FP-ARMH Algorithm in Hadoop Map Reduce Framework

被引:0
|
作者
Natarajan, Surendar [1 ]
Sehar, Sountharrajan [2 ]
机构
[1] Bannari Amman Inst Technol, Comp Sci & Engn, Sathyamangalam, Tamil Nadu, India
[2] Bannari Amman Inst Technol, Sathyamangalam, Tamil Nadu, India
关键词
Data Mining; Distributed Computing; Hadoop; Map Reduce; FREQUENT; MIDDLEWARE; PATTERNS;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Evolution of Cloud computing technology over the Internet and drastic increase in data size and intensity (Big Data) persuade Map Reduce and distributed file systems like HDFS (Hadoop Distributed File System) as the paradigm of choice for distributed data mining applications. With size and complexity of data growing every day, distributed data mining algorithms has to be designed to handle Big Data in compatible with the latest technology available on distributed computing. Earlier research activities in data mining comprises, focus on increasing the performance for single task computing algorithms rather than distributed computing which would provide more fast and scalable environment for processing large datasets. Existing algorithms in the field of distributed frequent pattern data mining includes, TPFP-tree, BTP tree, and CARM. But these algorithms suffer from unbalanced workload management among its clusters. In this paper, a novel algorithm, named Association rule mining based on Hadoop (ARMH) has been proposed to utilize the clusters effectively and mining frequent pattern from large databases. Hadoop distributed framework helps in managing the workload among the clusters. The ARMH was implemented in hadoop using Map Reduce programming paradigm.
引用
收藏
页码:264 / 270
页数:7
相关论文
共 50 条
  • [1] A New Scheduling Algorithm in Hadoop Map Reduce
    Peng, Zhiping
    Ma, Yanchun
    [J]. EMERGING RESEARCH IN ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, 2011, 237 : 537 - +
  • [2] Map-optimize-reduce: CAN tree assisted FP-growth algorithm for clusters based FP mining on Hadoop
    Ragaventhiran, J.
    Kavithadevi, M. K.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 103 : 111 - 122
  • [3] Distributed Algorithm for Computing Formal Concepts Using Map-Reduce Framework
    Krajca, Petr
    Vychodil, Vilem
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS VIII, PROCEEDINGS, 2009, 5772 : 333 - 344
  • [4] An Efficient Improved Join Algorithm Using Map Reduce in Hadoop
    Patel, Warish D.
    Vaghela, Dineshkumar B.
    [J]. 2014 INTERNATIONAL CONFERENCE ON SIGNAL PROPAGATION AND COMPUTER TECHNOLOGY (ICSPCT 2014), 2014, : 263 - 272
  • [5] Prediction of Protein Structures Using a Map-Reduce Hadoop Framework Based Simulated Annealing Algorithm
    Li, Hui
    Liu, Chunmei
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [6] Referential DNA Data Compression using Hadoop Map Reduce Framework
    Bhukya, Raju
    Deshmuk, Sumit
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (02) : 207 - 214
  • [7] Improved Resource Exploitation by Combining Hadoop Map Reduce Framework with VirtualBox
    Kaur, Ramanpal
    Kaur, Harjeet
    Dhamija, Archu
    [J]. INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, INDIA 2016, 2016, 433 : 41 - 49
  • [8] Impact of Map-Reduce framework on Hadoop and Spark MR Application Performance
    Lagwankar, Ishaan
    Sankaranarayanan, Ananth Narayan
    Kalambur, Subramaniam
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2763 - 2772
  • [9] Big Data Analytics using Hadoop Map Reduce Framework and Data Migration Process
    Bante, Payal M.
    Rajeswari, K.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [10] TCM Constitution Analysis Method Based on Parallel FP-Growth Algorithm in Hadoop Framework
    Li, Mingzheng
    Lv, Xiaojuan
    Liu, Ye
    Wang, Lin
    Song, Jianqiang
    [J]. JOURNAL OF HEALTHCARE ENGINEERING, 2022, 2022