VPCH: A Consistent Hashing Algorithm for Better Load Balancing in a Hadoop Environment

Cited by: 4
Authors
Liu, Qi [1]
Cai, Weidong [1]
Shen, Jian [1]
Wang, Baowei [1]
Fu, Zhangjie [1]
Linge, Nigel [2]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Dept Comp & Software, Nanjing, Jiangsu, Peoples R China
[2] Univ Salford, Salford, Lancs, England
Keywords
Map Reduce; Load Balancing; Consistent Hashing; PERFORMANCE; MAPREDUCE;
DOI
10.1109/CBD.2015.21
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
MapReduce (MR) is a popular programming model for processing large data sets across clusters or grids, e.g. in a Hadoop environment. Load balancing, a key factor affecting the performance of map resource distribution, has recently attracted considerable optimization effort. Current MR implementations distribute tasks to clusters using hashing with random modulo operations, which can lead to uneven data distribution and skewed loads and thereby degrade the performance of the entire distributed system. In this paper, a virtual partition consistent hashing (VPCH) algorithm is proposed for the reduce stage of MR processes in order to achieve a more balanced job allocation. According to the results, the method reduces task execution time whether or not the MJR (mapreduce.job.reduce.slowstart.completedmaps) parameter is set.
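The contrast the abstract draws between modulo-based partitioning and consistent hashing with virtual partitions can be illustrated with a short sketch. This is not the authors' VPCH algorithm, whose details are not given in this record; it is a generic consistent hash ring with virtual nodes, written in Python for brevity. The names `stable_hash`, `modulo_partition`, `ConsistentHashRing`, and the `virtual_nodes` parameter are hypothetical and chosen only for illustration.

```python
import bisect
import hashlib
from collections import Counter


def stable_hash(key: str) -> int:
    """Deterministic 64-bit hash of a string (hypothetical helper)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def modulo_partition(key: str, num_reducers: int) -> int:
    """Baseline: assign a key to a reducer by hash modulo reducer count."""
    return stable_hash(key) % num_reducers


class ConsistentHashRing:
    """Generic consistent hash ring with virtual nodes.

    Assumption: each reducer is placed on the ring `virtual_nodes` times;
    a key goes to the owner of the first ring point after the key's hash.
    """

    def __init__(self, num_reducers: int, virtual_nodes: int = 100):
        points = []
        for reducer in range(num_reducers):
            for v in range(virtual_nodes):
                points.append((stable_hash(f"reducer-{reducer}#vn-{v}"), reducer))
        points.sort()
        self._hashes = [h for h, _ in points]
        self._owners = [r for _, r in points]

    def partition(self, key: str) -> int:
        # Find the first ring point strictly after the key, wrapping around.
        idx = bisect.bisect_right(self._hashes, stable_hash(key))
        return self._owners[idx % len(self._owners)]


if __name__ == "__main__":
    keys = [f"key-{i}" for i in range(10_000)]
    ring = ConsistentHashRing(num_reducers=4)
    print("modulo:", sorted(Counter(modulo_partition(k, 4) for k in keys).items()))
    print("ring:  ", sorted(Counter(ring.partition(k) for k in keys).items()))
```

One property worth noting in this sketch: when a reducer is added, a key either keeps its previous owner or moves to the new reducer, whereas with modulo partitioning most keys change owner when the reducer count changes.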
Pages: 69-72
Page count: 4
Related papers
50 in total
  • [1] Adaptive Load-Balancing for Consistent Hashing in Heterogeneous Clusters
    Srinivasan, Lakshminarayanan
    Varma, Vasudeva
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 1135 - 1138
  • [2] Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment
    Fu, Weiyu
    Wang, Lixia
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [3] Node Performance Load Balancing Algorithm for Hadoop Cluster
    Vinutha, D. C.
    Raju, G. T.
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT SUSTAINABLE SYSTEMS (ICISS 2019), 2019, : 468 - 473
  • [4] Load balancing task scheduling algorithm in Hadoop platform
    Cai Yandong
    Liu Yan
    Zhang Qinglei
    [J]. 2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 605 - 608
  • [5] Load Balancing Framework Based on Consistency Hashing Algorithm
    Lin, Peng
    Nie, Hongmin
    Ding, Gang
    [J]. 2014 INTERNATIONAL CONFERENCE ON MECHATRONICS AND CONTROL (ICMC), 2014, : 1483 - 1486
  • [6] Benchmarking Hashing Algorithms for Load Balancing in a Distributed Database Environment
    Slesarev, Alexander
    Mikhailov, Mikhail
    Chernishev, George
    [J]. ADVANCES IN MODEL AND DATA ENGINEERING IN THE DIGITALIZATION ERA, MEDI 2022, 2022, 1751 : 105 - 118
  • [7] A Load-Balancing Algorithm for Hadoop Distributed File System
    Lin, Chi-Yi
    Lin, Ying-Chen
    [J]. PROCEEDINGS 2015 18TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2015), 2015, : 173 - 179
  • [8] An Optimization Algorithm for Heterogeneous Hadoop Clusters Based on Dynamic Load Balancing
    Yan, Wei
    Li, ChunLin
    Du, ShuMeng
    Mao, Xijun
    [J]. 2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : 250 - 255
  • [9] Saksham: Resource Aware Block Rearrangement Algorithm for Load Balancing in Hadoop
    Shah, Ankit
    Padole, Mamta
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 47 - 56
  • [10] Load-balancing performance of consistent hashing: Asymptotic analysis of random node join
    Wang, Xiaoming
    Loguinov, Dmitri
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2007, 15 (04) : 892 - 905