Load Balancing in Heterogeneous MapReduce Environments

被引:2
|
作者
Fan, Yuanquan [1 ]
Wu, Weiguo [1 ]
Qian, Depei [1 ]
Xu, Yunlong [1 ]
Wei, Wei [1 ]
机构
[1] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, Xian 710049, Peoples R China
关键词
MapReduce; Load Balancing; heterogeneous cluster; heterogeneity-aware partitioning;
D O I
10.1109/HPCC.and.EUC.2013.209
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce has emerged as a popular computing model for parallel processing of big data. However, we observe that the native hash partitioning of MapReduce systems leads to frequent uneven data distribution among reduce tasks. The uneven data distribution results in load imbalance among reduce tasks, and thus hampers the performance of MapReduce systems. Moreover, the heterogeneity among cluster nodes exacerbates the negative effects of uneven data distribution due to varying performance of the heterogeneous nodes. To address the above issues, in this paper, we propose a novel load balancing approach with respect to the heterogeneity of clusters. This approach consists of two components: (1) performance estimation for reducers that run on heterogeneous nodes based on history of reduce tasks, and (2) heterogeneity-aware partitioning (HAP), which reallocates the input data for reduce tasks based on the performance estimation for reducers. We implement this approach as a plug-in of current MapReduce system. Experiment results show that our approach improves the performance of MapReduce jobs that run in heterogeneous systems, and incurs little overhead.
引用
收藏
页码:1480 / 1489
页数:10
相关论文
共 50 条
  • [1] Thread migration and load balancing in heterogeneous environments
    Thitikamol, K
    Keleher, PJ
    [J]. LANGUAGES, COMPILERS, AND RUN-TIME SYSTEMS FOR SCALABLE COMPUTERS, 2000, 1915 : 260 - 271
  • [2] Load balancing in MapReduce on homogeneous and heterogeneous clusters: an in-depth review
    Kargar, Mohammad Javad
    Vakili, Meysam
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2015, 15 (2-3) : 149 - 168
  • [3] Automatic runtime load balancing of dedicated applications in heterogeneous environments
    Höfinger, S
    [J]. RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2002, 2474 : 62 - 69
  • [4] Load Balancing in Heterogeneous Cloud Environments by Using PROMETHEE Method
    Hourali, Samira
    Jamali, Shahram
    [J]. INTERNATIONAL JOURNAL OF INTEGRATED ENGINEERING, 2018, 10 (08): : 80 - 90
  • [5] HEURISTIC LOAD BALANCING FOR CFD CODES EXECUTED IN HETEROGENEOUS COMPUTING ENVIRONMENTS
    Petcu, Dana
    Vizman, Daniel
    Paprzycki, Marcin
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2006, 7 (02): : 15 - 23
  • [6] Dynamic Resource Management: Load Balancing Algorithms for Heterogeneous Cloud Environments
    Laha, Jasobanta
    Pattnaik, Sabyasachi
    Nayak, Chinmaya Kumar
    Chaudhury, Kumar Surjeet
    [J]. JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (09) : 1524 - 1533
  • [7] Deadline-Aware Load Balancing for MapReduce
    Lai, Zhao-Rong
    Chang, Che-Wei
    Liu, Xue
    Kuo, Tei-Wei
    Hsiu, Pi-Cheng
    [J]. 2014 IEEE 20TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA), 2014,
  • [8] Load Balancing in MapReduce Based on Data Locality
    Chen, Yi
    Liu, Zhaobin
    Wang, Tingting
    Wang, Lu
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT I, 2014, 8630 : 229 - 241
  • [9] Distributed Offline Load Balancing in MapReduce Networks
    Charalambous, Themistoklis
    Kalyvianaki, Evangelia
    Hadjieostis, Christoforos N.
    Johansson, Mikael
    [J]. 2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 835 - 840
  • [10] Heterogeneous Energy-aware Load Balancing for Industry 4.0 and IoT Environments
    Ahmed, Usman
    Lin, Jerry Chun-Wei
    Srivastava, Gautam
    [J]. ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2022, 13 (04)