Improving Performance of Heterogeneous MapReduce Clusters with Adaptive Task Tuning

被引:61
|
作者
Cheng, Dazhao [1 ]
Rao, Jia [2 ]
Guo, Yanfei [4 ]
Jiang, Changjun [3 ]
Zhou, Xiaobo [2 ]
机构
[1] Univ North Carolina Charlotte, Dept Comp Sci, Charlotte, NC 28223 USA
[2] Univ Colorado, Dept Comp Sci, Colorado Springs, CO 80918 USA
[3] Tongji Univ, Dept Comp Sci & Technol, 4800 Caoan Rd, Shanghai 201804, Peoples R China
[4] Argonne Natl Lab, Lemont, IL 60439 USA
基金
美国国家科学基金会;
关键词
MapReduce performance improvement; self-adaptive task tuning; heterogeneous clusters; genetic algorithm;
D O I
10.1109/TPDS.2016.2594765
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Datacenter-scale clusters are evolving toward heterogeneous hardware architectures due to continuous server replacement. Meanwhile, datacenters are commonly shared by many users for quite different uses. It often exhibits significant performance heterogeneity due to multi-tenant interferences. The deployment of MapReduce on such heterogeneous clusters presents significant challenges in achieving good application performance compared to in-house dedicated clusters. As most MapReduce implementations are originally designed for homogeneous environments, heterogeneity can cause significant performance deterioration in job execution despite existing optimizations on task scheduling and load balancing. In this paper, we observe that the homogeneous configuration of tasks on heterogeneous nodes can be an important source of load imbalance and thus cause poor performance. Tasks should be customized with different configurations to match the capabilities of heterogeneous nodes. To this end, we propose a self-adaptive task tuning approach, Ant, that automatically searches the optimal configurations for individual tasks running on different nodes. In a heterogeneous cluster, Ant first divides nodes into a number of homogeneous subclusters based on their hardware configurations. It then treats each subcluster as a homogeneous cluster and independently applies the self-tuning algorithm to them. Ant finally configures tasks with randomly selected configurations and gradually improves tasks configurations by reproducing the configurations from best performing tasks and discarding poor performing configurations. To accelerate task tuning and avoid trapping in local optimum, Ant uses genetic algorithm during adaptive task configuration. Experimental results on a heterogeneous physical cluster with varying hardware capabilities show that Ant improves the average job completion time by 31, 20, and 14 percent compared to stock Hadoop (Stock), customized Hadoop with industry recommendations (Heuristic), and a profilingbased configuration approach (Starfish), respectively. Furthermore, we extend Ant to virtual MapReduce clusters in a multi-tenant private cloud. Specifically, Ant characterizes a virtual node based on two measured performance statistics: I/O rate and CPU steal time. It uses k-means clustering algorithm to classify virtual nodes into configuration groups based on the measured dynamic interference. Experimental results on virtual clusters with varying interferences show that Ant improves the average job completion time by 20, 15, and 11 percent compared to Stock, Heuristic and Starfish, respectively.
引用
收藏
页码:774 / 786
页数:13
相关论文
共 50 条
  • [1] Improving MapReduce Performance in Heterogeneous Environments with Adaptive Task Tuning
    Cheng, Dazhao
    Rao, Jia
    Guo, Yanfei
    Zhou, Xiaobo
    [J]. ACM/IFIP/USENIX MIDDLEWARE 2014, 2014, : 97 - 108
  • [2] Improving Hadoop MapReduce performance on heterogeneous single board computer clusters☆
    Lim, Sooyoung
    Park, Dongchul
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 160 : 752 - 766
  • [3] Energy Utilization Task Scheduling for MapReduce in Heterogeneous Clusters
    Wang, Jia
    Li, Xiaoping
    Ruiz, Ruben
    Yang, Jie
    Chu, Dianhui
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (02) : 931 - 944
  • [4] Design adaptive task allocation scheduler to improve MapReduce performance in heterogeneous clouds
    Yang, Shin-Jer
    Chen, Yi-Ru
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2015, 57 : 61 - 70
  • [5] MrHeter: improving MapReduce performance in heterogeneous environments
    Zhang, Xiao
    Wu, Yanjun
    Zhao, Chen
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 1691 - 1701
  • [6] MrHeter: improving MapReduce performance in heterogeneous environments
    Xiao Zhang
    Yanjun Wu
    Chen Zhao
    [J]. Cluster Computing, 2016, 19 : 1691 - 1701
  • [7] Configuring A MapReduce Framework For Performance-Heterogeneous Clusters
    Hartog, Jessica
    DelValle, Renan
    Govindaraju, Madhusudhan
    Lewis, Michael J.
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 120 - 127
  • [8] Improving MapReduce Performance in a Heterogeneous Cloud: A Measurement Study
    Zhao, Xu
    Liu, Ling
    Zhang, Qi
    Dong, Xiaoshe
    [J]. 2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 401 - 408
  • [9] Task failure resilience technique for improving the performance of MapReduce in Hadoop
    Kavitha, C.
    Anita, X.
    [J]. ETRI JOURNAL, 2020, 42 (05) : 751 - 763
  • [10] Improving MapReduce Performance by Data Prefetching in Heterogeneous or Shared Environments
    Gu, Tao
    Zuo, Chuang
    Liao, Qun
    Yang, Yulu
    Li, Tao
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2013, 6 (05): : 71 - 81