LaSA: A Locality-aware Scheduling Algorithm for Hadoop-MapReduce Resource Assignment

被引:0
|
作者
Chen, Tseng-Yi [1 ]
Wei, Hsin-Wen [2 ]
Wei, Ming-Feng [1 ]
Chen, Ying-Jie [1 ]
Hsu, Tsan-Sheng [3 ]
Shih, Wei-Kuan [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30043, Taiwan
[2] Tamkang Univ, Dept Management Informat, Taipei, Taiwan
[3] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
关键词
Cloud computing; hadoop; mapreduce; data locality; distributed;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cloud computing has become more popular for a decade; it has been under continuous development with advances in architecture, software, and network. Hadoop-MapReduce is a common software framework processing parallelizable problem across big datasets using a distributed cluster of processors or stand-alone computers. Cloud Hadoop-MapReduce can scale incrementally in the number of processing nodes. Hence, the Hadoop-MapReduce is designed to provide a processing platform with powerful computation. Network traffic is always a most important bottleneck in data-intensive computing and network latency decreases significant performance in data parallel systems. Network bottleneck is caused by network bandwidth and the network speed is much slower than disk data access. So that, good data locality can reduces network traffic and increases performance in data-intensive HPC systems. However, Hadoop's scheduler has a defect of data locality in resource assignment. In this paper, we present a locality-aware scheduling algorithm (LaSA) for Hadoop-MapReduce scheduler. Firstly, we propose a mathematical model of weight of data interference in Hadoop scheduler. Secondly, we present the LaSA algorithm to use weight of data interference to provide data locality-aware resource assignment in Hadoop scheduler. Finally, we build an experimental environment with 3 cluster and 35 VMs to verify the LaSA's performance.
引用
收藏
页码:342 / 346
页数:5
相关论文
共 50 条
  • [1] BOLAS plus : Scalable Lightweight Locality-aware Scheduling for Hadoop
    Gao, Shengli
    Xue, Ruini
    [J]. 2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1077 - 1084
  • [2] Locality-aware and energy-aware job pre-assignment for mapreduce
    Chen, Lei
    Zhang, Jing
    Deng, Ziyun
    Cai, Lijun
    He, Tinqing
    Wang, XuAn
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS), 2016, : 59 - 65
  • [3] Hadoop-MapReduce Job Scheduling Algorithms Survey
    Mohamed, Ehab
    Hong, Zheng
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2016, : 237 - 242
  • [4] Locality-aware and load-balanced static task scheduling for MapReduce
    Selvitopi, Oguz
    Demirci, Gunduz Vehbi
    Turk, Ata
    Aykanat, Cevdet
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 90 : 49 - 61
  • [5] BOLAS: Bipartite-graph Oriented Locality-Aware Scheduling for MapReduce Tasks
    Xue, Ruini
    Gao, Shengli
    Ao, Lixiang
    Guan, Zhongyang
    [J]. 2015 14TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2015, : 37 - 45
  • [6] Similarity-based Node Distance Exploring and Locality-aware Shuffle Optimization for Hadoop MapReduce
    Wang, Jihe
    Wang, Danghui
    Zhang, Meng
    Qiu, Meikang
    Guo, Bing
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2017, : 103 - 108
  • [7] An Locality-Aware Scheduling Based on a Novel Scheduling Model to Improve System Throughput of MapReduce Cluster
    Zhao, Hui
    Yang, Shuqiang
    Chen, Zhikun
    Yin, Hong
    Jin, Songchang
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 111 - 115
  • [8] Locality-Aware Mapping and Scheduling for Multicores
    Ding, Wei
    Zhang, Yuanrui
    Kandemir, Mahmut
    Srinivas, Jithendra
    Yedlapalli, Praveen
    [J]. PROCEEDINGS OF THE 2013 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2013, : 335 - 346
  • [9] Data locality-aware and QoS-aware dynamic cloud workflow scheduling in Hadoop for heterogeneous environment
    Ding, Fan
    Ma, Minjin
    [J]. INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2023, 19 (01) : 113 - 135
  • [10] TaskTracker Aware Scheduling for Hadoop MapReduce
    Manjaly, Jisha S.
    Chooralil, Varghese S.
    [J]. 2013 THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS (ICACC 2013), 2013, : 278 - 281