Residual Traffic Based Task Scheduling in Hadoop

被引:0
|
作者
Tanaka, Daichi [1 ]
Kawarasaki, Masatoshi [2 ]
机构
[1] Univ Tsukuba, Grad Sch Lib Informat & Media Studies, Tsukuba, Ibaraki, Japan
[2] Univ Tsukuba, Fac Lib Informat & Media Sci, Tsukuba, Ibaraki, Japan
关键词
distributed computing; Hadoop; MapReduce; job performance; network simulation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In Hadoop job processing, it is reported that a large amount of data transfer significantly influences job performance. In this paper, we clarify that the cause of performance deterioration in the CPU (Central Processing Unit) heterogeneous environment is the delay of copy phase due to the heavy load in the inter rack links of the cluster network. Thus, we propose a new scheduling method-Residual Traffic Based Task Scheduling-that estimates the amount of inter rack data transfer in the copy phase and regulates task assignment accordingly. We evaluate the scheduling method by using ns-3 (network simulator-3) and show that it can improve Hadoop job performance significantly.
引用
收藏
页码:94 / 102
页数:9
相关论文
共 50 条
  • [21] Network Traffic Analysis Based on Hadoop
    Yang, Jie
    He, Haiyang
    Qiao, Yuanyuan
    [J]. 2014 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, VEHICULAR TECHNOLOGY, INFORMATION THEORY AND AEROSPACE & ELECTRONIC SYSTEMS (VITAE), 2014,
  • [22] The Optimization of Hadoop Scheduling Algorithms on Distributed System for Processing Traffic Information
    Sun, Weizhen
    Wang, Xiujin
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND ENGINEERING APPLICATION, ICSCTEA 2013, 2014, 250 : 389 - 396
  • [23] Improved Particle Optimization Algorithm Solving Hadoop Task Scheduling Problem
    Xu, Jun
    Tang, Yong
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND COGNITIVE INFORMATICS, 2015, : 11 - 14
  • [24] A Priority Task Scheduling Algorithm based on Residual Energy in EH-WSNs
    Li, Wuyungerile
    Gao, Haode
    Liu, Yingcong
    Jia, Bing
    Huang, Baoqi
    [J]. 2020 16TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2020), 2020, : 43 - 48
  • [25] HFSP: Size-based Scheduling for Hadoop
    Pastorelli, Mario
    Barbuzzi, Antonio
    Carra, Damiano
    Dell'Amico, Matteo
    Michiardi, Pietro
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [26] Offline traffic analysis system based on Hadoop
    QIAO Yuan-yuan
    LEI Zhen-ming
    YUAN Lun
    GUO Min-jie
    [J]. The Journal of China Universities of Posts and Telecommunications, 2013, (05) : 97 - 103
  • [27] Offline traffic analysis system based on Hadoop
    QIAO Yuanyuan
    LEI Zhenming
    YUAN Lun
    GUO Minjie
    [J]. TheJournalofChinaUniversitiesofPostsandTelecommunications, 2013, 20 (05) : 97 - 103
  • [28] The bandwidth-aware backup task scheduling strategy using SDN in Hadoop
    Shang, Fengjun
    Chen, Xuanling
    Yan, Chenyun
    Li, Luzhong
    Zhao, Yuting
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3): : S5975 - S5985
  • [29] The bandwidth-aware backup task scheduling strategy using SDN in Hadoop
    Fengjun Shang
    Xuanling Chen
    Chenyun Yan
    Luzhong Li
    Yuting Zhao
    [J]. Cluster Computing, 2019, 22 : 5975 - 5985
  • [30] An Enhanced Data-Locality-Aware Task Scheduling Algorithm for Hadoop Applications
    Choi, Dongjoo
    Jeon, Myunghoon
    Kim, Namgi
    Lee, Byoung-Dai
    [J]. IEEE SYSTEMS JOURNAL, 2018, 12 (04): : 3346 - 3357