Energy-Efficient Task Scheduling for CPU-Intensive Streaming Jobs on Hadoop

Cited by: 20
Authors
Jin, Peiquan [1, 2]
Hao, Xingjun [1]
Wang, Xiaoliang [1]
Yue, Lihua [1, 2]
Affiliations
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[2] Chinese Acad Sci, Key Lab Electromagnet Space Informat, Hefei 230027, Anhui, Peoples R China
Funding
U.S. National Science Foundation
Keywords
Energy efficiency; scheduling algorithms; Hadoop; YARN; MapReduce; servers; power
DOI
10.1109/TPDS.2018.2881176
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Hadoop, especially Hadoop 2.0, has been a dominant framework for real-time big data processing. However, Hadoop is not optimized for energy efficiency. To address this problem, we propose a new framework that improves the energy efficiency of Hadoop 2.0. We focus on the resource manager in Hadoop 2.0, namely YARN, and propose energy-efficient task scheduling mechanisms on YARN. In particular, we focus on CPU-intensive streaming jobs and classify them into two types: batch streaming jobs (i.e., a set of jobs submitted simultaneously) and online streaming jobs (i.e., jobs submitted continuously, one by one). We devise a different energy-efficient task scheduling algorithm for each type. Specifically, we first build an abstract model of performance and energy consumption that considers the characteristics of tasks as well as the computational resources in YARN. Based on this model, we study the energy efficiency of streaming tasks, which combines the performance model and the energy consumption model of a task. We propose two key principles for improving energy efficiency: 1) CPU-usage-aware task allocation, which partitions tasks among NodeManagers (NMs) based on their CPU usage characteristics; and 2) resource-efficient task allocation, which reduces idle resources. We then propose a D-based binning algorithm for batch task scheduling and a K-based binning algorithm for online task scheduling that can adapt to continuously arriving tasks. We conduct extensive experiments on a real Hadoop 2.0 cluster and use two kinds of workloads to evaluate the performance and energy efficiency of our proposal. Compared with Storm (the streaming data processing tool in Hadoop 2.0) and other approaches, including TAPA and DVFS-MR, our proposal is more energy efficient. The batch task scheduling algorithm reduces energy consumption by up to 10 percent while maintaining comparable performance. In addition, the online task scheduling algorithm reduces energy consumption by up to 7 percent compared with the existing algorithms.
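
The abstract does not define the D-based or K-based binning algorithms, so the following is only a generic illustration of the second principle (packing tasks tightly so that fewer nodes carry idle capacity). It is a minimal first-fit-decreasing bin-packing sketch, not the authors' method. The names Node and first_fit_decreasing, the single CPU-percentage demand per task, and the uniform node capacity are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """Hypothetical stand-in for one NodeManager with a fixed CPU capacity."""
    capacity: int                      # assumed unit: percent of one node's CPU
    tasks: List[int] = field(default_factory=list)

    @property
    def used(self) -> int:
        return sum(self.tasks)


def first_fit_decreasing(cpu_demands: List[int], capacity: int) -> List[Node]:
    """Pack tasks onto as few nodes as a first-fit-decreasing pass finds.

    Generic bin packing, shown only to illustrate "reduce idle resources";
    it is NOT the D-based or K-based binning algorithm from the paper.
    """
    nodes: List[Node] = []
    # Place the most CPU-hungry tasks first.
    for demand in sorted(cpu_demands, reverse=True):
        if demand > capacity:
            raise ValueError(f"task demand {demand} exceeds node capacity {capacity}")
        for node in nodes:
            # Use the first node that still has room for this task.
            if node.used + demand <= capacity:
                node.tasks.append(demand)
                break
        else:
            # No existing node fits, so open a new one.
            nodes.append(Node(capacity, [demand]))
    return nodes


if __name__ == "__main__":
    for i, node in enumerate(first_fit_decreasing([50, 70, 20, 30, 30], 100)):
        print(i, node.tasks, node.used)
```

In this toy input the five tasks fit on two fully used nodes instead of being spread across more, partly idle ones. An online variant would have to place each task as it arrives rather than sorting the whole batch first.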
Pages: 1298-1311
Number of pages: 14