PISCES: Optimizing Multi-Job Application Execution in MapReduce

被引:4
|
作者
Chen, Qi [1 ]
Yao, Jinyu [1 ]
Li, Benchao [1 ]
Xiao, Zhen [1 ]
机构
[1] Peking Univ, Dept Comp Sci, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
MapReduce; job dependency; group scheduling; pipeline; OPTIMIZATION;
D O I
10.1109/TCC.2016.2603509
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, many MapReduce applications consist of groups of jobs with dependencies among each other, such as iterative machine learning applications and large database queries. Unfortunately, the MapReduce framework is not optimized for these multi-job applications. It does not explore the execution overlapping opportunities among jobs and can only schedule jobs independently. These issues significantly inflate the application execution time. This paper presents Pipeline Improvement Support with Critical chain Estimation Scheduling (PISCES), a critical chain optimization (a critical chain refers to a series of jobs which will make the application run longer if any one of them is delayed), to provide better support for multi-job applications. PISCES extends the existing MapReduce framework to allow scheduling for multiple jobs with dependencies by dynamically building up a job dependency DAG for current running jobs according to their input and output directories. Then using the dependency DAG, it provides an innovative mechanism to facilitate the data pipelining between the output phase (map phase in the Map-Only job or reduce phase in the Map-Reduce job) of an upstream job and the map phase of a downstream job. This offers a new execution overlapping between dependent jobs in MapReduce which effectively reduces the application runtime. Moreover, PISCES proposes a novel critical chain job scheduling model based on the accurate critical chain estimation. Experiments show that PISCES can increase the degree of system parallelism by up to 68 percent and improve the execution speed of applications by up to 52 percent.
引用
收藏
页码:273 / 286
页数:14
相关论文
共 50 条
  • [41] A Pareto Frontier for Optimizing Data Transfer and Job Execution in Grids
    Taheri, Javid
    Zomaya, Albert Y.
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 2130 - 2139
  • [42] Optimizing schedules in the pipeline systems with variable job execution order
    V. I. Levin
    Automation and Remote Control, 2005, 66 : 406 - 421
  • [43] Merge or Separate? Multi-job Scheduling for OpenCL Kernels on CPU/GPU Platforms
    Wen, Yuan
    O'Boyle, Michael F. P.
    PROCEEDINGS OF THE GENERAL PURPOSE GPUS (GPGPU-10), 2017, : 22 - 31
  • [44] SELECTION OF A SET OF PART DELIVERY DATES IN A MULTI-JOB STOCHASTIC ASSEMBLY SYSTEM
    DAS, SK
    SARIN, SC
    IIE TRANSACTIONS, 1988, 20 (01) : 4 - 11
  • [45] Multi-job Hadoop scheduling to process Geo-distributed big data
    Cavallo, Marco
    Di Modica, Giuseppe
    Polito, Carmelo
    Tomarchio, Orazio
    2017 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2017, : 1175 - 1181
  • [46] Forming SPN-MapReduce Model for Estimation Job Execution Time in Cloud Computing
    Chen, Ying-Jun
    Horng, Gwo-Jiun
    Cheng, Sheng-Tzong
    Wang, His-Chuan
    WIRELESS PERSONAL COMMUNICATIONS, 2017, 94 (04) : 3465 - 3493
  • [47] Forming SPN-MapReduce Model for Estimation Job Execution Time in Cloud Computing
    Ying-Jun Chen
    Gwo-Jiun Horng
    Sheng-Tzong Cheng
    His-Chuan Wang
    Wireless Personal Communications, 2017, 94 : 3465 - 3493
  • [48] DeConNet: Deep Neural Network Model to Solve the Multi-Job Assignment Problem in the Multi-Agent System
    Lee, Jungwoo
    Choi, Youngho
    Suh, Jinho
    APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [49] Optimizing Cost and Performance Trade-Offs for MapReduce Job Processing in the Cloud
    Zhang, Zhuoyao
    Cherkasova, Ludmila
    Loo, Boon Thau
    2014 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2014,
  • [50] Maximizing MapReduce job speed and reliability in the mobile cloud by optimizing task allocation
    Lee, Jin-woo
    Jang, Gwangseon
    Jung, Hohyun
    Lee, Jae-Gil
    Lee, Uichin
    PERVASIVE AND MOBILE COMPUTING, 2019, 60