Scheduling large-scale scientific workflow on virtual machines with different numbers of vCPUs

被引:0
|
作者
Hao Wu
Xin Chen
Xiaoyu Song
Chi Zhang
He Guo
机构
[1] Dalian University of Technology,The School of Software Technology
[2] Liaoning University of Technology,The School of Electronics and Information Engineering
[3] Portland State University,The ECE Department
来源
关键词
Cloud computing; Scientific workflow; DAG splitting; Scheduling; Cost minimization;
D O I
暂无
中图分类号
学科分类号
摘要
With the wide deployment of cloud computing in scientific computing, cost minimization is increasingly critical for large-scale scientific workflow. Unfortunately, due to the highly intricate directed acyclic graph (DAG)-based workflow and the flexible usage of virtual machines (VMs) in cloud platform, the existing workflow scheduling approaches are inefficient to strike a balance between the parallelism and the topology of the DAG-based workflow while using the VMs, which causes a low utilization of VMs and consumes more cost. To address these issues, this paper presents a novel task scheduling framework named cost minimization approach with the DAG splitting method (COMSE) for minimizing the cost of running a deadline-constrained large-scale scientific workflow. First, we provide comprehensive theoretical analyses on how to improve the utilization of a resource-balanced multi-vCPU VM for running multiple tasks simultaneously. Second, considering the balance between the parallelism and the topology of a workflow, we simplify the DAG-based workflow, and based on the simplified DAG, a DAG splitting method is devised to preprocess the workflow. Third, since the cloud is charged by hours, we also design an exact algorithm to find the optimal operation pattern for a given schedule to make the consumed instance hours minimum, and this algorithm is named as instance hours minimization by Dijkstra (TOID). Finally, by employing the DAG splitting method and the TOID, the COMSE schedules a deadline-constrained large-scale scientific workflow on the multi-vCPU VMs and incorporates two important objects: minimizing the computation cost and the communication cost. Our solution approach is evaluated through rigorous performance evaluation study using real-word workflows, and the results show that the proposed COMSE approach outperforms existing algorithms in terms of computation cost and communication cost.
引用
收藏
页码:679 / 710
页数:31
相关论文
共 50 条
  • [1] Scheduling large-scale scientific workflow on virtual machines with different numbers of vCPUs
    Wu, Hao
    Chen, Xin
    Song, Xiaoyu
    Zhang, Chi
    Guo, He
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (01): : 679 - 710
  • [2] Different aspects of workflow scheduling in large-scale distributed systems
    Stavrinides, Georgios L.
    Rodrigo Duro, Francisco
    Karatza, Helen D.
    Garcia Blas, Javier
    Carretero, Jesus
    [J]. SIMULATION MODELLING PRACTICE AND THEORY, 2017, 70 : 120 - 134
  • [3] Large-scale virtual machines provisioning in clouds: challenges and approaches
    Zhang, Zhaoning
    Li, Dongsheng
    Wu, Kui
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2016, 10 (01) : 2 - 18
  • [4] Large-scale virtual machines provisioning in clouds: challenges and approaches
    Zhaoning Zhang
    Dongsheng Li
    Kui Wu
    [J]. Frontiers of Computer Science, 2016, 10 : 2 - 18
  • [5] Large-scale virtual machines provisioning in clouds:challenges and approaches
    Zhaoning ZHANG
    Dongsheng LI
    Kui WU
    [J]. Frontiers of Computer Science, 2016, 10 (01) : 2 - 18
  • [6] Towards Reactive Scheduling for Large-Scale Virtual Power Plants
    Troeschel, Martin
    Appelrath, Hans-Juergen
    [J]. MULTI-AGENT SYSTEM TECHNOLOGIES, PROCEEDINGS, 2009, 5774 : 141 - 152
  • [7] Effective heuristic for large-scale unrelated parallel machines scheduling problems
    Wang, Haibo
    Alidaee, Bahram
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2019, 83 : 261 - 274
  • [8] Tri-objective Optimization for Large-Scale Workflow Scheduling and Execution in Clouds
    Alrammah, Huda
    Gu, Yi
    Yun, Daqing
    Zhang, Ning
    [J]. JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2024, 32 (04)
  • [9] Aeromancer: A Workflow Manager for Large-Scale MapReduce-Based Scientific Workflows
    Mohamed, Nabeel
    Maji, Nabanita
    Zhang, Jing
    Timoshevskaya, Nataliya
    Feng, Wu-Chun
    [J]. 2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 739 - 746
  • [10] Remote Attestation of Large-scale Virtual Machines in the Cloud Data Center
    Chene, Jie
    Zhang, Kun
    Tu, Bibo
    [J]. 2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 180 - 187