Reliability-Aware Distributed Computing Scheduling Policy

被引:0
|
作者
Abawajy, Jemal [1 ]
Hassan, Mohammad Mehedi [2 ]
机构
[1] Deakin Univ, Sch Informat Technol, Fac Sci & Technol, Geelong, Vic 3217, Australia
[2] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11543, Saudi Arabia
关键词
Cloud computing; Job scheduling; Fault-tolerance; Replication; Performances;
D O I
10.1007/978-3-319-27161-3_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the primary issues associated with the efficient and effective utilization of distributed computing is resource management and scheduling. As distributed computing resource failure is a common occurrence, the issue of deploying support for integrated scheduling and fault-tolerant approaches becomes paramount importance. To this end, we propose a fault-tolerant dynamic scheduling policy that loosely couples dynamic job scheduling with job replication scheme such that jobs are efficiently and reliably executed. The novelty of the proposed algorithm is that it uses passive replication approach under high system load and active replication approach under low system loads. The switch between these two replication methods is also done dynamically and transparently. Performance evaluation of the proposed fault-tolerant scheduler and a comparison with similar fault-tolerant scheduling policy is presented and shown that the proposed policy performs better than the existing approach.
引用
收藏
页码:627 / 632
页数:6
相关论文
共 50 条
  • [1] Reliability-aware scheduling strategy for heterogeneous distributed computing systems
    Tang, Xiaoyong
    Li, Kenli
    Li, Renfa
    Veeravalli, Bharadwaj
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (09) : 941 - 952
  • [2] Reliability-aware DAG scheduling with primary-backup in cloud computing
    Jing, Weipeng
    Liu, Yaqiu
    Shao, Hongrun
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2015, 52 (01) : 86 - 93
  • [3] Instruction Scheduling for Reliability-Aware Compilation
    Rehman, Semeen
    Shafique, Muhammad
    Henkel, Joerg
    2012 49TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2012, : 1288 - 1296
  • [4] Reliability-aware and Deadline-constrained workflow scheduling in Mobile Edge Computing
    Peng, Qinglan
    Jiang, Haochen
    Chen, Mujie
    Liang, Jiawei
    Xia, Yunni
    PROCEEDINGS OF THE 2019 IEEE 16TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2019), 2019, : 236 - 241
  • [5] A Reliability-aware Task Scheduling Algorithm Based on Replication on Heterogeneous Computing Systems
    Wang, Shuli
    Li, Kenli
    Mei, Jing
    Xiao, Guoqing
    Li, Keqin
    JOURNAL OF GRID COMPUTING, 2017, 15 (01) : 23 - 39
  • [6] A Reliability-aware Task Scheduling Algorithm Based on Replication on Heterogeneous Computing Systems
    Shuli Wang
    Kenli Li
    Jing Mei
    Guoqing Xiao
    Keqin Li
    Journal of Grid Computing, 2017, 15 : 23 - 39
  • [7] A reliability-aware scheduling algorithm for parallel task executing on cloud computing system
    Cao J.
    Zhang Z.
    Wang B.
    Cui X.
    Xu J.
    International Journal of Intelligent Systems Technologies and Applications, 2021, 20 (03) : 215 - 232
  • [8] Reliability-Aware Scheduling on Heterogeneous Multicore Processors
    Naithani, Ajeya
    Eyerman, Stijn
    Eeckhout, Lieven
    2017 23RD IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2017, : 397 - 408
  • [9] A Case for Lifetime Reliability-Aware Neuromorphic Computing
    Song, Shihao
    Das, Anup
    2020 IEEE 63RD INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2020, : 596 - 598
  • [10] Reliability-Aware Online Scheduling for DNN Inference Tasks in Mobile-Edge Computing
    Ma, Huirong
    Li, Rui
    Zhang, Xiaoxi
    Zhou, Zhi
    Chen, Xu
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (13) : 11453 - 11464