Reliability-Aware Distributed Computing Scheduling Policy

被引:0
|
作者
Abawajy, Jemal [1 ]
Hassan, Mohammad Mehedi [2 ]
机构
[1] Deakin Univ, Sch Informat Technol, Fac Sci & Technol, Geelong, Vic 3217, Australia
[2] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11543, Saudi Arabia
关键词
Cloud computing; Job scheduling; Fault-tolerance; Replication; Performances;
D O I
10.1007/978-3-319-27161-3_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the primary issues associated with the efficient and effective utilization of distributed computing is resource management and scheduling. As distributed computing resource failure is a common occurrence, the issue of deploying support for integrated scheduling and fault-tolerant approaches becomes paramount importance. To this end, we propose a fault-tolerant dynamic scheduling policy that loosely couples dynamic job scheduling with job replication scheme such that jobs are efficiently and reliably executed. The novelty of the proposed algorithm is that it uses passive replication approach under high system load and active replication approach under low system loads. The switch between these two replication methods is also done dynamically and transparently. Performance evaluation of the proposed fault-tolerant scheduler and a comparison with similar fault-tolerant scheduling policy is presented and shown that the proposed policy performs better than the existing approach.
引用
收藏
页码:627 / 632
页数:6
相关论文
共 50 条
  • [21] Reliability-Aware Task Allocation in Distributed Computing Systems using Hybrid Simulated Annealing and Tabu Search
    Faragardi, Hamid Reza
    Shojaee, Reza
    Yazdani, Nasser
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 1088 - 1095
  • [22] End-to-End Reliability-Aware Scheduling for Wireless Sensor Networks
    Dobslaw, Felix
    Zhang, Tingting
    Gidlund, Mikael
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2016, 12 (02) : 758 - 767
  • [23] Reliability-Aware Comprehensive Routing and Scheduling in Time-Sensitive Networking
    Feng, Jiaqi
    Zhang, Tong
    Yi, Changyan
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS (WASA 2022), PT II, 2022, 13472 : 243 - 254
  • [24] Reliability-aware task scheduling for energy efficiency on heterogeneous multiprocessor systems
    Zexi Deng
    Dunqian Cao
    Hong Shen
    Zihan Yan
    Huimin Huang
    The Journal of Supercomputing, 2021, 77 : 11643 - 11681
  • [25] Reliability-aware Scheduling and Routing for Messages in Time-sensitive Networking
    Zhou, Yuanbin
    Samii, Soheil
    Eles, Petru
    Peng, Zebo
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2021, 20 (05)
  • [26] Reliability-aware task scheduling for energy efficiency on heterogeneous multiprocessor systems
    Deng, Zexi
    Cao, Dunqian
    Shen, Hong
    Yan, Zihan
    Huang, Huimin
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (10): : 11643 - 11681
  • [27] Reliability-Aware and Energy-Efficient Workflow Scheduling in IaaS Clouds
    Ye, Lingjuan
    Xia, Yuanqing
    Tao, Siyuan
    Yan, Ce
    Gao, Runze
    Zhan, Yufeng
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) : 2156 - 2169
  • [28] Energy-Efficient Reliability-Aware Scheduling Algorithm on Heterogeneous Systems
    Tang, Xiaoyong
    Tan, Weizhen
    SCIENTIFIC PROGRAMMING, 2016, 2016
  • [29] On the reliability-aware geographic routing
    Taha, ZQ
    Liu, M
    2005 Wireless Telecommunications Symposium, 2005, : 74 - 78
  • [30] Reliability-aware system synthesis
    Glass, Michael
    Lukasiewycz, Martin
    Streichert, Thilo
    Haubelt, Christian
    Teich, Juergen
    2007 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2007, : 409 - 414