A fault-tolerant scheduling system for computational grids

被引:23
|
作者
Amoon, Mohammed [1 ,2 ]
机构
[1] Menoufia Univ, Fac Elect Eng, Comp Sci & Eng Dept, Menoufia, Egypt
[2] King Saud Univ, RCC, Dept Comp Sci, Riyadh 11437, Saudi Arabia
关键词
D O I
10.1016/j.compeleceng.2011.11.004
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Fault-tolerant scheduling is an important issue for computational grid systems, as grids typically consist of strongly varying and geographically distributed resources. The main scheduling strategy of most fault-tolerant scheduling systems depends on the response time and fault index when selecting a resource to execute a certain job. In this paper, a scheduling system is presented that depends on a new factor called scheduling indicator in selecting resources. This factor comprises of the response time and the failure rate of grid resources. Whenever a grid scheduler has jobs to schedule on grid resources, it uses the scheduling indicator to generate the scheduling decisions. The main scheduling strategy of the system is to select resources that have the lowest tendency to fail. Extensive simulation experiments are conducted to quantify the performance of the proposed system. Experiments have shown that the proposed system can considerably improve grid performance in terms of throughput, unavailability, turnaround time, and fail tendency. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:399 / 412
页数:14
相关论文
共 50 条
  • [21] Fault-Tolerant Dynamic Task Graph Scheduling
    Kurt, Mehmet Can
    Krishnamoorthy, Sriram
    Agrawal, Kunal
    Agrawal, Gagan
    [J]. SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, : 719 - 730
  • [22] Dynamic replication of fault-tolerant scheduling algorithm
    Wang, Hongxia
    Fang, Haoran
    Qiu, Xin
    [J]. Open Cybernetics and Systemics Journal, 2015, 9 : 2670 - 2676
  • [23] Dynamic replication of fault-tolerant scheduling algorithm
    School of Computer Science and Engineering, Shenyang Ligong University, Shenyang
    110159, China
    [J]. Open. Cybern. Syst. J., 1 (2670-2676):
  • [24] Fault-Tolerant Rate-Monotonic Scheduling
    Sunondo Ghosh
    Rami Melhem
    Daniel Mossé
    Joydeep Sen Sarma
    [J]. Real-Time Systems, 1998, 15 : 149 - 181
  • [25] A FAULT-TOLERANT DATAFLOW SYSTEM
    SRINI, VP
    [J]. COMPUTER, 1985, 18 (03) : 54 - 68
  • [26] THE BASIC FAULT-TOLERANT SYSTEM
    SCHMITTER, EJ
    BAUES, P
    [J]. IEEE MICRO, 1984, 4 (01) : 66 - 74
  • [27] FAULT-TOLERANT SYSTEM OPTIMIZATION
    ROSE, J
    [J]. PROCEEDINGS ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 1980, (NSYM): : 95 - 100
  • [28] Fault Tolerant Resource Management Scheme for Computational Grids
    Kumar, Anuj
    Pathak, Heman
    [J]. INTERNATIONAL CONFERENCE ON INTELLIGENT DATA COMMUNICATION TECHNOLOGIES AND INTERNET OF THINGS, ICICI 2018, 2019, 26 : 472 - 481
  • [29] Fault tolerant job scheduling in computational grid
    Nazir, Babar
    Khan, Taimoor
    [J]. SECOND INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES 2006, PROCEEDINGS, 2006, : 708 - +
  • [30] Improving ARINC 653 System Reliability by Using Fault-Tolerant Partition Scheduling
    Kistijantoro, Achmad Imam
    Gilbran, Aufar
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS (ICAICTA 2018), 2018, : 182 - 187