A novel fault-tolerant scheduling algorithm for precedence constrained tasks in real-time heterogeneous systems

Cited by: 93
Authors
Qin, Xiao
Jiang, Hong
Affiliations
[1] New Mexico Inst Min & Technol, Dept Comp Sci, Socorro, NM 87801 USA
[2] Univ Nebraska, Dept Comp Sci & Engn, Lincoln, NE 68588 USA
Funding
National Science Foundation (USA)
Keywords
real-time tasks; off-line scheduling; fault-tolerance; heterogeneous systems; precedence constraints; reliability; performability;
DOI
10.1016/j.parco.2006.06.006
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Fault tolerance is an essential requirement for real-time systems because faults can have catastrophic consequences. In this paper, we investigate an efficient off-line scheduling algorithm that generates schedules in which real-time tasks with precedence constraints can tolerate one processor's permanent failure in a heterogeneous system with a fully connected network. The tasks are assumed to be non-preemptable, and each task has two copies that are scheduled on different processors and mutually excluded in time. In recent literature, the quality of a schedule has been improved by allowing a backup copy to overlap with other backup copies on the same processor. However, this approach assumes that tasks are independent of one another. To meet the needs of real-time systems where tasks have precedence constraints, a new overlapping scheme is proposed. We show that, given two tasks, the necessary conditions for their backup copies to safely overlap in time with each other are (1) their corresponding primary copies are scheduled on two different processors, (2) they are independent tasks, and (3) the execution of their backup copies implies the failures of the processors on which their primary copies are scheduled. For tasks with precedence constraints, the new overlapping scheme allows the backup copy of a task to overlap with its successors' primary copies, thereby further reducing schedule length. Based on a proposed reliability model, tasks are judiciously allocated to processors so as to maximize the reliability of heterogeneous systems. Additionally, the times for detecting and handling a permanent fault are incorporated into the scheduling scheme. We have performed experiments using synthetic workloads as well as a real-world application. Simulation results show that, compared with existing scheduling algorithms in the literature, our scheduling algorithm improves reliability by up to 22.4% (with an average of 16.4%) and achieves an improvement in performability, a measure that combines reliability and schedulability, by up to 421.9% (with an average of 49.3%). (c) 2006 Elsevier B.V. All rights reserved.
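The three necessary conditions for backup overlap stated in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the `Task` record, its field names, and the `satisfies_overlap_conditions` helper are hypothetical, and condition (3) is modeled as a plain boolean flag because the abstract does not say how it is established. Since the conditions are necessary ones, a `True` result only means that the overlap is not ruled out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    """Hypothetical task record; all field names are illustrative assumptions."""
    name: str
    primary_proc: int  # processor that hosts the primary copy
    # Names of all tasks this task depends on, directly or transitively.
    ancestors: frozenset = frozenset()
    # Assumed flag for condition (3): the backup copy executes only if the
    # processor hosting the primary copy has failed.
    backup_implies_primary_failure: bool = True


def are_independent(a: Task, b: Task) -> bool:
    """Two tasks are independent if neither depends on the other."""
    return a.name not in b.ancestors and b.name not in a.ancestors


def satisfies_overlap_conditions(a: Task, b: Task) -> bool:
    """Check the three necessary conditions listed in the abstract."""
    # (1) Primary copies are scheduled on two different processors.
    different_primaries = a.primary_proc != b.primary_proc
    # (2) The tasks are independent of each other.
    independent = are_independent(a, b)
    # (3) Each backup's execution implies its primary's processor failed.
    failure_driven = (a.backup_implies_primary_failure
                      and b.backup_implies_primary_failure)
    return different_primaries and independent and failure_driven


t1 = Task("t1", primary_proc=0)
t2 = Task("t2", primary_proc=1)
t3 = Task("t3", primary_proc=0)
t4 = Task("t4", primary_proc=1, ancestors=frozenset({"t1"}))

print(satisfies_overlap_conditions(t1, t2))  # conditions hold
print(satisfies_overlap_conditions(t1, t3))  # violates (1): same primary processor
print(satisfies_overlap_conditions(t1, t4))  # violates (2): t4 depends on t1
```

Condition (1) follows from the single-failure assumption: if both primary copies shared one processor, its failure would activate both backup copies at once, and they could no longer share a time slot.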
Pages: 331-356
Number of pages: 26