A fault-tolerant dynamic scheduling algorithm for multiprocessor real-time systems and its analysis

被引:96
|
作者
Manimaran, G [1 ]
Murthy, CSR [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Madras 600036, Tamil Nadu, India
关键词
real-time system; dynamic scheduling; fault tolerance; resource reclaiming; run-time anomaly; safety critical application;
D O I
10.1109/71.735960
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many time-critical applications require dynamic scheduling with predictable performance. Tasks corresponding to these applications have deadlines to be met despite the presence of faults. In this paper, we propose an algorithm to dynamically schedule arriving real-time tasks with resource and fault-tolerant requirements on to multiprocessor systems. The tasks are assumed to be nonpreemptable and each task has two copies (versions) which are mutually excluded in space, as well as in time in the schedule, to handle permanent processor failures and to obtain better performance, respectively. Our algorithm can tolerate more than one fault at a time, and employs performance improving techniques such as 1) distance concept which decides the relative position of the two copies of a task in the task queue, 2) flexible backup overloading, which introduces a trade-off between degree of fault tolerance and performance, and 3) resource reclaiming, which reclaims resources both from deallocated backups and early completing tasks. We quantify, through simulation studies, the effectiveness of each of these techniques in improving the guarantee ratio, which is defined as the percentage of total tasks, arrived in the system, whose deadlines are met. Also, we compare through simulation studies the performance our algorithm with a best known algorithm for the problem, and show analytically the importance of distance parameter in fault-tolerant dynamic scheduling in multiprocessor real-time systems.
引用
收藏
页码:1137 / 1152
页数:16
相关论文
共 50 条
  • [2] A Fault-tolerant Scheduling Algorithm Based on Grouping for Real-time Multiprocessor
    Yu, Xingbiao
    Zheng, Changwen
    Hu, Xiaohui
    Zhao, Junsuo
    [J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 919 - 923
  • [3] A Novel Intelligent Algorithm for Fault-Tolerant Task Scheduling in Real-Time Multiprocessor Systems
    Zarinzad, Golbarg
    Rahmani, Amir Masoud
    Dayhim, Nikta
    [J]. Third 2008 International Conference on Convergence and Hybrid Information Technology, Vol 2, Proceedings, 2008, : 816 - 821
  • [4] Fault-tolerant partitioning scheduling algorithms in real-time multiprocessor systems
    Beitollahi, Hakem
    Deconinck, Geert
    [J]. 12TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2006, : 296 - +
  • [5] AN ADAPTIVE SCHEDULING ALGORITHM FOR FAULT-TOLERANT REAL-TIME SYSTEMS
    CHETTO, H
    CHETTO, M
    [J]. SOFTWARE ENGINEERING JOURNAL, 1991, 6 (03): : 93 - 100
  • [6] A fault-tolerant real-time scheduling algorithm in software fault-tolerant module
    Liu, Dong
    Xing, Weiyan
    Li, Rui
    Zhang, Chunyuan
    Li, Haiyan
    [J]. COMPUTATIONAL SCIENCE - ICCS 2007, PT 4, PROCEEDINGS, 2007, 4490 : 961 - +
  • [7] Fault-Tolerant Real-Time Fair Scheduling on Multiprocessor Systems with Cold-Standby
    Nair, Piyoosh Purushothaman
    Sarkar, Arnab
    Biswas, Santosh
    [J]. IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (04) : 1718 - 1732
  • [8] An adaptive scheme for fault-tolerant scheduling of soft real-time tasks in multiprocessor systems
    Al-Omari, R
    Somani, AK
    Manimaran, G
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2005, 65 (05) : 595 - 608
  • [9] Supervisory control for fault-tolerant scheduling of real-time multiprocessor systems with aperiodic tasks
    Park, Seong-Jin
    Cho, Kwang-Hyun
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2009, 82 (02) : 217 - 227
  • [10] Real-Time Fault-Tolerant Scheduling Algorithm with Rearrangement in Cloud Systems
    Guo, Pengze
    Xue, Zhi
    [J]. PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 399 - 402