Fault-Tolerant Real-Time Fair Scheduling on Multiprocessor Systems with Cold-Standby

被引:7
|
作者
Nair, Piyoosh Purushothaman [1 ]
Sarkar, Arnab [1 ]
Biswas, Santosh [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Gauhati 781039, Assam, India
关键词
Task analysis; Fault tolerant systems; Processor scheduling; Real-time systems; Redundancy; Scheduling; Deadline partitioning; fault-tolerance; proportional fairness; scheduling; real-time; TASKS;
D O I
10.1109/TDSC.2019.2934098
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The ability to maintain functional and temporal correctness in the presence of faults is a key requirement in many safety-critical embedded systems. This work proposes an efficient fault recovery mechanism for real-time multiprocessor systems scheduled using a low overhead, semi-partitioned optimal proportional fair scheduling technique. We assume a system that can handle a single permanent processor fault at any time, using cold back-ups (with pre-specified activation / recovery time subsequent to the detection of a fault). As a result of the fault, the system may suffer transient overloads during such recovery periods, potentially leading to unacceptable fairness deviations and consequent rejections / early terminations of critical jobs. The proposed fault-tolerant scheduler, called Fault Tolerant Fair Scheduler (FT-FS), attempts to minimize such job terminations / rejections during recovery, by judiciously redistributing slacks accumulated by a subset of jobs, delivering more sustainable performance in the process. Experimental results reveal that the proposed FT-FS algorithm performs appreciably even under high system loads. Practical applicability of our proposed scheme has been illustrated using a case study on aircraft flight control system.
引用
收藏
页码:1718 / 1732
页数:15
相关论文
共 50 条