Using Explicit Output Comparisons for Fault Tolerant Scheduling (FTS) on Modern High-Performance Processors

被引:0
|
作者
Gao, Yue [1 ]
Gupta, Sandeep K. [1 ]
Breuer, Melvin A. [1 ]
机构
[1] Univ Southern Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90007 USA
关键词
SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Soft errors and errors caused by intermittent faults are a major concern for modern processors. In this paper we provide a drastically different approach for fault tolerant scheduling (FTS) of tasks in such processors. Traditionally in FTS, error detection is performed implicitly and concurrently with task execution, and associated overheads are incurred as increases in software run-time or hardware area. However, such embedded error detection (EED) techniques, e.g., watchdog processor assisted control flow checking, only provide approximately 70% error coverage [1, 2]. We propose the idea of utilizing straightforward explicit output comparison (EOC) which provides nearly 100% error coverage. We construct a framework for utilizing EOC in FTS, identify new challenges and tradeoffs, and develop a new off-line scheduling algorithm for EOC. We show that our EOC based approach provides higher error coverage and an average performance improvement of nearly 10% over EED-based FTS approaches, without increasing resource requirements. In our ongoing research we are identifying a richer set of ways of applying EOC, by itself and in conjunction with EED, to obtain further improvements.
引用
收藏
页码:927 / 932
页数:6
相关论文
共 50 条
  • [1] Fault-tolerant high-performance cordic processors
    Kwak, JH
    Piuri, V
    Swartzlander, EE
    IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI SYSTEMS, PROCEEDINGS, 2000, : 164 - 172
  • [2] FTS: A high-performance CORBA fault-tolerance service
    Friedman, R
    Hadad, E
    PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL WORKSHOP ON OBJECT-ORIENTED REAL-TIME DEPENDABLE SYSTEMS, 2002, : 61 - 68
  • [3] Fault-Tolerant Parallel Scheduling of Tasks on a Heterogeneous High-Performance Workstation Cluster
    Yu-Kwong Kwok
    The Journal of Supercomputing, 2001, 19 : 299 - 314
  • [4] Fault-tolerant parallel scheduling of tasks on a heterogeneous high-performance workstation cluster
    Kwok, YK
    JOURNAL OF SUPERCOMPUTING, 2001, 19 (03): : 299 - 314
  • [5] UNIFIED SCHEDULING OF HIGH-PERFORMANCE PARALLEL VLSI PROCESSORS FOR ROBOTICS
    KIM, B
    KAMEYAMA, M
    HIGUCHI, T
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1993, E76A (06) : 904 - 910
  • [6] Fault tolerant design of neuro-processors using weight limitation and ternary output
    Tomabechi, Nobubiro
    Fujioka, Yoshichika
    2007 MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-4, 2007, : 979 - 984
  • [7] DOORS: Towards high-performance fault tolerant CORBA
    Natarajan, B
    Gokhale, A
    Yajnik, S
    Schmidt, DC
    DOA'00: INTERNATIONAL SYMPOSIUM ON DISTRIBUTED OBJECTS AND APPL ICATIONS, PROCEEDINGS, 2000, : 39 - 48
  • [8] HIGH-PERFORMANCE FAULT-TOLERANT VLSI SYSTEMS USING MICRO ROLLBACK
    TAMIR, Y
    TREMBLAY, M
    IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (04) : 548 - 554
  • [9] Power-aware modulo scheduling for high-performance VLIW processors
    Yun, HS
    Kim, J
    ISLPED'01: PROCEEDINGS OF THE 2001 INTERNATIONAL SYMPOSIUM ON LOWPOWER ELECTRONICS AND DESIGN, 2001, : 40 - 45
  • [10] Hybrid-scheduling for reduced energy consumption in high-performance processors
    Valluri, Madhavi
    John, Lizy
    Hanson, Heather
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2006, 14 (09) : 1039 - 1043