Fault Resilient Real-Time Design for NoC Architectures

被引:2
|
作者
Zimmer, Christopher [1 ]
Mueller, Frank [1 ]
机构
[1] North Carolina State Univ, Dept Comp Sci, Raleigh, NC 27695 USA
基金
美国国家科学基金会;
关键词
SOFTWARE; REDUNDANCY; SYSTEMS; TASKS;
D O I
10.1109/ICCPS.2012.16
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Performance and time to market requirements cause many real-time designers to consider components, off the shelf (COTS) for real-time cyber-physical systems. Massive multi-core embedded processors with network-on-chip (NoC) designs to facilitate core-to-core communication are becoming common in COTS. These architectures benefit real-time scheduling, but they also pose predictability challenges. In this work, we develop a framework for Fault Observant and Correcting Real-Time Embedded design (Forte) that utilizes massive multi-core NoC designs to reduce overhead by up to an order of magnitude and to lower jitter in systems via utilizing message passing instead of shared memory as the means for intra-processor communication. Message passing, which is shown to improve the overall scalability of the system, is utilized as the basis for replication and task rejuvenation. This improves fault resilience by orders of magnitude. To our knowledge, this work is the first to systematically map real-time tasks onto massive multi-core processors with support for fault tolerance that considers NoC effects on scalability on an real hardware platform and not just in simulation.
引用
收藏
页码:75 / 84
页数:10
相关论文
共 50 条
  • [1] The design of real-time fault detectors
    Midonnet, S
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2005: COOPIS, DOA, AND ODBASE, PT 1, PROCEEDINGS, 2005, 3760 : 698 - 714
  • [2] DESIGN AND PERFORMANCE EVALUATION OF REAL-TIME COMMUNICATION ARCHITECTURES
    ELNAKHAL, AE
    RZEHAK, H
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 1993, 40 (04) : 404 - 411
  • [3] HopliteRT*: Real-Time NoC for FPGA
    Gonzalez, Yilian Ribot
    Nelissen, Geoffrey
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (11) : 3650 - 3661
  • [4] Simulation of distributed fault tolerant heterogeneous architectures for real-time control
    Benitez-Perez, H
    Thompson, HA
    Fleming, PJ
    [J]. ALGORITHMS AND ARCHITECTURES FOR REAL-TIME CONTROL 1998 (AARTC'98), 1998, : 71 - 76
  • [5] Analysis of fault tolerance and reliability in distributed real-time system architectures
    Philippi, S
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2003, 82 (02) : 195 - 206
  • [6] Towards the Practical Design of Performance-Aware Resilient Wireless NoC Architectures
    Agyeman, Michael Opoku
    Zong, Wen
    Kanakis, Triantafyllos
    Tong, Kin-Fai
    Mak, Terrence
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 479 - 484
  • [7] Heuristics for Mapping Real-time Applications to NoC-based Architectures using Genetic Algorithms
    Bonilha, Iae Santos
    dos Santos, Osmar Marchi
    Indrusiak, Leandro
    [J]. PROCEEDINGS OF IV BRAZILIAN SYMPOSIUM ON COMPUTING SYSTEMS ENGINEERING SBESC 2014, 2014, : 144 - 149
  • [8] Fault tolerance analysis of NoC architectures
    Lehtonen, Teijo
    Liljeberg, Pasi
    Plosila, Juha
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 361 - +
  • [9] Performance analysis of pools in soft real-time design architectures
    Juiz, C
    Puigjaner, R
    Perros, H
    [J]. NINTH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS, PROCEEDINGS, 2001, : 15 - 22
  • [10] A real-time library for the design of hybrid robot control architectures
    Beccari, G
    Caselli, S
    Reggiani, M
    Zanichelli, F
    [J]. 1998 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - PROCEEDINGS, VOLS 1-3: INNOVATIONS IN THEORY, PRACTICE AND APPLICATIONS, 1998, : 1145 - 1150