Adaptive fault-tolerant architecture and routing algorithm for reliable many-core 3D-NoC systems

被引:20
|
作者
Ben Ahmed, Akram [1 ]
Ben Abdallah, Abderazek [2 ]
机构
[1] Keio Univ, Dept Informat & Comp Sci, Yokohama, Kanagawa 2238522, Japan
[2] Univ Aizu, Grad Sch Comp Sci & Engn, Adapt Syst Lab, Aizu Wakamatsu, Fukushima 9658580, Japan
关键词
3D NoC; Fault-tolerance; Robustness; Architecture; Dynamic reconfiguration; Deadlock-free; NETWORKS;
D O I
10.1016/j.jpdc.2016.03.014
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
During the last few decades, Three-dimensional Network-on-Chips (3D-NoCs) have been showing their advantages against 2D-NoC architectures. This is thanks to the reduced average interconnect length and lower interconnect-power consumption inherited from Three-dimensional Integrated Circuits (3D-ICs). On the other hand, questions about their reliability is starting to arise. This issue is mainly caused by their complex nature where a single faulty transistor may cause intolerable performance degradation or even the entire system collapse. To ensure their correct functionality, 3D-NoC systems must be fault-tolerant to any short-term malfunction or permanent physical damage to ensure message delivery on time while minimizing the performance degradation as much as possible. In this paper, we present a fault-tolerant 3D-NoC architecture, called 3D-Fault-Tolerant-OASIS (3D-FTO).(1) Withthe aid of a light-weight routing algorithm, 3D-FTO manages to avoid the system failure at the presence of a large number of transient, intermittent, and permanent faults. Moreover, the proposed architecture is leveraging on reconfigurable components to handle the fault occurrence in links, input buffers, and crossbar, where the faults are more often to happen. The proposed 3D-FTO system is able to work around different kinds of faults ensuring graceful performance degradation while minimizing the additional hardware complexity and remaining power-efficient. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:30 / 43
页数:14
相关论文
共 50 条
  • [1] Fault-Tolerant Circular Routing Algorithm for 3D-NoC
    Alizadeh, Razieh
    Saneei, Mohsen
    Ebrahimi, Masoumeh
    2014 INTERNATIONAL CONGRESS ON TECHNOLOGY, COMMUNICATION AND KNOWLEDGE (ICTCK), 2014,
  • [2] High Performance Fault-Tolerant Routing Algorithm for NoC-based Many-Core Systems
    Ebrahimi, Masoumeh
    Daneshtalab, Masoud
    Plosila, Juha
    PROCEEDINGS OF THE 2013 21ST EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING, 2013, : 462 - 469
  • [3] A low-overhead soft–hard fault-tolerant architecture, design and management scheme for reliable high-performance many-core 3D-NoC systems
    Khanh N. Dang
    Michael Meyer
    Yuichi Okuyama
    Abderazek Ben Abdallah
    The Journal of Supercomputing, 2017, 73 : 2705 - 2729
  • [4] A low-overhead soft-hard fault-tolerant architecture, design and management scheme for reliable high-performance many-core 3D-NoC systems
    Dang, Khanh N.
    Meyer, Michael
    Okuyama, Yuichi
    Ben Abdallah, Abderazek
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (06): : 2705 - 2729
  • [5] Reflect3d: An Adaptive and Fault-Tolerant Routing Algorithm for Vertically-Partially-Connected 3D-NoC
    da Silva, Alexandre Almeida
    e Silva Junior, Leonel Maia
    Coelho, Alexandre
    Silveira, Jarbas
    Marcon, Cesar
    34TH SBC/SBMICRO/IEEE/ACM SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN (SBCCI 2021), 2021,
  • [6] Fault-Tolerant Adaptive Routing under Permanent and Temporary Failures for Many-Core Systems-on-Chip
    Dimopoulos, Michael
    Gang, Yi
    Benabdenbi, Mounir
    Anghel, Lorena
    Zergainoh, Nacer-Eddine
    Nicolaidis, Michael
    PROCEEDINGS OF THE 2013 IEEE 19TH INTERNATIONAL ON-LINE TESTING SYMPOSIUM (IOLTS), 2013, : 7 - 12
  • [7] Architecture and design of high-throughput, low-latency, and fault-tolerant routing algorithm for 3D-network-on-chip (3D-NoC)
    Akram Ben Ahmed
    Abderazek Ben Abdallah
    The Journal of Supercomputing, 2013, 66 : 1507 - 1532
  • [8] Architecture and design of high-throughput, low-latency, and fault-tolerant routing algorithm for 3D-network-on-chip (3D-NoC)
    Ben Ahmed, Akram
    Ben Abdallah, Abderazek
    JOURNAL OF SUPERCOMPUTING, 2013, 66 (03): : 1507 - 1532
  • [9] A TSV Fault-Tolerant Scheme Based on Failure Classificationin 3D-NoC
    Ouyang, Yiming
    Da, Jian
    Wang, Xiumin
    Han, Qianqian
    Liang, Huaguo
    Du, Gaoming
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2017, 26 (04)
  • [10] A scalable and fault-tolerant network routing scheme for many-core and multi-chip systems
    Tsai, Wen-Chung
    Chu, Kuo-Chih
    Hu, Yu-Hen
    Chen, Sao-Jie
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2012, 72 (11) : 1433 - 1441