Adaptive fault-tolerant architecture and routing algorithm for reliable many-core 3D-NoC systems

被引:20
|
作者
Ben Ahmed, Akram [1 ]
Ben Abdallah, Abderazek [2 ]
机构
[1] Keio Univ, Dept Informat & Comp Sci, Yokohama, Kanagawa 2238522, Japan
[2] Univ Aizu, Grad Sch Comp Sci & Engn, Adapt Syst Lab, Aizu Wakamatsu, Fukushima 9658580, Japan
关键词
3D NoC; Fault-tolerance; Robustness; Architecture; Dynamic reconfiguration; Deadlock-free; NETWORKS;
D O I
10.1016/j.jpdc.2016.03.014
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
During the last few decades, Three-dimensional Network-on-Chips (3D-NoCs) have been showing their advantages against 2D-NoC architectures. This is thanks to the reduced average interconnect length and lower interconnect-power consumption inherited from Three-dimensional Integrated Circuits (3D-ICs). On the other hand, questions about their reliability is starting to arise. This issue is mainly caused by their complex nature where a single faulty transistor may cause intolerable performance degradation or even the entire system collapse. To ensure their correct functionality, 3D-NoC systems must be fault-tolerant to any short-term malfunction or permanent physical damage to ensure message delivery on time while minimizing the performance degradation as much as possible. In this paper, we present a fault-tolerant 3D-NoC architecture, called 3D-Fault-Tolerant-OASIS (3D-FTO).(1) Withthe aid of a light-weight routing algorithm, 3D-FTO manages to avoid the system failure at the presence of a large number of transient, intermittent, and permanent faults. Moreover, the proposed architecture is leveraging on reconfigurable components to handle the fault occurrence in links, input buffers, and crossbar, where the faults are more often to happen. The proposed 3D-FTO system is able to work around different kinds of faults ensuring graceful performance degradation while minimizing the additional hardware complexity and remaining power-efficient. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:30 / 43
页数:14
相关论文
共 50 条
  • [31] Partially shared cache and adaptive replacement algorithm for NoC-based many-core systems
    Yang, Pengfei
    Wang, Quan
    Ye, Hongwei
    Zhang, Zhiqiang
    JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 98 : 424 - 433
  • [32] Adaptive Fault Simulation on Many-core Microprocessor Systems
    Haghbayan, Mohammad-Hashem
    Teravainen, Sami
    Rahmani, Amir-Mohammad
    Liljeberg, Pasi
    Tenhunen, Hannu
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFTS), 2015, : 151 - 154
  • [33] A New Fault-Tolerant Deadlock-Free Fully Adaptive Routing in NOC
    Janfaza, Vahid
    Baharlouei, Elaheh
    2017 IEEE EAST-WEST DESIGN & TEST SYMPOSIUM (EWDTS), 2017,
  • [34] Fault-Tolerant Routing Algorithm for Mesh based NoC using Reinforcement Learning
    Samala, Jagadheesh
    Takawale, Harshvardhan
    Chokhani, Yash
    Bhanu, P. Veda
    Soumya, J.
    2020 24TH INTERNATIONAL SYMPOSIUM ON VLSI DESIGN AND TEST (VDAT), 2020,
  • [35] Fault-tolerant routing algorithm of NoC based on buffer reuse of faulty links
    Zhang, S. (zsjk@163.com), 1600, Institute of Computing Technology (26):
  • [36] A NEW DEADLOCK-FREE FAULT-TOLERANT ROUTING ALGORITHM FOR NOC INTERCONNECTIONS
    Jovanovic, Slavisa
    Tanougast, Camel
    Weber, Serge
    Bobda, Christophe
    FPL: 2009 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, 2009, : 326 - +
  • [37] LEAD: An Adaptive 3D-NoC Routing Algorithm with Queuing-Theory Based Analytical Verification
    Salamat, Ronak
    Khayambashi, Misagh
    Ebrahimi, Masoumeh
    Bagherzadeh, Nader
    IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (08) : 1153 - 1166
  • [38] 3D NoC deflection fault-tolerant routing method based on dynamic priority
    Ouyang, Yiming
    Ouyang, Xiaoye
    Liang, Huaguo
    Huang, Zhengfeng
    Liu, Jun
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2014, 26 (03): : 486 - 492
  • [39] A Congestion-adaptive Fault-tolerant Routing Algorithm on HNoC
    Fang, Juan
    Cheng, Yanjin
    Zhao, Hui
    2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019, : 911 - 916
  • [40] Performance of an adaptive fault-tolerant routing algorithm for multicast communications
    Borella, A
    Cancellieri, G
    ATM, NETWORKS AND LANS - NOC '96-II, 1996, : 295 - 296