Interconnection Network Reconstruction for Fault-Tolerance of Torus-Connected VLSI Array

Cited by: 0
Authors
Zhu, Longting [1]
Wu, Jigang [1]
Jiang, Guiyuan [2]
Sun, Jizhou [2]
Affiliations
[1] Tianjin Polytech Univ, Sch Comp Sci & Software Engn, Tianjin 300387, Peoples R China
[2] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
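Funding
National Natural Science Foundation of China; Specialized Research Fund for the Doctoral Program of Higher Education (Ministry of Education of China)
Keywords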
torus-connected VLSI array; reconfiguration algorithm; fault-tolerance; contradiction graph; RECONFIGURATION ALGORITHM; EFFICIENT; MESHES;
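DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract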
Effective fault-tolerant techniques are essential for improving the reliability of multiprocessor systems. This paper investigates the fault tolerance of torus-connected VLSI arrays with pre-integrated spare processing elements (PEs), achieved by reconfiguring the interconnection network among all PEs. We model the problem of deciding whether all faulty PEs can be replaced by spare ones as the problem of finding a maximum independent set in a contradiction graph, which is constructed from the original physical array with faulty PEs. Each node of the graph represents one replacement alternative for a faulty PE, while an edge indicates that the two alternatives it connects cannot coexist. We propose efficient algorithms to construct contradiction graphs from physical arrays with faulty and redundant PEs. We then customize an ant-colony algorithm to find an independent set that is as large as possible, and we develop an efficient algorithm to generate logical arrays from the resulting independent set. Three different distributions of redundant PEs are discussed, and satisfactory results have been achieved in simulation.
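The contradiction-graph model in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not state the exact conflict rule, so the sketch assumes two alternatives conflict when they serve the same faulty PE, claim the same spare PE, or share a routing resource, and it swaps the paper's customized ant-colony search for a simple minimum-degree greedy heuristic. All function names and the tuple layout are hypothetical.

```python
from itertools import combinations


def build_contradiction_graph(alternatives):
    """Build an adjacency map over replacement alternatives.

    alternatives: list of (faulty_pe, spare_pe, resources) tuples.
    ASSUMPTION: two alternatives conflict if they share the faulty PE,
    the spare PE, or any routing resource (rule not given in the abstract).
    """
    adj = {i: set() for i in range(len(alternatives))}
    for i, j in combinations(range(len(alternatives)), 2):
        f1, s1, r1 = alternatives[i]
        f2, s2, r2 = alternatives[j]
        if f1 == f2 or s1 == s2 or (set(r1) & set(r2)):
            adj[i].add(j)
            adj[j].add(i)
    return adj


def greedy_independent_set(adj):
    """Greedy stand-in for the ant-colony search (not the paper's method).

    Repeatedly picks the remaining node with the fewest remaining
    neighbours, then discards that node and its neighbours.
    """
    remaining = {v: set(n) for v, n in adj.items()}
    chosen = []
    while remaining:
        v = min(remaining, key=lambda u: (len(remaining[u]), u))
        chosen.append(v)
        removed = remaining[v] | {v}
        for u in removed:
            remaining.pop(u, None)
        for u in remaining:
            remaining[u] -= removed
    return chosen


def all_faults_covered(alternatives, chosen):
    """True if the chosen alternatives cover every faulty PE."""
    faulty = {a[0] for a in alternatives}
    return {alternatives[i][0] for i in chosen} == faulty


if __name__ == "__main__":
    # Hypothetical instance: two faulty PEs, two spares, three alternatives.
    alts = [
        ("f1", "s1", {"link_a"}),
        ("f1", "s2", {"link_b"}),
        ("f2", "s1", {"link_c"}),
    ]
    graph = build_contradiction_graph(alts)
    picked = greedy_independent_set(graph)
    print(picked, all_faults_covered(alts, picked))
```

Under this assumed conflict rule, alternatives for the same faulty PE are always mutually adjacent, so an independent set contains at most one alternative per faulty PE, and the array is fully repairable exactly when some independent set has as many nodes as there are faulty PEs. A greedy pass can miss such a set, which is one reason a stronger search may be preferable.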
Pages: 285-298
Number of pages: 14