ROUTING IN MODULAR FAULT-TOLERANT MULTIPROCESSOR SYSTEMS

被引:2
|
作者
ALAM, MS [1 ]
MELHEM, RG [1 ]
机构
[1] UNIV PITTSBURGH,DEPT COMP SCI,PITTSBURGH,PA 15260
基金
美国国家科学基金会;
关键词
SPARING; MODULAR MULTIPROCESSORS; FAULT-TOLERANT ROUTING; HYPERCUBE MULTICOMPUTERS; MESH CONNECTED PROCESSORS;
D O I
10.1109/71.476192
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we consider a class of modular multiprocessor architectures in which spares are added to each module to cover for faulty nodes within that module, thus forming a fault-tolerant basic block (FTBB). In contrast to reconfiguration techniques that preserve the physical adjacency between active nodes in the system, our goal is to preserve the logical adjacency between active nodes by means of a routing algorithm which delivers messages successfully to their destinations, We introduce two-phase routing strategies that route messages first to their destination FTBB, and then to the destination nodes within the destination FTBB. Such a strategy may be applied to a variety of architectures including binary hypercubes and three dimensional tori. In the presence of f faults in hypercubes and tori, we show that the worst case length of the message route is min {sigma + f, (K + 1)sigma} + c where sigma is the shortest path in the absence of faults, K is the number of spare nodes in an FTBB, and c is a small constant. The average routing overhead is much lower than the worst case overhead.
引用
收藏
页码:1206 / 1220
页数:15
相关论文
共 50 条
  • [31] A MULTIPROCESSOR WORKING AS A FAULT-TOLERANT CELLULAR AUTOMATON
    HANDLER, W
    [J]. COMPUTING, 1992, 48 (01) : 5 - 20
  • [32] Comments on "A Class of Fault-Tolerant Multiprocessor Networks"
    Kim, Jong-Seok
    Lee, Hyeong-Ok
    Kim, Sung Won
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2009, 58 (03) : 496 - 500
  • [33] ON AN OPTIMALLY FAULT-TOLERANT MULTIPROCESSOR NETWORK ARCHITECTURE
    SENGUPTA, A
    SEN, A
    BANDYOPADHYAY, S
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1987, 36 (05) : 619 - 623
  • [34] AN OPERATING SYSTEM FOR A FAULT-TOLERANT MULTIPROCESSOR CONTROLLER
    WILLIAMS, RD
    JOHNSON, BW
    ROBERTS, TE
    [J]. IEEE MICRO, 1988, 8 (04) : 18 - 29
  • [35] ATTEMPTO - AN EXPERIMENTAL FAULT-TOLERANT MULTIPROCESSOR SYSTEM
    DALCIN, M
    BRAUSE, R
    LUTZ, J
    DILGER, E
    RISSE, T
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1987, 20 (4-5): : 301 - 308
  • [36] A FAULT-TOLERANT MULTIPROCESSOR CONTROLLER FOR MAGNETIC BEARINGS
    YATES, SW
    WILLIAMS, RD
    [J]. IEEE MICRO, 1988, 8 (04) : 6 - 17
  • [37] An integrated scheduling mechanism for fault-tolerant modular avionics systems
    Lee, YH
    Younis, M
    Zhou, J
    [J]. 1998 IEEE AEROSPACE CONFERENCE PROCEEDINGS, VOL 4, 1998, : 21 - 29
  • [38] Fault-tolerant partitioning scheduling algorithms in real-time multiprocessor systems
    Beitollahi, Hakem
    Deconinck, Geert
    [J]. 12TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2006, : 296 - +
  • [39] An extended generalized hypercube as a fault-tolerant system area network for multiprocessor systems
    M. F. Karavay
    V. S. Podlazov
    [J]. Automation and Remote Control, 2015, 76 : 336 - 352
  • [40] Fault-tolerant routing in the star graph
    Rezazad, SM
    Sarbazi-Azad, H
    [J]. 18TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 2 (REGULAR PAPERS), PROCEEDINGS, 2004, : 503 - 506