TOLERATING FAULTS IN HYPERCUBES USING SUBCUBE PARTITIONING

被引:27
|
作者
BRUCK, J
CYPHER, R
SOROKER, D
机构
[1] IBM Almaden Research Center, San Jose
[2] Shell Development Company
关键词
FAULT-TOLERANCE; HYPERCUBES; PARALLEL COMPUTING; RECONFIGURATION; SUBCUBES;
D O I
10.1109/12.142686
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We examine the issue of running algorithms on a hypercube which has both node and edge faults, and we assume a worst case distribution of the faults. We prove that for any constant c, an n-dimensional hypercube (n-cube) with n(c) faulty components contains a fault-free subgraph that can implement a large class of hypercube algorithms with only a constant factor slowdown. In addition, our approach yields practical implementations for small numbers of faults. For example, we show that any regular algorithm can be implemented on an n-cube that has at most n - 1 faults with slowdowns of at most 2 for computation and at most 4 for communication. To the best of our knowledge this is the first result showing that an n-cube can tolerate more than O(n) arbitrarily placed faults with a constant factor slowdown.
引用
收藏
页码:599 / 605
页数:7
相关论文
共 50 条