Fault-tolerance schemes for clusterheads in clustered mesh networks

被引:0
|
作者
Zurawski, Jason [1 ]
Wang, Dajin [1 ]
机构
[1] Montclair State Univ, Dept Comp Sci, Montclair, NJ 07043 USA
关键词
distributed processing; fault tolerance; hierarchical control; interconnection networks; mesh;
D O I
10.1080/17445760701640332
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
To improve the overall system performance for distributed systems using mesh as their underlying structure, a hierarchical approach was proposed in [7]. The hierarchical configuration divides the mesh into clusters, thus allowing for processing to occur in small local groups at the lower levels. After local operations, the results are passed to higher logical levels. This method has been shown to be able to significantly reduce the total communication cost for the entire system. This paper is concerned with the hierarchical system's ability to handle node failure. When using a hierarchical configuration, certain nodes in the mesh become more important to the overall system than others. It is important that the hierarchical system have a reorganising mechanism in case of node failure, in such a way that the performance gain from hierarchical configuration is salvaged as much as possible. The work presented in this paper focuses on minimising the loss of performance in the system hierarchy due to the presence of failing nodes. We will propose faulttolerance schemes for that purpose. The performance results will be compared to that of an ideal, fault-free system. We will present strategies to reconstruct the hierarchy, accommodating to the situation that some nodes in the original hierarchy are not functioning anymore. To that end, new local heads may be selected and local nodes regrouped. We will also present experiment results that examine the effectiveness of the proposed schemes. Examples of both faulty and fault-free hierarchical mesh systems will be tested to quantify how good the proposed schemes are.
引用
收藏
页码:271 / 287
页数:17
相关论文
共 50 条
  • [1] Fault-tolerance schemes for hierarchical mesh networks
    Zurawski, J
    Wang, DJ
    [J]. PDCAT 2005: Sixth International Conference on Parallel and Distributed Computing, Applications and Technologies, Proceedings, 2005, : 498 - 502
  • [2] Construction Schemes for Edge Fault-Tolerance of Ring Networks
    Hung, Chun-Nan
    Kung, Tzu-Liang
    Zhang, En-Cheng
    [J]. INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS-2018, 2019, 773 : 626 - 631
  • [3] Routing in wormhole-switched clustered networks with applications to fault-tolerance
    Halwan, V
    Ozguner, F
    [J]. 1998 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - PROCEEDINGS, 1998, : 114 - 121
  • [4] The global fault-tolerance of interconnection networks
    Harutyunyan, Hovhannes A.
    Morosan, Calin D.
    [J]. SNPD 2006: SEVENTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, PROCEEDINGS, 2006, : 171 - +
  • [5] FAULT-TOLERANCE IN A CLASS OF SORTING NETWORKS
    SUN, JL
    CERNY, E
    GECSEI, J
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1994, 43 (07) : 827 - 837
  • [6] Multiplexing schemes for cost-effective fault-tolerance
    Roy, S
    Beiu, V
    [J]. 2004 4TH IEEE CONFERENCE ON NANOTECHNOLOGY, 2004, : 589 - 592
  • [7] Fault-Tolerance Algorithm in Wireless Sensor Networks
    Al-Qadami, Nasser
    Koucheryavy, Andrey
    [J]. INFOCOMMUNICATIONS JOURNAL, 2015, 7 (04): : 28 - 33
  • [8] Fault-tolerance of (n, k)-star networks
    Li, Xiang-Jun
    Xu, Jun-Ming
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2014, 248 : 525 - 530
  • [9] The Structure Fault-Tolerance of Enhanced Hypercube Networks
    Jin, Dan
    Liu, Hong-mei
    [J]. 2018 INTERNATIONAL CONFERENCE ON ELECTRICAL, CONTROL, AUTOMATION AND ROBOTICS (ECAR 2018), 2018, 307 : 235 - 238
  • [10] FAULT-TOLERANCE IN MULTICHANNEL LOCAL AREA NETWORKS
    CAMARDA, P
    GERLA, M
    [J]. EIGHTH ANNUAL INTERNATIONAL PHOENIX CONFERENCE ON COMPUTERS AND COMMUNICATIONS: 1989 CONFERENCE PROCEEDINGS, 1989, : 133 - 137