Automated constraint-based addition of nonmasking and stabilizing fault-tolerance

被引:8
|
作者
Abujarad, F. [1 ]
Kulkarni, S. S. [2 ]
机构
[1] Yale Univ, Dept Emergency Med, New Haven, CT 06519 USA
[2] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
关键词
Nonmasking fault-tolerance; Stabilization; Program synthesis; Multi-core algorithms; Distributed programs; ALGORITHM;
D O I
10.1016/j.tcs.2011.02.008
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We focus on the constraint-based automated addition of nonmasking and stabilizing fault-tolerance to hierarchical programs. We specify legitimate states of the program in terms of constraints that should be satisfied in those states. To deal with faults that may violate these constraints, we add recovery actions while ensuring interference freedom among the recovery actions added for satisfying different constraints. Since the constraint-based manual design of fault-tolerance is well known, we expect our approach to have a significant benefit in automating the addition of fault-tolerance. We illustrate our algorithm with four case studies: stabilizing mutual exclusion, stabilizing diffusing computation, a data dissemination problem in sensor networks, and tree maintenance. With experimental results, we show that the complexity of our algorithm is reasonable and that it can be reduced using the structure of the hierarchical systems. We also reduced the time complexity of the synthesis using parallelism. We consider two approaches to speedup the synthesis algorithm: first, the use of the multiple constraints that have to be satisfied during synthesis; second, the use of the distributed nature of the programs being synthesized. We show that our approaches provide significant reduction in the synthesis time. To our knowledge, this is the first instance where automated synthesis has been successfully used in synthesizing programs that are correct under fairness assumptions. Moreover, in three of the case studies considered in this paper, the structure of the recovery paths is too complex to permit existing heuristic-based approaches for adding recovery. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:4228 / 4246
页数:19
相关论文
共 50 条
  • [1] Constraint Based Automated Synthesis of Nonmasking and Stabilizing Fault-Tolerance
    Abujarad, Fuad
    Kulkarni, Sandeep S.
    [J]. 2009 28TH IEEE INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2009, : 119 - 128
  • [2] Constraint satisfaction as a basis for designing nonmasking fault-tolerance
    Arora, A
    Gouda, M
    Varghese, G
    [J]. JOURNAL OF HIGH SPEED NETWORKS, 1996, 5 (03) : 293 - 306
  • [3] Designing masking fault-tolerance via nonmasking fault-tolerance
    Arora, A
    Kulkarni, SS
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1998, 24 (06) : 435 - 450
  • [4] Enhancing the fault-tolerance of nonmasking programs
    Kulkarni, SS
    Ebnenasir, A
    [J]. 23RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2002, : 441 - 449
  • [5] Automated Addition of Fault-Tolerance under Synchronous Semantics
    Lin, Yiyan
    Bonakdarpour, Borzoo
    Kulkarni, Sandeep
    [J]. STABILIZATION, SAFETY, AND SECURITY OF DISTRIBUTED SYSTEMS, SSS 2013, 2013, 8255 : 266 - 280
  • [6] Weakest Invariant Generation for Automated Addition of Fault-Tolerance
    Abujarad, Fuad
    Kulkarni, Sandeep S.
    [J]. ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2009, 258 (02) : 3 - 15
  • [7] Automating the addition of fault-tolerance
    Kulkarni, SS
    Arora, A
    [J]. FORMAL TECHNIQUES IN REAL-TIME AND FAULT-TOLERANT SYSTEMS, PROCEEDINGS, 2000, 1926 : 82 - 93
  • [8] Automated Fault-Tolerance Testing
    Nagarajan, Adithya
    Vaddadi, Ajay
    [J]. 2016 IEEE NINTH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW), 2016, : 275 - 276
  • [9] The complexity of automated addition of fault-tolerance without explicit legitimate states
    Fuad Abujarad
    Yiyan Lin
    Borzoo Bonakdarpour
    Sandeep S. Kulkarni
    [J]. Distributed Computing, 2015, 28 : 201 - 219
  • [10] The complexity of automated addition of fault-tolerance without explicit legitimate states
    Abujarad, Fuad
    Lin, Yiyan
    Bonakdarpour, Borzoo
    Kulkarni, Sandeep S.
    [J]. DISTRIBUTED COMPUTING, 2015, 28 (03) : 201 - 219