On verifying fault tolerance of distributed protocols

被引:0
|
作者
Fisman, Dana [1 ]
Kupferman, Orna [1 ]
Lustig, Yoad [1 ]
机构
[1] Hebrew Univ Jerusalem, Sch Comp Sci & Engn, IL-91904 Jerusalem, Israel
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed systems are composed of processes connected in some network. Distributed systems may suffer from faults: processes may stop, be interrupted, or be maliciously attacked. Fault-tolerant protocols are designed to be resistant to faults. Proving the resistance of protocols to faults is a very challenging problem, as it combines the parameterized setting that distributed systems are based-on, with the need to consider a hostile environment that produces the faults. Considering all the possible fault scenarios for a protocol is very difficult. Thus, reasoning about fault-tolerance protocols utterly needs formal methods. In this paper we describe a framework for verifying the fault tolerance of (synchronous or asynchronous) distributed protocols. In addition to the description of the protocol and the desired behavior, the user provides the fault type (e.g., fail-stop, Byzantine) and its distribution (e.g., at most half of the processes are faulty). Our framework is based on augmenting the description of the configurations of the system by a mask describing which processes are faulty. We focus on regular model checking and show how it is possible to compile the input for the model-checking problem to one that takes the faults and their distribution into an account, and perform regular model-checking on the compiled input. We demonstrate the effectiveness of our framework and argue for its generality.
引用
收藏
页码:315 / 331
页数:17
相关论文
共 50 条
  • [41] Dynamic fault tolerance in distributed vehicle systems
    Torlo, M
    Bertram, T
    ELECTRONIC SYSTEMS FOR VEHICLES, 2001, 1646 : 99 - 122
  • [42] On Fault Tolerance for Distributed Iterative Dataflow Processing
    Xu, Chen
    Holzemer, Markus
    Kaul, Manohar
    Soto, Juan
    Markl, Volker
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (08) : 1709 - 1722
  • [43] Ensuring fault-tolerance in distributed media
    Tormasov, AG
    Khasin, MA
    Pakhomov, YI
    PROGRAMMING AND COMPUTER SOFTWARE, 2001, 27 (05) : 245 - 251
  • [44] Ensuring Fault-Tolerance in Distributed Media
    A. G. Tormasov
    M. A. Khasin
    Yu. I. Pakhomov
    Programming and Computer Software, 2001, 27 : 245 - 251
  • [45] A distributed fault-tolerance mechanism in UNIX
    Gantenbein, RE
    Yu, ZJ
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 1996, : 146 - 149
  • [46] Verifying norm compliancy of protocols
    Aldewereld, Huib
    Vazquez-Salceda, Javier
    Dignum, Frank
    Meyer, John-Jules Ch.
    COORDINATION, ORGANIZATIONS, INSTITUTIONS, AND NORMS IN MULTI-AGENT SYSTEMS, 2006, 3913 : 231 - 245
  • [47] The complexity of verifying population protocols
    Javier Esparza
    Stefan Jaax
    Mikhail Raskin
    Chana Weil-Kennedy
    Distributed Computing, 2021, 34 : 133 - 177
  • [48] The complexity of verifying population protocols
    Esparza, Javier
    Jaax, Stefan
    Raskin, Mikhail
    Weil-Kennedy, Chana
    DISTRIBUTED COMPUTING, 2021, 34 (02) : 133 - 177
  • [49] Compiling and verifying security protocols
    Jacquemard, F
    Rusinowitch, M
    Vigneron, L
    LOGIC FOR PROGRAMMING AND AUTOMATED REASONING, PROCEEDINGS, 2000, 1955 : 131 - 160
  • [50] Verifying security protocols with Brutus
    Clarke, EM
    Jha, S
    Marrero, W
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2000, 9 (04) : 443 - 487