On verifying fault tolerance of distributed protocols

被引:0
|
作者
Fisman, Dana [1 ]
Kupferman, Orna [1 ]
Lustig, Yoad [1 ]
机构
[1] Hebrew Univ Jerusalem, Sch Comp Sci & Engn, IL-91904 Jerusalem, Israel
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed systems are composed of processes connected in some network. Distributed systems may suffer from faults: processes may stop, be interrupted, or be maliciously attacked. Fault-tolerant protocols are designed to be resistant to faults. Proving the resistance of protocols to faults is a very challenging problem, as it combines the parameterized setting that distributed systems are based-on, with the need to consider a hostile environment that produces the faults. Considering all the possible fault scenarios for a protocol is very difficult. Thus, reasoning about fault-tolerance protocols utterly needs formal methods. In this paper we describe a framework for verifying the fault tolerance of (synchronous or asynchronous) distributed protocols. In addition to the description of the protocol and the desired behavior, the user provides the fault type (e.g., fail-stop, Byzantine) and its distribution (e.g., at most half of the processes are faulty). Our framework is based on augmenting the description of the configurations of the system by a mask describing which processes are faulty. We focus on regular model checking and show how it is possible to compile the input for the model-checking problem to one that takes the faults and their distribution into an account, and perform regular model-checking on the compiled input. We demonstrate the effectiveness of our framework and argue for its generality.
引用
收藏
页码:315 / 331
页数:17
相关论文
共 50 条
  • [1] Verifying fault tolerance of distributed algorithms formally - An example
    Volzer, H
    1998 INTERNATIONAL CONFERENCE ON APPLICATION OF CONCURRENCY TO SYSTEM DESIGN, PROCEEDINGS, 1998, : 187 - 197
  • [2] A Model Checking Method for Verifying the Fault Tolerance of Distributed Protocol Liveness Properties
    Lu C.-Y.
    Nie C.-H.
    Cheung S.C.
    Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (08): : 1714 - 1731
  • [3] Verifying Architectural Variabilities in Software Fault Tolerance Techniques
    Brito, Patrick H. S.
    de Lemos, Rogerio
    Rubira, Cecilia M. F.
    2009 JOINT WORKING IEEE/IFIP CONFERENCE ON SOFTWARE ARCHITECTURE AND EUROPEAN CONFERENCE ON SOFTWARE ARCHITECTURE, 2009, : 231 - +
  • [4] Verifying fault tolerance of concurrent systems by model checking
    Yokogawa, T
    Tsuchiya, T
    Kikuno, T
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2002, E85A (11) : 2414 - 2425
  • [5] Verifying Fault-Tolerance in Probabilistic Swarm Systems
    Lomuscio, Alessio
    Pirovano, Edoardo
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 325 - 331
  • [6] FAULT TOLERANCE IN DISTRIBUTED SYSTEMS
    SCHMITTER, E
    SIEMENS FORSCHUNGS-UND ENTWICKLUNGSBERICHTE-SIEMENS RESEARCH AND DEVELOPMENT REPORTS, 1983, 12 (01): : 34 - 37
  • [7] Fault Tolerance in Distributed Paradigms
    Haider, Sajjad
    Ansari, Naveed Riaz
    Akbar, Muhammad
    Perwez, Mohammad Raza
    Ghori, Khawaja MoyeezUllah
    COMPUTER COMMUNICATION AND MANAGEMENT, 2011, 5 : 587 - 592
  • [8] FAULT TOLERANCE IN DISTRIBUTED UNIX
    BORG, A
    BLAU, W
    OBERLE, W
    GRAETSCH, W
    LECTURE NOTES IN COMPUTER SCIENCE, 1990, 448 : 224 - 243
  • [9] DuoAI: Fast, Automated Inference of Inductive Invariants for Verifying Distributed Protocols
    Yao, Jianan
    Tao, Runzhou
    Gu, Ronghui
    Nieh, Jason
    PROCEEDINGS OF THE 16TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2022, 2022, : 485 - 501
  • [10] A COMPILER THAT INCREASES THE FAULT TOLERANCE OF ASYNCHRONOUS PROTOCOLS
    COAN, BA
    IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (12) : 1541 - 1553