Probabilistic and temporal failure detectors for solving distributed problems

被引:1
|
作者
Guerraoui, Rachid [1 ]
Kozhaya, David [2 ]
Pignolet, Yvonne-Anne [3 ]
机构
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[2] ABB Res, Baden, Switzerland
[3] DFINITY, Zurich, Switzerland
关键词
Failure detectors; Probabilistic links; Message loss; Consensus; Modular algorithms; CONSENSUS;
D O I
10.1016/j.jpdc.2021.07.017
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Failure detectors (FD)s are celebrated for their modularity in solving distributed problems. Algorithms are constructed using FD building blocks. Synchrony assumptions to implement FDs are studied separately and are typically expressed as eventual guarantees that need to hold, after some point in time, foreverand deterministically. But in practice, they may hold only probabilistically and temporarily. This paper studies FDs in a realistic system N, where asynchrony is inflicted by probabilistic synchronous communication. We first address a problem with lozenge S, the weakest FD to solve consensus: an implementation of "consensus with probability 1" is possible in Nwithout randomness in the algorithm, while an implementation of "lozenge Swith probability 1" is impossible in N. We introduce lozenge S*, a new FD with probabilistic and temporal accuracy. We prove that lozenge S*(i) is implementable in Nand (ii) can replace lozenge S, in several existing deterministic consensus algorithms that use lozenge S, to yield an algorithm that solves "consensus with probability 1". We extend our results to other FD classes, e.g., lozenge P, and to a larger set of problems (beyond consensus), which we call decisive problems. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [21] A Distributed Immune Algorithm for Solving Optimization Problems
    Oszust, Mariusz
    Wysocki, Marian
    [J]. INTELLIGENT DISTRIBUTED COMPUTING, SYSTEMS AND APPLICATIONS, 2008, 162 : 147 - 155
  • [22] A distributed algorithm for solving quadratic optimization problems
    Jahvani, Mohammad
    Guay, Martin
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2024, 189
  • [23] Distributed algorithms for solving the convex feasibility problems
    Kaihong LU
    Gangshan JING
    Long WANG
    [J]. Science China(Information Sciences), 2020, 63 (08) : 242 - 244
  • [24] A distributed algorithm for solving mixed equilibrium problems
    Lu, Kaihong
    Jing, Gangshan
    Wang, Long
    [J]. AUTOMATICA, 2019, 105 : 246 - 253
  • [25] Failure detectors in homonymous distributed systems (with an application to consensus)
    Arevalo, Sergio
    Fernandez Anta, Antonio
    Imbs, Damien
    Jimenez, Ernesto
    Raynal, Michel
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 83 : 83 - 95
  • [26] Failure Detectors in Homonymous Distributed Systems (with an Application to Consensus)
    Arevalo, Sergio
    Fernandez Anta, Antonio
    Imbs, Damien
    Jimenez, Ernesto
    Raynal, Michel
    [J]. 2012 IEEE 32ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2012, : 275 - 284
  • [27] A methodology to design arbitrary failure detectors for distributed protocols
    Baldoni, Roberto
    Helary, Jean-Michel
    Piergiovanni, Sara Tucci
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2008, 54 (07) : 619 - 637
  • [28] Improving the Robustness of Distributed Failure Detectors in Adverse Conditions
    Lemos, F. T. C.
    Sato, L. M.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2012, 10 (01) : 1364 - 1369
  • [29] Failure detectors for large-scale distributed systems
    Hayashibara, N
    Cherif, A
    Katayama, T
    [J]. 21ST IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2002, : 404 - 409
  • [30] Using Temporal Awareness to Improve Distributed Problem Solving
    Samadidana, Saeid
    Mailler, Roger
    [J]. 2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 342 - 348