Failure Detectors in Homonymous Distributed Systems (with an Application to Consensus)

被引:6
|
作者
Arevalo, Sergio [1 ]
Fernandez Anta, Antonio [2 ]
Imbs, Damien [4 ]
Jimenez, Ernesto [1 ]
Raynal, Michel [3 ,4 ]
机构
[1] Univ Politecn Madrid, EUI, Madrid 28031, Spain
[2] Inst IMDEA Netwoks, Madrid 28918, Spain
[3] Inst Univ France, Paris, France
[4] IRISA, F-35042 Rennes, France
基金
中国国家自然科学基金;
关键词
Agreement problem; Asynchrony; Consensus; Distributed computability; failure detector; Homonymous system; Message-passing; Process crash; ANONYMOUS NETWORKS;
D O I
10.1109/ICDCS.2012.13
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper is on homonymous distributed systems where processes are prone to crash failures and have no initial knowledge of the system membership ("homonymous" means that several processes may have the same identifier). New classes of failure detectors suited to these systems are first defined. Among them, the classes H Omega and H Sigma are introduced that are the homonymous counterparts of the classes Omega and Sigma, respectively. (Recall that the pair <Omega, Sigma > defines the weakest failure detector to solve consensus.) Then, the paper shows how H Omega and H Sigma can be implemented in homonymous systems without membership knowledge (under different synchrony requirements). Finally, two algorithms are presented that use these failure detectors to solve consensus in homonymous asynchronous systems where there is no initial knowledge of the membership. One algorithm solves consensus with < H Omega, H Sigma >, while the other uses only H Omega, but needs a majority of correct processes. Observe that the systems with unique identifiers and anonymous systems are extreme cases of homonymous systems from which follows that all these results also apply to these systems. Interestingly, the new failure detector class H Omega can be implemented with partial synchrony, while the analogous class A Omega defined for anonymous systems can not be implemented (even in synchronous systems). Hence, the paper provides us with the first proof showing that consensus can be solved in anonymous systems with only partial synchrony (and a majority of correct processes).
引用
收藏
页码:275 / 284
页数:10
相关论文
共 50 条
  • [1] Failure detectors in homonymous distributed systems (with an application to consensus)
    Arevalo, Sergio
    Fernandez Anta, Antonio
    Imbs, Damien
    Jimenez, Ernesto
    Raynal, Michel
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 83 : 83 - 95
  • [2] Unreliable failure detectors for reliable distributed systems
    Chandra, TD
    Toueg, S
    [J]. JOURNAL OF THE ACM, 1996, 43 (02) : 225 - 267
  • [3] Failure detectors for large-scale distributed systems
    Hayashibara, N
    Cherif, A
    Katayama, T
    [J]. 21ST IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2002, : 404 - 409
  • [4] Consensus in Anonymous Distributed Systems: Is There a Weakest Failure Detector?
    Bonnet, Francois
    Raynal, Michel
    [J]. 2010 24TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2010, : 206 - 213
  • [5] Leader election in asynchronous distributed systems with unreliable failure detectors
    Park, SH
    Yamashita, M
    [J]. PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 687 - 693
  • [6] QoS Self-configuring Failure Detectors for Distributed Systems
    de Sa, Alirio Santos
    de Araujo Macedo, Raimundo Jose
    [J]. DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS, PROCEEDINGS, 2010, 6115 : 126 - 140
  • [7] Computing global functions in asynchronous distributed systems with perfect failure detectors
    Hélary, JM
    Hurfin, M
    Mostefaoui, A
    Raynal, M
    Tronel, F
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2000, 11 (09) : 897 - 909
  • [8] On termination detection in crash-prone distributed systems with failure detectors
    Mittal, Neeraj
    Freiling, Felix C.
    Venkatesan, S.
    Penso, Lucia Draque
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (06) : 855 - 875
  • [9] Hybrid binary consensus in anonymous asynchronous systems using coins and failure detectors
    Ernesto Jiménez
    José Luis López-Presa
    Javier Martín-Rueda
    [J]. The Journal of Supercomputing, 2019, 75 : 8262 - 8292
  • [10] Hybrid binary consensus in anonymous asynchronous systems using coins and failure detectors
    Jimenez, Ernesto
    Luis Lopez-Presa, Jose
    Martin-Rueda, Javier
    [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (12): : 8262 - 8292