Specifying and using a partitionable group communication service

被引:35
|
作者
Fekete, A
Lynch, N
Shvartsman, A
机构
[1] Univ Sydney, Basser Dept Comp Sci, Sydney, NSW 2006, Australia
[2] MIT, Comp Sci Lab, Cambridge, MA 02139 USA
[3] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
来源
ACM TRANSACTIONS ON COMPUTER SYSTEMS | 2001年 / 19卷 / 02期
关键词
algorithms; design; performance; verification; group communication protocols; message-passing protocols; conditional performance analysis; total-order broadcast; composable building blocks; service specification; ordered broadcast; distributed algorithms;
D O I
10.1145/377769.377776
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Group communication services are becoming accepted as effective building blocks for the construction of fault-tolerant distributed applications. Many specifications for group communication services have been proposed. However, there is still no agreement about what these specifications should say, especially in cases where the services are partitionable, i.e., where communication failures may lead to simultaneous creation of groups with disjoint memberships, such that each group is unaware of the existence of any other group. In this paper, we present a new, succinct specification for a view-oriented partitionable group communication service. The service associates each message with a particular view of the group membership. All send and receive events for a message occur within the associated view. The service provides a total order on the messages within each view, and each processor receives a prefix of this order. Our specification separates safety requirements from performance and fault-tolerance requirements. The safety requirements are expressed by an abstract, global state machine. To present the performance and fault-tolerance requirements, we include failure-status input actions in the specification; we then give properties saying that consensus on the view and timely message delivery are guaranteed in an execution provided that the execution stabilizes to a situation in which the failure-status stops changing and corresponds to a consistently partitioned system. Because consensus is not required in every execution, the specification is not subject to the existing impossibility results for partitionable systems. Our specification has a simple implementation, based on the membership algorithm of Cristian and Schmuck. We show the utility of the specification by constructing an ordered-broadcast application, using an algorithm (based on algorithms of Amir, Dolev, Keidar, and others) that reconciles information derived from different instantiations of the group. The application manages the view-change activity to build a shared sequence of messages, i.e., the per-view total orders of the group service are combined to give a universal total order. We prove the correctness and analyze the performance and fault-tolerance of the resulting application.
引用
收藏
页码:171 / 216
页数:46
相关论文
共 50 条
  • [1] Group communication in partitionable distributed systems
    Babaoglu, Ö
    Davoli, R
    Montresor, A
    [J]. ADVANCES IN DISTRIBUTED SYSTEMS: ADVANCED DISTRIBUTED COMPUTING: FROM ALGORITHMS TO SYSTEMS, 2000, 1752 : 48 - 78
  • [2] Group communication in partitionable systems:: Specification and algorithms
    Babaoglu, Ö
    Davoli, R
    Montresor, A
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2001, 27 (04) : 308 - 336
  • [3] A protocol for partitionable group membership service in mobile Internet
    Wang, GJ
    Cao, JN
    Chan, KCC
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2005, 5 (07): : 773 - 792
  • [4] Enhancing partitionable group membership service in asynchronous distributed systems
    Sameh, Ahmed
    Mansour, Marwa
    [J]. 2002, Int. Assoc. of Science and Technology for Development (05):
  • [5] Enhancing Partitionable Group Membership Service in asynchronous distributed systems
    Sameh, A
    Mansour, M
    [J]. COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2001, : 239 - 242
  • [6] Constructing applications using the timewheel group communication service
    Mishra, S
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1591 - 1598
  • [7] On the specification of partitionable group membership
    Pleisch, Stefan
    Ruetti, Olivier
    Schiper, Andre
    [J]. EDCC-7: SEVENTH EUROPEAN DEPENDABLE COMPUTING CONFERENCE, PROCEEDINGS, 2008, : 37 - 45
  • [8] Flexibility in group communication service
    Enokido, T
    Wietrzyk, VI
    Takizawa, M
    [J]. 14TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, : 151 - 155
  • [9] Using the heartbeat failure detector for quiescent reliable communication and consensus in partitionable networks
    Aguilera, MK
    Chen, W
    Toueg, S
    [J]. THEORETICAL COMPUTER SCIENCE, 1999, 220 (01) : 3 - 30
  • [10] Execution service for a partitionable low bandwidth network
    Hickey, Takako M.
    van Renesse, Robbert
    [J]. Proceedings - Annual International Conference on Fault-Tolerant Computing, 1999, : 174 - 181