Greft: Arbitrary Fault-Tolerant Distributed Graph Processing

被引:6
|
作者
Presser, Daniel [1 ]
Lung, Lau Cheuk [1 ]
Correia, Miguel [2 ]
机构
[1] Univ Fed Santa Catarina, Dept Informat & Estat, Florianopolis, SC, Brazil
[2] Univ Lisbon, Inst Super Tecn, INESC ID, Lisbon, Portugal
关键词
D O I
10.1109/BigDataCongress.2015.73
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many large-scale computing problems can be modeled as graphs. Example areas include the web, social networks, and biological systems. The increasing sizes of datasets has led to the creation of various distributed large scale graph processing systems, e.g., Google Pregel. Although these systems tolerate crash faults, the literature suggests they are vulnerable to a wider range of accidental arbitrary faults (also called Byzantine faults). In this paper we present an algorithm and a prototype of a distributed large-scale graph processing system that can tolerate arbitrary faults. The prototype is based on GPS, an open source implementation of Pregel. Experimental results of the prototype in Amazon AWS are presented, showing that it uses only twice the resources of the original implementation, instead of 3-4 times as usual in Byzantine fault-tolerant systems. This cost may be acceptable for critical applications that require this level of fault tolerance.
引用
收藏
页码:452 / 459
页数:8
相关论文
共 50 条
  • [41] IMPLEMENTING FAULT-TOLERANT DISTRIBUTED OBJECTS.
    Birman, Kenneth P.
    Joseph, Thomas A.
    Raeuchle, Thomas
    Abbadi, Amr El
    IEEE Transactions on Software Engineering, 1985, SE-11 (06) : 502 - 508
  • [42] DISTRIBUTED RECOVERY IN FAULT-TOLERANT MULTIPROCESSOR NETWORKS
    YANNEY, RM
    HAYES, JP
    IEEE TRANSACTIONS ON COMPUTERS, 1986, 35 (10) : 871 - 879
  • [43] ON RELIABILITY MODELING OF FAULT-TOLERANT DISTRIBUTED SYSTEMS
    THAMBIDURAI, P
    PARK, YK
    TRIVEDI, KS
    9TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 1989, : 136 - 142
  • [44] Challenges in Fault-Tolerant Distributed Runtime Verification
    Bonakdarpour, Borzoo
    Fraigniaud, Pierre
    Rajsbaum, Sergio
    Travers, Corentin
    LEVERAGING APPLICATIONS OF FORMAL METHODS, VERIFICATION AND VALIDATION: DISCUSSION, DISSEMINATION, APPLICATIONS, ISOLA 2016, PT II, 2016, 9953 : 363 - 370
  • [45] A FAULT-TOLERANT COMMUNICATION ARCHITECTURE FOR DISTRIBUTED SYSTEMS
    PRADHAN, DK
    REDDY, SM
    IEEE TRANSACTIONS ON COMPUTERS, 1982, 31 (09) : 863 - 870
  • [46] Fault-Tolerant Mechanism of the Distributed Cluster Computers
    Shang, Yizi
    Jin, Yang
    Wu, Baosheng
    Tsinghua Science and Technology, 2007, 12 (SUPPL. 1): : 186 - 191
  • [47] A FAULT-TOLERANT DISTRIBUTED TERMINATION DETECTION SCHEME
    YE, X
    KEANE, JA
    INFORMATION PROCESSING '94, VOL I: TECHNOLOGY AND FOUNDATIONS, 1994, 51 : 300 - 305
  • [48] Fault-Tolerant Adaptive Parallel and Distributed Simulation
    D'Angelo, Gabriele
    Ferretti, Stefano
    Marzolla, Moreno
    Armaroli, Lorenzo
    2016 IEEE/ACM 20TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), 2016, : 37 - 44
  • [49] Distributed Voting for Fault-Tolerant Nanoscale Systems
    Namazi, Ali
    Nourani, Mehrdad
    2007 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, VOLS, 1 AND 2, 2007, : 569 - 574
  • [50] Secure and fault-tolerant voting in distributed systems
    Hardekopf, B
    Kwiat, K
    Upadhyaya, S
    2001 IEEE AEROSPACE CONFERENCE PROCEEDINGS, VOLS 1-7, 2001, : 1117 - 1126