Greft: Arbitrary Fault-Tolerant Distributed Graph Processing

被引:6
|
作者
Presser, Daniel [1 ]
Lung, Lau Cheuk [1 ]
Correia, Miguel [2 ]
机构
[1] Univ Fed Santa Catarina, Dept Informat & Estat, Florianopolis, SC, Brazil
[2] Univ Lisbon, Inst Super Tecn, INESC ID, Lisbon, Portugal
关键词
D O I
10.1109/BigDataCongress.2015.73
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many large-scale computing problems can be modeled as graphs. Example areas include the web, social networks, and biological systems. The increasing sizes of datasets has led to the creation of various distributed large scale graph processing systems, e.g., Google Pregel. Although these systems tolerate crash faults, the literature suggests they are vulnerable to a wider range of accidental arbitrary faults (also called Byzantine faults). In this paper we present an algorithm and a prototype of a distributed large-scale graph processing system that can tolerate arbitrary faults. The prototype is based on GPS, an open source implementation of Pregel. Experimental results of the prototype in Amazon AWS are presented, showing that it uses only twice the resources of the original implementation, instead of 3-4 times as usual in Byzantine fault-tolerant systems. This cost may be acceptable for critical applications that require this level of fault tolerance.
引用
收藏
页码:452 / 459
页数:8
相关论文
共 50 条
  • [1] Fault-tolerant distributed stream processing system
    Gorawski, Marcin
    Marks, Pawel
    SEVENTEENTH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, : 395 - +
  • [2] An adaptive distributed fault-tolerant routing algorithm for the star graph
    Bai, LQ
    Ebara, H
    Nakano, H
    Maeda, H
    ALGORITHMS AND COMPUTATION, PROCEEDINGS, 1997, 1350 : 62 - 71
  • [3] Minimizing Latency in Fault-Tolerant Distributed Stream Processing Systems
    Brito, Andrey
    Fetzer, Christof
    Felber, Pascal
    2009 29TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 2009, : 173 - +
  • [5] Fault-Tolerant Distributed Reconnaissance
    Lauf, Adrian P.
    Robinson, William H.
    MILITARY COMMUNICATIONS CONFERENCE, 2010 (MILCOM 2010), 2010, : 1812 - 1817
  • [6] Fault-tolerant distributed simulation
    Damani, OP
    Garg, VK
    TWELFTH WORKSHOP ON PARALLEL AND DISTRIBUTED SIMULATION - PADS'98, PROCEEDINGS, 1998, : 38 - 45
  • [7] Fault-tolerant routing in the star graph
    Rezazad, SM
    Sarbazi-Azad, H
    18TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 2 (REGULAR PAPERS), PROCEEDINGS, 2004, : 503 - 506
  • [8] Fault-tolerant broadcasting on the arrangement graph
    Bai, LQ
    Ebara, H
    Nakano, H
    Maeda, H
    COMPUTER JOURNAL, 1998, 41 (03): : 171 - 184
  • [9] Ares: a High Performance and Fault-tolerant Distributed Stream Processing System
    Lin, Changfu
    Zhan, Jingjing
    Chen, Hanhua
    Tan, Jie
    Jin, Hai
    2018 IEEE 26TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2018, : 176 - 186
  • [10] Economical and Fault-Tolerant Load Balancing in Distributed Stream Processing Systems
    Xiao, Fuyuan
    Kitasuka, Teruaki
    Aritsugi, Masayoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (04): : 1062 - 1073