Distributed temporal graph analytics with GRADOOP

Cited by: 13
Authors
Rost, Christopher [1, 2]
Gomez, Kevin [1, 2]
Taeschner, Matthias [1, 2]
Fritzsche, Philip [1, 2]
Schons, Lucas [1, 2]
Christ, Lukas [1, 2]
Adameit, Timo [1, 2]
Junghanns, Martin [3]
Rahm, Erhard [1, 2]
Affiliations
[1] Univ Leipzig, Leipzig, Germany
[2] ScaDS AI Dresden Leipzig, Leipzig, Germany
[3] Neo4j Inc, San Mateo, CA USA
Source
VLDB JOURNAL | 2022 / Vol. 31 / Issue 02
Keywords
Graph processing; Temporal graph; Distributed processing; Graph analytics; Bitemporal graph model; PARALLEL;
DOI
10.1007/s00778-021-00667-4
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Temporal property graphs are graphs whose structure and properties change over time. Temporal graph datasets tend to be large due to stored historical information, asking for scalable analysis capabilities. We give a complete overview of Gradoop, a graph dataflow system for scalable, distributed analytics of temporal property graphs which has been continuously developed since 2005. Its graph model TPGM allows bitemporal modeling not only of vertices and edges but also of graph collections. A declarative analytical language called GrALa allows analysts to flexibly define analytical graph workflows by composing different operators that support temporal graph analysis. Built on a distributed dataflow system, large temporal graphs can be processed on a shared-nothing cluster. We present the system architecture of Gradoop, its data model TPGM with composable temporal graph operators, like snapshot, difference, pattern matching, graph grouping and several implementation details. We evaluate the performance and scalability of selected operators and a composed workflow for synthetic and real-world temporal graphs with up to 283 M vertices and 1.8 B edges, and a graph lifetime of about 8 years with up to 20 M new edges per year. We also reflect on lessons learned from the Gradoop effort.
Pages: 375-401
Page count: 27