Distributed temporal graph analytics with GRADOOP

被引:0
|
作者
Christopher Rost
Kevin Gomez
Matthias Täschner
Philip Fritzsche
Lucas Schons
Lukas Christ
Timo Adameit
Martin Junghanns
Erhard Rahm
机构
[1] University of Leipzig & ScaDS.AI Dresden/Leipzig,
[2] Neo4j,undefined
[3] Inc.,undefined
来源
The VLDB Journal | 2022年 / 31卷
关键词
Graph processing; Temporal graph; Distributed processing; Graph analytics; Bitemporal graph model;
D O I
暂无
中图分类号
学科分类号
摘要
Temporal property graphs are graphs whose structure and properties change over time. Temporal graph datasets tend to be large due to stored historical information, asking for scalable analysis capabilities. We give a complete overview of Gradoop, a graph dataflow system for scalable, distributed analytics of temporal property graphs which has been continuously developed since 2005. Its graph model TPGM allows bitemporal modeling not only of vertices and edges but also of graph collections. A declarative analytical language called GrALa allows analysts to flexibly define analytical graph workflows by composing different operators that support temporal graph analysis. Built on a distributed dataflow system, large temporal graphs can be processed on a shared-nothing cluster. We present the system architecture of Gradoop, its data model TPGM with composable temporal graph operators, like snapshot, difference, pattern matching, graph grouping and several implementation details. We evaluate the performance and scalability of selected operators and a composed workflow for synthetic and real-world temporal graphs with up to 283 M vertices and 1.8 B edges, and a graph lifetime of about 8 years with up to 20 M new edges per year. We also reflect on lessons learned from the Gradoop effort.
引用
收藏
页码:375 / 401
页数:26
相关论文
共 50 条
  • [41] Graph Analytics for Signature Discovery
    Hogan, Emilie
    Johnson, John R.
    Halappanavar, Mahantesh
    Lo, Chaomei
    2013 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS: BIG DATA, EMERGENT THREATS, AND DECISION-MAKING IN SECURITY INFORMATICS, 2013, : 315 - 320
  • [42] Quantifying Communication in Graph Analytics
    Anghel, Andreea
    Rodriguez, German
    Prisacari, Bogdan
    Minkenberg, Cyriel
    Dittmann, Gero
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2015, 2015, 9137 : 472 - 487
  • [43] GRAFS: Declarative Graph Analytics
    Houshmand, Farzin
    Lesani, Mohsen
    Vora, Keval
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2021, 5
  • [44] Big Graph Analytics Systems
    Yan, Da
    Bu, Yingyi
    Tian, Yuanyuan
    Deshpande, Amol
    Cheng, James
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2241 - 2243
  • [45] Graph signatures for visual analytics
    Wong, Pak Chung
    Foote, Harlan
    Chin, George, Jr.
    Mackey, Patrick
    Perrine, Ken
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (06) : 1399 - 1413
  • [46] Big graph visual analytics
    Haglin, David
    Trimm, David
    Wong, Pak Chung
    INFORMATION VISUALIZATION, 2017, 16 (03) : 155 - 156
  • [47] Big Graph Analytics Platforms
    不详
    FOUNDATIONS AND TRENDS IN DATABASES, 2015, 7 (1-2): : 2 - +
  • [48] A Lightweight Infrastructure for Graph Analytics
    Nguyen, Donald
    Lenharth, Andrew
    Pingali, Keshav
    SOSP'13: PROCEEDINGS OF THE TWENTY-FOURTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, 2013, : 456 - 471
  • [49] Essentials of Parallel Graph Analytics
    Osama, Muhammad
    Porumbescu, Serban D.
    Owens, John D.
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 314 - 317
  • [50] Semantic Property Graph for Scalable Knowledge Graph Analytics
    Purohit, Sumit
    Nhuy Van
    Chin, George
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 2672 - 2677