Profiling distributed graph processing systems through visual analytics

被引:4
|
作者
Arleo, Alessio [1 ]
Didimo, Walter [1 ]
Liotta, Giuseppe [1 ]
Montecchiani, Fabrizio [1 ]
机构
[1] Univ Perugia, Dipartimento Ingn, Perugia, Italy
关键词
Distributed platforms; Apache giraph; Vertex-centric frameworks; Profiling; Anomaly detection; Visual analytics; MULTIPLE TIME-SERIES; SOCIAL NETWORKS; VISUALIZATION; ALGORITHMS;
D O I
10.1016/j.future.2018.04.067
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Analyzing large-scale graphs provides valuable insights in different application scenarios, including social networking, crime detection, content ranking, and recommendations. While many graph processing systems working on top of distributed infrastructures have been proposed to deal with big graphs, the task of profiling their massive computations remains time consuming and error-prone. This paper presents GiViP, a visual profiler for distributed graph processing systems based on a Pregel-like computation model. GiViP captures the huge amount of messages exchanged throughout a computation and provides a powerful user interface for the visual analysis of the collected data. We discuss the effectiveness of our approach and show how to take advantage of GiViP to detect anomalies related to the computation and to the infrastructure, such as slow computing units, anomalous message patterns, unbalanced graph partitions, and links with high latency. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:43 / 57
页数:15
相关论文
共 50 条
  • [1] Datalography: Scaling Datalog Graph Analytics on Graph Processing Systems
    Moustafa, Walaa Eldin
    Papavasileiou, Vicky
    Yocum, Ken
    Deutsch, Alin
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 56 - 65
  • [2] Distributed Graph Analytics
    Srikant, Y. N.
    [J]. DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY (ICDCIT 2020), 2020, 11969 : 3 - 20
  • [3] A Visual Analytics Framework for Distributed Data Analysis Systems
    Nayeem, Abdullah-Al-Raihan
    Elshambakey, Mohammed
    Dobbs, Todd
    Lee, Huikyo
    Crichton, Daniel
    Zhu, Yimin
    Chokwitthaya, Chanachok
    Tolone, William J.
    Cho, Isaac
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 229 - 240
  • [4] The Taxonomy of Distributed Graph Analytics
    Rao, T. Ramalingeswara
    Mitra, Pabitra
    Goswami, A.
    [J]. 2018 FIFTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2018, : 315 - 322
  • [5] Graph Sampling for Visual Analytics
    Zhang, Fangyan
    Zhang, Song
    Wong, Pak Chung
    [J]. JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2017, 61 (04)
  • [6] Graph signatures for visual analytics
    Wong, Pak Chung
    Foote, Harlan
    Chin, George, Jr.
    Mackey, Patrick
    Perrine, Ken
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (06) : 1399 - 1413
  • [7] Big graph visual analytics
    Haglin, David
    Trimm, David
    Wong, Pak Chung
    [J]. INFORMATION VISUALIZATION, 2017, 16 (03) : 155 - 156
  • [8] Graph Colouring as a Challenge Problem for Dynamic Graph Processing on Distributed Systems
    Sallinen, Scott
    Iwabuchi, Keita
    Poudel, Suraj
    Gokhale, Maya
    Ripeanu, Matei
    Pearce, Roger
    [J]. SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 347 - 358
  • [9] Fast Failure Recovery in Distributed Graph Processing Systems
    Shen, Yanyan
    Gang Chen
    Jagadish, H. V.
    Wei Lu
    Ooi, Beng Chin
    Tudor, Bogdan Marius
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (04): : 437 - 448
  • [10] Distributed temporal graph analytics with GRADOOP
    Rost, Christopher
    Gomez, Kevin
    Taeschner, Matthias
    Fritzsche, Philip
    Schons, Lucas
    Christ, Lukas
    Adameit, Timo
    Junghanns, Martin
    Rahm, Erhard
    [J]. VLDB JOURNAL, 2022, 31 (02): : 375 - 401