Profiling distributed graph processing systems through visual analytics

被引:4
|
作者
Arleo, Alessio [1 ]
Didimo, Walter [1 ]
Liotta, Giuseppe [1 ]
Montecchiani, Fabrizio [1 ]
机构
[1] Univ Perugia, Dipartimento Ingn, Perugia, Italy
关键词
Distributed platforms; Apache giraph; Vertex-centric frameworks; Profiling; Anomaly detection; Visual analytics; MULTIPLE TIME-SERIES; SOCIAL NETWORKS; VISUALIZATION; ALGORITHMS;
D O I
10.1016/j.future.2018.04.067
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Analyzing large-scale graphs provides valuable insights in different application scenarios, including social networking, crime detection, content ranking, and recommendations. While many graph processing systems working on top of distributed infrastructures have been proposed to deal with big graphs, the task of profiling their massive computations remains time consuming and error-prone. This paper presents GiViP, a visual profiler for distributed graph processing systems based on a Pregel-like computation model. GiViP captures the huge amount of messages exchanged throughout a computation and provides a powerful user interface for the visual analysis of the collected data. We discuss the effectiveness of our approach and show how to take advantage of GiViP to detect anomalies related to the computation and to the infrastructure, such as slow computing units, anomalous message patterns, unbalanced graph partitions, and links with high latency. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:43 / 57
页数:15
相关论文
共 50 条
  • [41] Visual Analytics Solution for Scheduling Processing Phases
    Thomas, J. Joshua
    Belaton, Bahari
    Khader, Ahamad Tajudin
    Justtina
    [J]. INTELLIGENT COMPUTING & OPTIMIZATION, 2019, 866 : 395 - 408
  • [42] Troubleshooting Distributed Data Analytics Systems
    Pi, Aidi
    [J]. MIDDLEWARE'19: PROCEEDINGS OF THE 2019 20TH INTERNATIONAL MIDDLEWARE CONFERENCE DOCTORAL SYMPOSIUM, 2019, : 9 - 13
  • [43] Processing Concurrent Graph Analytics with Decoupled Computation Model
    Xue, Jilong
    Yang, Zhi
    Hou, Shian
    Dai, Yafei
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (05) : 876 - 890
  • [44] Big Data Analytics Using Graph Signal Processing
    Amin, Farhan
    Barukab, Omar M.
    Choi, Gyu Sang
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 489 - 502
  • [45] Data Replication for Distributed Graph Processing
    Ho, Li-Yung
    Wu, Jan-Jan
    Liu, Pangfeng
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 319 - 326
  • [46] An Elasticity Study of Distributed Graph Processing
    Au, Sietse
    Uta, Alexandru
    Ilyushkin, Alexey
    Iosup, Alexandru
    [J]. 2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 382 - 383
  • [47] Neural processing in distributed visual systems - many eyes, many solutions
    Chappell, D. R.
    Speiser, D., I
    [J]. INTEGRATIVE AND COMPARATIVE BIOLOGY, 2020, 60 : E35 - E35
  • [48] iPartition: a distributed partitioning algorithm for block-centric graph processing systems
    Sagharichian, Masoud
    Langouri, Morteza Alipour
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (18): : 21116 - 21143
  • [49] Efficient Fault-tolerance for Iterative Graph Processing on Distributed Dataflow Systems
    Xu, Chen
    Holzemer, Markus
    Kaul, Manohar
    Markl, Volker
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 613 - 624
  • [50] Fast Failure Recovery in Vertex-Centric Distributed Graph Processing Systems
    Lu, Wei
    Shen, Yanyan
    Wang, Tongtong
    Zhang, Meihui
    Jagadish, H. V.
    Du, Xiaoyong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (04) : 733 - 746