Estimating the Impact of Communication Schemes for Distributed Graph Processing

被引:0
|
作者
Ye, Tian [1 ]
Kuppannagari, Sanmukh R. [1 ]
De Rose, Cesar A. F.
Wijeratne, Sasindu [1 ]
Kannan, Rajgopal [1 ]
Prasanna, Viktor K. [1 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90007 USA
关键词
Distributed Graph Processing; Performance Estimation; Communication Schemes; Cluster Computing;
D O I
10.1109/ISPDC55340.2022.00016
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Extreme scale graph analytics is imperative for several real-world Big Data applications with the underlying graph structure containing millions or billions of vertices and edges. Since such huge graphs cannot fit into the memory of a single computer, distributed processing of the graph is required. Several frameworks have been developed for performing graph processing on distributed systems. The frameworks focus primarily on choosing the right computation model and the partitioning scheme under the assumption that such design choices will automatically reduce the communication overheads. For any computational model and partitioning scheme, communication schemes - the data to be communicated and the virtual interconnection network among the nodes - have significant impact on the performance. To analyze this impact, in this work, we identify widely used communication schemes and estimate their performance. Analyzing the trade-offs between the number of compute nodes and communication costs of various schemes on a distributed platform by brute force experimentation can be prohibitively expensive. Thus, our performance estimation models provide an economic way to perform the analyses given the partitions and the communication scheme as input. We validate our model on a local HPC cluster as well as the cloud hosted NSF Chameleon cluster. Using our estimates as well as the actual measurements, we compare the communication schemes and provide conditions under which one scheme should be preferred over the others.
引用
收藏
页码:49 / 56
页数:8
相关论文
共 50 条
  • [31] CoRAL: Confined recovery in distributed asynchronous graph processing
    Vora, Keval
    Tian, Chen
    Gupta, Rajiv
    Hu, Ziang
    [J]. ACM SIGPLAN Notices, 2017, 52 (04): : 223 - 236
  • [32] Learning of robust spectral graph dictionaries for distributed processing
    Dorina Thanou
    Pascal Frossard
    [J]. EURASIP Journal on Advances in Signal Processing, 2018
  • [33] Performance Modeling and Task Scheduling in Distributed Graph Processing
    Presser, Daniel
    Siqueira, Frank
    Reina, Fabio
    [J]. 2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 135 - 142
  • [34] Distributed and Cooperative Task Processing: Cournot Oligopolies on a Graph
    Pavlic, Theodore P.
    Passino, Kevin M.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (06) : 774 - 784
  • [35] CoRAL: Confined Recovery in Distributed Asynchronous Graph Processing
    Vora, Keval
    Tian, Chen
    Gupta, Rajiv
    Hu, Ziang
    [J]. ACM SIGPLAN NOTICES, 2017, 52 (04) : 223 - 236
  • [36] Learning of robust spectral graph dictionaries for distributed processing
    Thanou, Dorina
    Frossard, Pascal
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2018,
  • [37] CoRAL: Confined Recovery in Distributed Asynchronous Graph Processing
    Vora, Keval
    Tian, Chen
    Gupta, Rajiv
    Hu, Ziang
    [J]. OPERATING SYSTEMS REVIEW, 2017, 51 (02) : 223 - 236
  • [38] A Distributed Graph Partitioning Algorithm for Processing Large Graphs
    Chen, Tefeng
    Li, Bo
    [J]. PROCEEDINGS 2016 IEEE SYMPOSIUM ON SERVICE-ORIENTED SYSTEM ENGINEERING SOSE 2016, 2016, : 71 - 77
  • [39] An Experimental Comparison of Partitioning Strategies in Distributed Graph Processing
    Verma, Shiv
    Leslie, Luke M.
    Shin, Yosub
    Gupta, Indranil
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (05): : 493 - 504
  • [40] Secure group communication schemes for dynamic heterogeneous distributed computing
    Castiglione, Arcangelo
    D'Arco, Paolo
    De Santis, Alfredo
    Russo, Rosario
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 74 : 313 - 324