Tuning the granularity of parallelism for distributed graph processing

被引:0
|
作者
Luo, Xinyuan [1 ]
Wu, Sai [1 ]
Wang, Wei [2 ]
Shou, Lidan [1 ]
机构
[1] Zhejiang Univ, Dept Comp Sci, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Tech Inst Econ, Dept Digital Informat Technol, Hangzhou, Zhejiang, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Graph processing; Pregel; Parallelism model; Vertex-centric model; Synchronization model; FRAMEWORK;
D O I
10.1007/s10619-017-7195-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Popular distributed graph processing frameworks, such as Pregel and GraphLab, are based on the vertex-centric computation model, where users write their customized Compute function for each vertex to process the data iteratively. Vertices are evenly partitioned among the compute nodes, and the granularity of parallelism of the graph algorithm is normally tuned by adjusting the number of compute nodes. Vertex-centric model splits the computation into phases. Inside one specific phase, the computation proceeds as an embarrassingly parallel process, because no communication between compute nodes incurs. By default, current graph engine only handles one iteration of the algorithm in a phase. However, in this paper, we find that we can also tune the granularity of parallelism, by aggregating the computation of multiple iterations into one phase, which has a significant impact on the performance of the graph algorithm. In the ideal case, if all computations are handled in one phase, the whole algorithm turns into an embarrassingly parallel algorithm and the benefit of parallelism is maximized. Based on this observation, we propose two approaches, a function-based approach and a parameter-based approach, to automatically transform a Pregel algorithm into a new one with tunable granularity of parallelism. We study the cost of such transformation and the trade-off between the granularity of parallelism and the performance. We provide a new direction to tune the performance of parallel algorithms. Finally, the approaches are implemented in our graph processing system, N2, and we illustrate their performance using popular graph algorithms.
引用
收藏
页码:117 / 148
页数:32
相关论文
共 50 条
  • [31] Learning of robust spectral graph dictionaries for distributed processing
    Dorina Thanou
    Pascal Frossard
    [J]. EURASIP Journal on Advances in Signal Processing, 2018
  • [32] Performance Modeling and Task Scheduling in Distributed Graph Processing
    Presser, Daniel
    Siqueira, Frank
    Reina, Fabio
    [J]. 2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 135 - 142
  • [33] Distributed and Cooperative Task Processing: Cournot Oligopolies on a Graph
    Pavlic, Theodore P.
    Passino, Kevin M.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (06) : 774 - 784
  • [34] CoRAL: Confined Recovery in Distributed Asynchronous Graph Processing
    Vora, Keval
    Tian, Chen
    Gupta, Rajiv
    Hu, Ziang
    [J]. ACM SIGPLAN NOTICES, 2017, 52 (04) : 223 - 236
  • [35] Learning of robust spectral graph dictionaries for distributed processing
    Thanou, Dorina
    Frossard, Pascal
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2018,
  • [36] Estimating the Impact of Communication Schemes for Distributed Graph Processing
    Ye, Tian
    Kuppannagari, Sanmukh R.
    De Rose, Cesar A. F.
    Wijeratne, Sasindu
    Kannan, Rajgopal
    Prasanna, Viktor K.
    [J]. 2022 21ST INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC 2022), 2022, : 49 - 56
  • [37] CoRAL: Confined Recovery in Distributed Asynchronous Graph Processing
    Vora, Keval
    Tian, Chen
    Gupta, Rajiv
    Hu, Ziang
    [J]. OPERATING SYSTEMS REVIEW, 2017, 51 (02) : 223 - 236
  • [38] A Distributed Graph Partitioning Algorithm for Processing Large Graphs
    Chen, Tefeng
    Li, Bo
    [J]. PROCEEDINGS 2016 IEEE SYMPOSIUM ON SERVICE-ORIENTED SYSTEM ENGINEERING SOSE 2016, 2016, : 71 - 77
  • [39] An Experimental Comparison of Partitioning Strategies in Distributed Graph Processing
    Verma, Shiv
    Leslie, Luke M.
    Shin, Yosub
    Gupta, Indranil
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (05): : 493 - 504
  • [40] PARALLELISM IN GRAPH-PARTITIONING
    SAVAGE, JE
    WLOKA, MG
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1991, 13 (03) : 257 - 272