Tuning the granularity of parallelism for distributed graph processing

被引:0
|
作者
Luo, Xinyuan [1 ]
Wu, Sai [1 ]
Wang, Wei [2 ]
Shou, Lidan [1 ]
机构
[1] Zhejiang Univ, Dept Comp Sci, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Tech Inst Econ, Dept Digital Informat Technol, Hangzhou, Zhejiang, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Graph processing; Pregel; Parallelism model; Vertex-centric model; Synchronization model; FRAMEWORK;
D O I
10.1007/s10619-017-7195-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Popular distributed graph processing frameworks, such as Pregel and GraphLab, are based on the vertex-centric computation model, where users write their customized Compute function for each vertex to process the data iteratively. Vertices are evenly partitioned among the compute nodes, and the granularity of parallelism of the graph algorithm is normally tuned by adjusting the number of compute nodes. Vertex-centric model splits the computation into phases. Inside one specific phase, the computation proceeds as an embarrassingly parallel process, because no communication between compute nodes incurs. By default, current graph engine only handles one iteration of the algorithm in a phase. However, in this paper, we find that we can also tune the granularity of parallelism, by aggregating the computation of multiple iterations into one phase, which has a significant impact on the performance of the graph algorithm. In the ideal case, if all computations are handled in one phase, the whole algorithm turns into an embarrassingly parallel algorithm and the benefit of parallelism is maximized. Based on this observation, we propose two approaches, a function-based approach and a parameter-based approach, to automatically transform a Pregel algorithm into a new one with tunable granularity of parallelism. We study the cost of such transformation and the trade-off between the granularity of parallelism and the performance. We provide a new direction to tune the performance of parallel algorithms. Finally, the approaches are implemented in our graph processing system, N2, and we illustrate their performance using popular graph algorithms.
引用
收藏
页码:117 / 148
页数:32
相关论文
共 50 条
  • [41] Granularity Analysis for Exploiting Adaptive Parallelism of Declarative Programs on Multiprocessors
    田新民
    王鼎兴
    沈美明
    郑纬民
    温冬婵
    [J]. Journal of Computer Science & Technology, 1994, (02) : 144 - 152
  • [42] Parallelism and granularity in time dependent approaches to reactive scattering calculations
    Piermarini, V
    Laganà, A
    Balint-Kurti, GG
    Smith, L
    Allan, RJ
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 2879 - 2884
  • [43] A methodology for granularity-based control of parallelism in logic programs
    Lopez, P
    Hermenegildo, M
    Debray, S
    [J]. JOURNAL OF SYMBOLIC COMPUTATION, 1996, 21 (4-6) : 715 - 734
  • [44] Automatic Performance Tuning for Distributed Data Stream Processing Systems
    Herodotou, Herodotos
    Odysseos, Lambros
    Chen, Yuxing
    Lu, Jiaheng
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 3194 - 3197
  • [45] MEASURE OF PARALLELISM OF DISTRIBUTED COMPUTATIONS
    CHARRONBOST, B
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1989, 349 : 434 - 445
  • [46] Graph Active Learning at Subgraph Granularity
    Cao, Yunqi
    Wang, Ziming
    Chen, Haopeng
    [J]. 2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 578 - 585
  • [47] A DISTRIBUTED PROLOG SYSTEM WITH AND PARALLELISM
    CARLTON, M
    VANROY, P
    [J]. IEEE SOFTWARE, 1988, 5 (01) : 43 - 51
  • [48] A Distributed Multi-GPU System for Fast Graph Processing
    Jia, Zhihao
    Kwon, Yongkee
    Shipman, Galen
    McCormick, Pat
    Erez, Mattan
    Aiken, Alex
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 11 (03): : 297 - 310
  • [49] LightGraph: Lighten Communication in Distributed Graph-Parallel Processing
    Zhao, Yue
    Yoshigoe, Kenji
    Xie, Mengjun
    Zhou, Suijian
    Seker, Remzi
    Bian, Jiang
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 717 - 724
  • [50] Study of attack graph construction based on distributed parallel processing
    [J]. Ma, J.-C., 1600, China Ordnance Society (33):