Tuning the granularity of parallelism for distributed graph processing

Authors
Luo, Xinyuan [1]
Wu, Sai [1]
Wang, Wei [2]
Shou, Lidan [1]
Affiliations
[1] Zhejiang Univ, Dept Comp Sci, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Tech Inst Econ, Dept Digital Informat Technol, Hangzhou, Zhejiang, Peoples R China
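Funding
National High Technology Research and Development Program of China (863 Program); National Natural Science Foundation of China
Keywords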
Graph processing; Pregel; Parallelism model; Vertex-centric model; Synchronization model; FRAMEWORK;
DOI
10.1007/s10619-017-7195-z
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Popular distributed graph processing frameworks, such as Pregel and GraphLab, are based on the vertex-centric computation model, in which users write a customized Compute function that each vertex runs to process its data iteratively. Vertices are evenly partitioned among the compute nodes, and the granularity of parallelism of a graph algorithm is normally tuned by adjusting the number of compute nodes. The vertex-centric model splits the computation into phases. Within a phase, the computation is embarrassingly parallel, because no communication occurs between compute nodes. By default, current graph engines handle only one iteration of the algorithm per phase. In this paper, however, we find that the granularity of parallelism can also be tuned by aggregating the computation of multiple iterations into one phase, which has a significant impact on the performance of the graph algorithm. In the ideal case, if all computations are handled in one phase, the whole algorithm becomes embarrassingly parallel and the benefit of parallelism is maximized. Based on this observation, we propose two approaches, a function-based approach and a parameter-based approach, to automatically transform a Pregel algorithm into a new one with tunable granularity of parallelism. We study the cost of this transformation and the trade-off between the granularity of parallelism and performance, which provides a new direction for tuning the performance of parallel algorithms. Finally, the approaches are implemented in our graph processing system, N2, and we illustrate their performance using popular graph algorithms.
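The trade-off described in the abstract can be illustrated with a small toy sketch. It is not the paper's function-based or parameter-based transformation, and it is not the N2 API. It assumes a simple min-label propagation (connected components) algorithm, for which running several local iterations between global exchanges is safe because taking a minimum is monotone and idempotent. The function name `min_label_propagation`, the parameter `iters_per_phase`, and the contiguous block partitioning are all hypothetical choices made for this illustration.

```python
from collections import defaultdict


def min_label_propagation(num_vertices, edges, num_parts, iters_per_phase):
    """Toy single-process simulation of phased min-label propagation.

    Each simulated partition runs up to `iters_per_phase` local iterations
    per phase. Messages that cross partitions are buffered and delivered
    only at the phase boundary (the simulated barrier).
    Returns (labels, number_of_phases).
    """
    if iters_per_phase < 1:
        raise ValueError("iters_per_phase must be at least 1")

    # Undirected adjacency lists.
    adj = defaultdict(list)
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)

    def part(v):
        # Hypothetical contiguous block partitioning.
        return v * num_parts // num_vertices

    labels = list(range(num_vertices))  # every vertex starts with its own id
    # frontier[p]: vertices of partition p that still have to send their label
    frontier = [set() for _ in range(num_parts)]
    for v in range(num_vertices):
        frontier[part(v)].add(v)

    inbox = defaultdict(list)  # cross-partition messages from the last phase
    phases = 0

    while inbox or any(frontier):
        phases += 1
        outbox = defaultdict(list)
        # Partitions are independent inside a phase: partition p reads and
        # writes only labels of its own vertices, plus the outbox buffer.
        for p in range(num_parts):
            # Apply messages delivered at the previous barrier.
            for v, msgs in inbox.items():
                if part(v) == p and min(msgs) < labels[v]:
                    labels[v] = min(msgs)
                    frontier[p].add(v)
            for _ in range(iters_per_phase):
                if not frontier[p]:
                    break
                local = defaultdict(list)
                for v in frontier[p]:
                    for w in adj[v]:
                        target = local if part(w) == p else outbox
                        target[w].append(labels[v])
                # Synchronous update: labels change only after all sends.
                frontier[p] = set()
                for w, msgs in local.items():
                    if min(msgs) < labels[w]:
                        labels[w] = min(msgs)
                        frontier[p].add(w)
        inbox = outbox  # simulated barrier: deliver cross-partition messages
    return labels, phases


if __name__ == "__main__":
    path = [(i, i + 1) for i in range(7)]  # path graph on 8 vertices
    for k in (1, 2, 100):
        labels, phases = min_label_propagation(8, path, 2, k)
        print(f"iters_per_phase={k}: phases={phases}, labels={labels}")
```

On this path graph, a larger `iters_per_phase` reaches the same labels with fewer phases, that is, fewer simulated barriers. For algorithms whose updates are not monotone, such as PageRank, this naive batching would change the result, which is why a dedicated transformation is needed in general.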
Pages: 117-148
Page count: 32
Related papers
50 in total
  • [1] Tuning the granularity of parallelism for distributed graph processing
    Xinyuan Luo
    Sai Wu
    Wei Wang
    Lidan Shou
    [J]. Distributed and Parallel Databases, 2017, 35 : 117 - 148
  • [2] DISTRIBUTED PARALLELISM OF GRAPH-TRANSFORMATIONS
    EHRIG, H
    BOEHM, P
    HUMMERT, U
    LOWE, M
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1988, 314 : 1 - 19
  • [3] Graph Partitioning for Distributed Graph Processing
    Onizuka M.
    Fujimori T.
    Shiokawa H.
    [J]. Data Science and Engineering, 2017, 2 (1) : 94 - 105
  • [4] REAL-TIME SAR-DATA PROCESSING - PARALLELISM AND GRANULARITY ILLUSIONS AND POSSIBILITIES
    DILECCE, V
    MARINO, D
    MASTRONARDI, G
    [J]. EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 1991, 2 (06) : 643 - 648
  • [5] Multilevel Granularity Parallelism Synthesis on FPGAs
    Papakonstantinou, Alexandros
    Liang, Yun
    Stratton, John A.
    Gururaj, Karthik
    Chen, Deming
    Hwu, Wen-Mei W.
    Cong, Jason
    [J]. 2011 IEEE 19TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2011, : 178 - 185
  • [6] Performance Tuning in Distributed Processing of ETL
    Yang, Ping
    Liu, Zaiying
    Ni, Jun
    [J]. 2013 SEVENTH INTERNATIONAL CONFERENCE ON INTERNET COMPUTING FOR ENGINEERING AND SCIENCE (ICICSE 2013), 2013, : 85 - 88
  • [7] Data Replication for Distributed Graph Processing
    Ho, Li-Yung
    Wu, Jan-Jan
    Liu, Pangfeng
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 319 - 326
  • [8] An Elasticity Study of Distributed Graph Processing
    Au, Sietse
    Uta, Alexandru
    Ilyushkin, Alexey
    Iosup, Alexandru
    [J]. 2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 382 - 383
  • [9] AsynGraph: Maximizing Data Parallelism for Efficient Iterative Graph Processing on GPUs
    Zhang, Yu
    Liao, Xiaofei
    Gu, Lin
    Jin, Hai
    Hu, Kan
    Liu, Haikun
    He, Bingsheng
    [J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2020, 17 (04)
  • [10] Online Multimedia Similarity Search with Response Time-Aware Parallelism and Task Granularity Auto-Tuning
    Andrade, Guilherme
    Teodoro, George
    Ferreira, Renato
    [J]. 2017 29TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 2017, : 153 - 160