Graph Partitioning for Distributed Graph Processing

被引:24
|
作者
Onizuka M. [1 ]
Fujimori T. [1 ]
Shiokawa H. [2 ]
机构
[1] Graduate School of Information Science and Technology, Osaka University, 1-5, Yamadaoka, Suita, 565-0871, Osaka
[2] Center of Computational Sciences, University of Tsukuba, 1-1-1, Tennoudai, Tsukuba, Ibaraki
关键词
Distributed processing; Graph mining; Graph partitioning;
D O I
10.1007/s41019-017-0034-4
中图分类号
学科分类号
摘要
There is a large demand for distributed engines that efficiently process large-scale graph data, such as social graph and web graph. The distributed graph engines execute analysis process after partitioning input graph data and assign them to distributed computers, so the quality of graph partitioning largely affects the communication cost and load balance among computers during the analysis process. We propose an effective graph partitioning technique that achieves low communication cost and good load balance among computers at the same time. We first generate more clusters than the number of computers by extending the modularity-based clustering, and then merge those clusters into balanced-size clusters until the number of clusters becomes the number of computers by using techniques designed for graph packing problem. We implemented our technique on top of distributed graph engine, PowerGraph, and made intensive experiments. The results show that our partitioning technique reduces the communication cost so it improves the response time of graph analysis patterns. In particular, PageRank computation is 3.2 times faster at most than HDRF, the state-of-the art of streaming-based partitioning approach. © 2017, The Author(s).
引用
收藏
页码:94 / 105
页数:11
相关论文
共 50 条
  • [31] Cost-Aware Partitioning for Efficient Large Graph Processing in Geo-Distributed Datacenters
    Zhou, Amelie Chi
    Shen, Bingkun
    Xiao, Yao
    Ibrahim, Shadi
    He, Bingsheng
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (07) : 1707 - 1723
  • [32] TopoX: Topology Refactorization for Efficient Graph Partitioning and Processing
    Li, Dongsheng
    Zhang, Yiming
    Wang, Jinyan
    Tan, Kian-Lee
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (08): : 891 - 905
  • [33] On graph partitioning, spectral analysis, and digital mesh processing
    Gotsman, C
    [J]. SMI 2003: SHAPE MODELING INTERNATIONAL 2003, PROCEEDINGS, 2003, : 165 - 171
  • [34] JET: MULTILEVEL GRAPH PARTITIONING ON GRAPHICS PROCESSING UNITS
    Gilbert, Michael S.
    Madduri, Kamesh
    Boman, Erik G.
    Rajamanickam, Siva
    [J]. SIAM Journal on Scientific Computing, 2024, 46 (05):
  • [35] OPTIMIZED QUANTIZATION IN DISTRIBUTED GRAPH SIGNAL PROCESSING
    Nobre, Isabela Cunha Maia
    Frossard, Pascal
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5376 - 5380
  • [36] Tuning the granularity of parallelism for distributed graph processing
    Luo, Xinyuan
    Wu, Sai
    Wang, Wei
    Shou, Lidan
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2017, 35 (02) : 117 - 148
  • [37] Tuning the granularity of parallelism for distributed graph processing
    Xinyuan Luo
    Sai Wu
    Wei Wang
    Lidan Shou
    [J]. Distributed and Parallel Databases, 2017, 35 : 117 - 148
  • [38] Scalable Concurrency Debugging with Distributed Graph Processing
    Zheng, Long
    Liao, Xiaofei
    Jin, Hai
    Zhao, Jieshan
    Wang, Qinggang
    [J]. PROCEEDINGS OF THE 2018 INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO'18), 2018, : 188 - 199
  • [39] Large Scale Graph Processing in a Distributed Environment
    Upadhyay, Nitesh
    Patel, Parita
    Cheramangalath, Unnikrishnan
    Srikant, Y. N.
    [J]. EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 465 - 477
  • [40] Distributed Signal Processing with Graph Spectral Dictionaries
    Thanou, Dorina
    Frossard, Pascal
    [J]. 2015 53RD ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2015, : 1391 - 1398