A Distributed Algorithm for Large-Scale Graph Partitioning

被引:25
|
作者
Rahimian, Fatemeh [1 ,2 ]
Payberah, Amir H. [2 ]
Girdzijauskas, Sarunas [1 ]
Jelasity, Mark [3 ,4 ]
Haridi, Seif [1 ,2 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
[2] SICS Swedish ICT, SE-16429 Kista, Sweden
[3] Hungarian Acad Sci, MTA SZTE Res Grp AI, H-6701 Szeged, Hungary
[4] Univ Szeged, H-6701 Szeged, Hungary
关键词
Design; Algorithms; Performance; graph partitioning; edge-cut partitioning; vertex-cut partitioning; distributed algorithm; load balancing; simulated annealing; SCHEME;
D O I
10.1145/2714568
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Balanced graph partitioning is an NP-complete problem with a wide range of applications. These applications include many large-scale distributed problems, including the optimal storage of large sets of graph-structured data over several hosts. However, in very large-scale distributed scenarios, state-of-the-art algorithms are not directly applicable because they typically involve frequent global operations over the entire graph. In this article, we propose a fully distributed algorithm called JA-BE-JA that uses local search and simulated annealing techniques for two types of graph partitioning: edge-cut partitioning and vertex-cut partitioning. The algorithm is massively parallel: There is no central coordination, each vertex is processed independently, and only the direct neighbors of a vertex and a small subset of random vertices in the graph need to be known locally. Strict synchronization is not required. These features allow JA-BE-JA to be easily adapted to any distributed graph-processing system from data centers to fully distributed networks. We show that the minimal edge-cut value empirically achieved by JA-BE-JA is comparable to state-of-the-art centralized algorithms such as METIS. In particular, on large social networks, JA-BE-JA outperforms METIS. We also show that JA-BE-JA computes very low vertex-cuts, which are proved significantly more effective than edge-cuts for processing most real-world graphs.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Critical Path Awareness Techniques for Large-Scale Graph Partitioning
    Wu, Baixuan
    Xiao, Zheng
    Lin, Peiying
    Tang, Zhuo
    Li, Kenli
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2023, 8 (03): : 412 - 422
  • [22] Automatic Graph Partitioning for Very Large-scale Deep Learning
    Tanaka, Masahiro
    Taura, Kenjiro
    Hanawa, Toshihiro
    Torisawa, Kentaro
    [J]. 2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 1004 - 1013
  • [23] 3-D Partitioning for Large-Scale Graph Processing
    Li, Xue
    Zhang, Mingxing
    Chen, Kang
    Wu, Yongwei
    Qian, Xuehai
    Zheng, Weiming
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (01) : 111 - 127
  • [24] ON NETWORK PARTITIONING ALGORITHM OF LARGE-SCALE CMOS CIRCUITS
    DENG, AC
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (02): : 294 - 299
  • [25] A Distributed Algorithm for Large-Scale Generalized Matching
    Manshadi, Faraz Makari
    Awerbuch, Baruch
    Gemulla, Rainer
    Khandekar, Rohit
    Mestre, Julian
    Sozio, Mauro
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (09): : 613 - 624
  • [26] Partitioning for Large-scale Systems: A Sequential Distributed MPC Design
    Barreiro-Gomez, J.
    Ocampo-Martinez, C.
    Quijano, N.
    [J]. IFAC PAPERSONLINE, 2017, 50 (01): : 8838 - 8843
  • [27] TSH: Easy-to-be distributed partitioning for large-scale graphs
    Wang, Ning
    Wang, Zhigang
    Gu, Yu
    Bao, Yubin
    Yu, Ge
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 101 : 804 - 818
  • [28] A Stream Partitioning Approach to Processing Large Scale Distributed Graph Datasets
    Wang, Rui
    Chiu, Kenneth
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [29] A Distributed Graph Algorithm for Discovering Unique Behavioral Groups from Large-Scale Telco Data
    Ho, Qirong
    Lin, Wenqing
    Shaham, Eran
    Krishnaswamy, Shonali
    The Anh Dang
    Wang, Jingxuan
    Zhongyan, Isabel Choo
    Shi-Nash, Amy
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1353 - 1362
  • [30] Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data
    Bae, Minho
    Jeong, Minjoong
    Oh, Sangyoon
    [J]. IEEE ACCESS, 2020, 8 : 72801 - 72813