A Distributed Algorithm for Large-Scale Graph Partitioning

Cited by: 25
Authors
Rahimian, Fatemeh [1,2]
Payberah, Amir H. [2]
Girdzijauskas, Sarunas [1]
Jelasity, Mark [3,4]
Haridi, Seif [1,2]
Affiliations
[1] KTH Royal Inst Technol, Stockholm, Sweden
[2] SICS Swedish ICT, SE-16429 Kista, Sweden
[3] Hungarian Acad Sci, MTA SZTE Res Grp AI, H-6701 Szeged, Hungary
[4] Univ Szeged, H-6701 Szeged, Hungary
Keywords
Design; Algorithms; Performance; graph partitioning; edge-cut partitioning; vertex-cut partitioning; distributed algorithm; load balancing; simulated annealing; SCHEME
DOI
10.1145/2714568
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Balanced graph partitioning is an NP-complete problem with a wide range of applications. Many of these are large-scale distributed problems, such as the optimal storage of large sets of graph-structured data over several hosts. However, in very large-scale distributed scenarios, state-of-the-art algorithms are not directly applicable because they typically involve frequent global operations over the entire graph. In this article, we propose a fully distributed algorithm called JA-BE-JA that uses local search and simulated annealing techniques for two types of graph partitioning: edge-cut partitioning and vertex-cut partitioning. The algorithm is massively parallel: there is no central coordination, each vertex is processed independently, and only the direct neighbors of a vertex and a small subset of random vertices in the graph need to be known locally. Strict synchronization is not required. These features allow JA-BE-JA to be easily adapted to any distributed graph-processing system, from data centers to fully distributed networks. We show that the minimal edge-cut value empirically achieved by JA-BE-JA is comparable to that of state-of-the-art centralized algorithms such as METIS. In particular, on large social networks, JA-BE-JA outperforms METIS. We also show that JA-BE-JA computes very low vertex-cuts, which have been shown to be significantly more effective than edge-cuts for processing most real-world graphs.
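The local-search idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: it is a sequential simulation of a color-swapping heuristic for edge-cut partitioning, and the function names (`edge_cut`, `degree_in`, `swap_partition`), the acceptance rule, and all parameters (`t0`, `delta`, `alpha`, `sample_size`) are assumptions chosen for illustration. Each vertex looks only at its direct neighbors and a small random sample of vertices, and swaps partition labels with a partner when the swap looks beneficial under a temperature that cools toward 1.

```python
import random
from collections import Counter


def edge_cut(adj, color):
    """Count edges whose two endpoints lie in different partitions."""
    return sum(1 for u in adj for v in adj[u] if u < v and color[u] != color[v])


def degree_in(adj, color, node, c):
    """Number of neighbors of `node` that currently have color `c`."""
    return sum(1 for v in adj[node] if color[v] == c)


def swap_partition(adj, color, rounds=200, t0=2.0, delta=0.01,
                   alpha=2.0, sample_size=3, seed=0):
    """Sequentially simulate a swap-based local search (illustrative only).

    adj:   dict mapping each vertex to the set of its neighbors
    color: dict mapping each vertex to its initial partition label
    All keyword parameters are hypothetical tuning knobs.
    """
    rng = random.Random(seed)
    color = dict(color)  # work on a copy; the caller's dict is untouched
    nodes = list(adj)
    temp = t0
    for _ in range(rounds):
        for p in rng.sample(nodes, len(nodes)):  # random visiting order
            # Candidate partners: direct neighbors plus a few random vertices.
            extra = rng.sample(nodes, min(sample_size, len(nodes)))
            best, best_score = None, 0.0
            for q in list(adj[p]) + extra:
                if q == p or color[q] == color[p]:
                    continue
                old = (degree_in(adj, color, p, color[p]) ** alpha
                       + degree_in(adj, color, q, color[q]) ** alpha)
                new = (degree_in(adj, color, p, color[q]) ** alpha
                       + degree_in(adj, color, q, color[p]) ** alpha)
                # While temp > 1, slightly worse swaps may be accepted,
                # which helps the search escape local optima.
                if new * temp > old and new > best_score:
                    best, best_score = q, new
            if best is not None:
                color[p], color[best] = color[best], color[p]
        temp = max(1.0, temp - delta)  # linear cooling toward 1
    return color


# Two triangles joined by a single edge (2, 3).
adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
       3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4}}
initial = {0: 0, 1: 1, 2: 0, 3: 1, 4: 0, 5: 1}
result = swap_partition(adj, initial)
print(edge_cut(adj, initial), edge_cut(adj, result))
```

Because vertices only ever exchange labels, the number of vertices per partition never changes, so the balance of the initial assignment is preserved without any global bookkeeping. The final cut depends on the random seed and the assumed parameters.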
Pages: 24
Related papers
50 in total
  • [41] A distributed algorithm for operating large-scale ridesourcing systems
    Zhang, Ruolin
    Masoud, Neda
    [J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2021, 156
  • [42] A distributed clustering algorithm for large-scale dynamic networks
    Bernard, Thibault
    Bui, Alain
    Pilard, Laurence
    Sohier, Devan
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2012, 15 (04): 335 - 350
  • [43] A distributed clustering algorithm for large-scale dynamic networks
    Thibault Bernard
    Alain Bui
    Laurence Pilard
    Devan Sohier
    [J]. Cluster Computing, 2012, 15 : 335 - 350
  • [44] Large-Scale Hyperspectral Image Restoration via a Superpixel Distributed Algorithm Based on Graph Signal Processing
    Cai, Wanyuan
    Jiang, Junzheng
    Qian, Jiang
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [45] VSEP: A Distributed Algorithm for Graph Edge Partitioning
    Zhang, Yu
    Liu, Yanbing
    Yu, Jing
    Liu, Ping
    Guo, Li
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2015, 2015, 9532 : 71 - 84
  • [46] LKAQ: Large-scale knowledge graph approximate query algorithm
    Wan, Xiaolong
    Wang, Hongzhi
    Li, Jianzhong
    [J]. INFORMATION SCIENCES, 2019, 505 : 306 - 324
  • [47] A new algorithm for shortest path problem in large-scale graph
    Xiao, Li
    Chen, Lixue
    Xiao, Jingzhong
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2012, 6 (03): 657 - 663
  • [48] A GRAPH-COLORING ALGORITHM FOR LARGE-SCALE SCHEDULING PROBLEMS
    LOTFI, V
    SARIN, S
    [J]. COMPUTERS & OPERATIONS RESEARCH, 1986, 13 (01) : 27 - 32
  • [49] Large-scale Entity Alignment via Knowledge Graph Merging, Partitioning and Embedding
    Xin, Kexuan
    Sun, Zequn
    Hua, Wen
    Hu, Wei
    Qu, Jianfeng
    Zhou, Xiaofang
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022: 2240 - 2249
  • [50] Performance Characteristics of Graph500 on Large-Scale Distributed Environment
    Suzumura, Toyotaro
    Ueno, Koji
    Sato, Hitoshi
    Fujisawa, Katsuki
    Matsuoka, Satoshi
    [J]. 2011 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2011: 149 - 158