Distributed Graph Clustering and Sparsification

被引:14
|
作者
Sun, He [1 ]
Zanetti, Luca [2 ]
机构
[1] Univ Edinburgh, Sch Informat, 10 Crichton St, Edinburgh EH8 9AB, Midlothian, Scotland
[2] Univ Cambridge, William Gates Bldg, Cambridge CB3 0FD, Scotland
关键词
Graph clustering; graph sparsification; distributed computing;
D O I
10.1145/3364208
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Graph clustering is a fundamental computational problem with a number of applications in algorithm design, machine learning, data mining, and analysis of social networks. Over the past decades, researchers have proposed a number of algorithmic design methods for graph clustering. Most of these methods, however, are based on complicated spectral techniques or convex optimisation and cannot be directly applied for clustering many networks that occur in practice, whose information is often collected on different sites. Designing a simple and distributed clustering algorithm is of great interest and has comprehensive applications for processing big datasets. In this article, we present a simple and distributed algorithm for graph clustering: For a wide class of graphs that are characterised by a strong cluster-structure, our algorithm finishes in a poly-logarithmic number of rounds and recovers a partition of the graph close to optimal. One of the main procedures behind our algorithm is a sampling scheme that, given a dense graph as input, produces a sparse subgraph that provably preserves the cluster-structure of the input. Compared with previous sparsification algorithms that require Laplacian solvers or involve combinatorial constructions, this procedure is easy to implement in a distributed setting and runs fast in practice.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Simple Parallel and Distributed Algorithms for Spectral Graph Sparsification
    Koutis, Ioannis
    [J]. PROCEEDINGS OF THE 26TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES (SPAA'14), 2014, : 61 - 66
  • [2] DISTRIBUTED LINK SPARSIFICATION FOR SCALABLE SCHEDULING USING GRAPH NEURAL NETWORKS
    Zhao, Zhongyuan
    Swami, Ananthram
    Segarra, Santiago
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 5308 - 5312
  • [3] Uncertain Graph Sparsification
    Parchas, Panos
    Papailiou, Nikolaos
    Papadias, Dimitris
    Bonchi, Francesco
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 2141 - 2142
  • [4] Uncertain Graph Sparsification
    Parchas, Panos
    Papailiou, Nikolaos
    Papadias, Dimitris
    Bonchi, Francesco
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (12) : 2435 - 2449
  • [5] Graph sparsification with graph convolutional networks
    Jiayu Li
    Tianyun Zhang
    Hao Tian
    Shengmin Jin
    Makan Fardad
    Reza Zafarani
    [J]. International Journal of Data Science and Analytics, 2022, 13 : 33 - 46
  • [6] Graph sparsification with graph convolutional networks
    Li, Jiayu
    Zhang, Tianyun
    Tian, Hao
    Jin, Shengmin
    Fardad, Makan
    Zafarani, Reza
    [J]. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 13 (01) : 33 - 46
  • [7] Distributed structural clustering on large graph
    Rong, Chuitian
    Zhou, Jinyu
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (22):
  • [8] Distributed Graph Clustering by Load Balancing
    Sun, He
    Zanetti, Luca
    [J]. PROCEEDINGS OF THE 29TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES (SPAA'17), 2017, : 163 - 171
  • [9] Scaling Graph Clustering with Distributed Sketches
    Priest, Benjamin W.
    Dunton, Alec
    Sanders, Geoffrey
    [J]. 2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [10] A GENERAL FRAMEWORK FOR GRAPH SPARSIFICATION
    Fung, Wai-Shing
    Hariharan, Ramesh
    Harvey, Nicholas J. A.
    Panigrahi, Debmalya
    [J]. SIAM JOURNAL ON COMPUTING, 2019, 48 (04) : 1196 - 1223