An Efficient and Scalable Algorithmic Method for Generating Large-Scale Random Graphs

被引:0
|
作者
Alam, Maksudul [1 ,2 ]
Khan, Maleq [1 ]
Vullikanti, Anil [1 ,2 ]
Marathe, Madhav [1 ,2 ]
机构
[1] Virginia Tech, Biocomplex Inst, Network Dynam & Simulat Sci Lab, Blacksburg, VA 24061 USA
[2] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
关键词
network theory; random graphs; parallel programming; distributed computing;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many real-world systems and networks are modeled and analyzed using various random graph models. These models must incorporate relevant properties such as degree distribution and clustering coefficient. Many models, such as the Chung-Lu (CL), stochastic Kronecker, stochastic block model (SBM), and block two-level Erdos-Renyi (BTER) models have been devised to capture those properties. However, the generative algorithms for these models are mostly sequential and take prohibitively long time to generate large-scale graphs. In this paper, we present a novel time and space efficient algorithmic method to generate random graphs using CL, BTER, and SBM models. First, we present an efficient sequential algorithm and an efficient distributed-memory parallel algorithm for the CL model. Our sequential algorithm takes O(m) time and O(Lambda) space, where m and. are the number of edges and distinct degrees, and our parallel algorithm takes O (m/P + Lambda + P) time w.h.p. and O(Lambda) space using P processors. These algorithms are almost time optimal since any sequential and parallel algorithms need at least O(m) and O(m P) time, respectively. Our algorithms outperform the best known previous algorithms by a significant margin in terms of both time and space. Experimental results on various large-scale networks show that both of our sequential and parallel algorithms require 400-15000 times less memory than the existing sequential and parallel algorithms, respectively, making our algorithms suitable for generating very large-scale networks. Moreover, both of our algorithms are about 3-4 times faster than the existing sequential and parallel algorithms. Finally, we show how our algorithmic method also leads to efficient parallel and sequential algorithms for the SBM and BTER models.
引用
收藏
页码:372 / 383
页数:12
相关论文
共 50 条
  • [1] Algorithms for generating large-scale clustered random graphs
    Wang, Cheng
    Lizardo, Omar
    Hachen, David
    [J]. NETWORK SCIENCE, 2014, 2 (03) : 403 - 415
  • [2] Large-scale structures in random graphs
    Bottcher, Julia
    [J]. SURVEYS IN COMBINATORICS 2017, 2017, 440 : 87 - 140
  • [3] Generating Large-Scale Heterogeneous Graphs for Benchmarking
    Gupta, Amarnath
    [J]. SPECIFYING BIG DATA BENCHMARKS, 2014, 8163 : 113 - 128
  • [4] An efficient pruning method for subgraph matching in large-scale graphs
    Moayed, Hojjat
    Mansoori, Eghbal G.
    Moosavi, Mohammad R.
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (10): : 10511 - 10532
  • [5] An efficient pruning method for subgraph matching in large-scale graphs
    Hojjat Moayed
    Eghbal G. Mansoori
    Mohammad R. Moosavi
    [J]. The Journal of Supercomputing, 2023, 79 : 10511 - 10532
  • [6] Random Slicing: Efficient and Scalable Data Placement for Large-Scale Storage Systems
    Miranda, Alberto
    Effert, Sascha
    Kang, Yangwook
    Miller, Ethan L.
    Popov, Ivan
    Brinkmann, Andre
    Friedetzky, Tom
    Cortes, Toni
    [J]. ACM TRANSACTIONS ON STORAGE, 2014, 10 (03)
  • [7] Scalable Motif Counting for Large-scale Temporal Graphs
    Gao, Zhongqiang
    Cheng, Chuanqi
    Yu, Yanwei
    Cao, Lei
    Huang, Chao
    Dong, Junyu
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2656 - 2668
  • [8] Parallel generation of large-scale random graphs
    Vullikanti, Anil
    [J]. 2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 278 - 278
  • [9] Random walk with jumps in large-scale random geometric graphs
    Tzevelekas, Leonidas
    Oikonomou, Konstantinos
    Stavrakakis, Ioannis
    [J]. COMPUTER COMMUNICATIONS, 2010, 33 (13) : 1505 - 1514
  • [10] Efficient Machine Learning On Large-Scale Graphs
    Erickson, Parker
    Lee, Victor E.
    Shi, Feng
    Tang, Jiliang
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4788 - 4789