Generating synthetic social graphs with Darwini

被引:2
|
作者
Edunov, Sergey [1 ]
Logothetis, Dionysios [1 ]
Ching, Avery [1 ]
Kabiljo, Maja [1 ]
Wang, Cheng [2 ]
机构
[1] Facebook, Menlo Pk, CA 94025 USA
[2] Univ Houston, Houston, TX 77004 USA
关键词
D O I
10.1109/ICDCS.2018.00062
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Synthetic graph generators facilitate research in graph algorithms and graph processing systems by providing access to graphs that resemble real social networks while addressing privacy and security concerns. Nevertheless, their practical value lies in their ability to capture important metrics of real graphs, such as degree distribution and clustering properties. Graph generators must also be able to produce such graphs at the scale of real-world industry graphs, that is, hundreds of billions or trillions of edges. In this paper, we propose Darwini, a graph generator that captures a number of core characteristics of real graphs. Importantly, given a source graph, it can reproduce the degree distribution and, unlike existing approaches, the local clustering coefficient distribution. Furthermore, Darwini maintains a number of metrics, such as graph assortativity, eigenvalues, and others. Comparing Darwini with state-of-the-art generative models, we show that it can reproduce these characteristics more accurately. Finally, we provide an open source implementation of Darwini on the vertex-centric Apache Giraph (TM) model that can generate synthetic graphs with up to 3 trillion edges.
引用
收藏
页码:567 / 577
页数:11
相关论文
共 50 条
  • [1] Generating Synthetic Graphs for Large Sensitive and Correlated Social Networks
    Ju, Xin
    Zhang, Xiaofeng
    Cheung, William K.
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, : 286 - 293
  • [2] Generating Synthetic Decentralized Social Graphs with Local Differential Privacy
    Qin, Zhan
    Yu, Ting
    Yang, Yin
    Khalil, Issa
    Xiao, Xiaokui
    Ren, Kui
    [J]. CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 425 - 438
  • [3] ProvGen: Generating Synthetic PROV Graphs with Predictable Structure
    Firth, Hugo
    Missier, Paolo
    [J]. PROVENANCE AND ANNOTATION OF DATA AND PROCESSES (IPAW 2014), 2015, 8628 : 16 - 27
  • [4] Generating synthetic task graphs for simulating stream computing systems
    Ajwani, Deepak
    Ali, Shoukat
    Katrinis, Kostas
    Li, Cheng-Hong
    Park, Alfred J.
    Morrison, John P.
    Schenfeld, Eugen
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (10) : 1362 - 1374
  • [5] Spectral Graph Forge: A Framework for Generating Synthetic Graphs With a Target Modularity
    Baldesi, Luca
    Markopoulou, Athina
    Butts, Carter T.
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2019, 27 (05) : 2125 - 2136
  • [6] Generating Simple Directed Social Network Graphs for Information Spreading
    Schweimer, Christoph
    Gfrerer, Christine
    Lugstein, Florian
    Pape, David
    Velimsky, Jan A.
    Elsasser, Robert
    Geiger, Bernhard C.
    [J]. PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1475 - 1485
  • [7] Generating trusted graphs for trust evaluation in online social networks
    Jiang, Wenjun
    Wang, Guojun
    Wu, Jie
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 31 : 48 - 58
  • [8] An automata algorithm for generating trusted graphs in online social networks
    Fatehi, Nina
    Shahhoseini, Hadi Shahriar
    Wei, Jesse
    Chang, Ching-Ter
    [J]. APPLIED SOFT COMPUTING, 2022, 118
  • [9] An End-to-End Network for Generating Social Relationship Graphs
    Goel, Arushi
    Ma, Keng Teck
    Tan, Cheston
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11178 - 11187
  • [10] A synthetic data generator for online social network graphs
    Nettleton, David F.
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2016, 6 (01)