DRONE: An Efficient Distributed Subgraph-Centric Framework for Processing Large-Scale Power-law Graphs

Cited by: 2
Authors
Zhang, Shuai [1, 2]
Jiang, Zite [1, 2, 3]
Hou, Xingzhong [1, 2]
Li, Mingyu [1]
Yuan, Mengting [4]
You, Haihang [1, 3]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Processors, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China
[3] Zhongguancun Lab, Beijing 102206, Peoples R China
[4] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
Keywords
Fault tolerance; graph partition; large-scale power-law graph; parallel graph computation; subgraph-centric model; VERTEX;
DOI
10.1109/TPDS.2022.3223068
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The ever-increasing volume of graph-structured data, such as social networks, graph databases and knowledge graphs, needs to be processed efficiently and scalably. These natural graphs, commonly found in the real world, have a highly skewed power-law degree distribution and are called power-law graphs. The subgraph-centric programming model is a promising approach used in many state-of-the-art distributed graph computing frameworks. However, the performance of subgraph-centric frameworks is limited when processing large-scale power-law graphs, and existing graph partitioning algorithms are not suitable for power-law graphs when deployed in a subgraph-centric framework. In this paper, we present a novel distributed graph computing framework, DRONE (Distributed gRaph cOmputiNg Engine), which leverages the subgraph-centric model and the vertex-cut graph partitioning strategy. DRONE also supports a fault tolerance mechanism, with negligible overhead (6.48% on average), to accommodate the increasing scale of machines. We further study the execution workflow of DRONE and propose an efficient and balanced graph partition algorithm (EBV) for DRONE. Experiments show that DRONE reduces the running time on real-world graphs by 25.6% on average compared with state-of-the-art distributed graph computing frameworks. In addition, the EBV graph partition algorithm reduces the replication factor by at least 21.8% compared with other self-based partition algorithms. Our results indicate that DRONE has excellent potential for processing large-scale power-law graphs.
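The replication factor mentioned in the abstract is the usual quality metric for vertex-cut partitioning: edges are assigned to partitions, a vertex is replicated on every partition that holds one of its edges, and the metric is the average number of replicas per vertex. The sketch below computes it for a toy graph. It is only an illustration of the metric under that standard definition, not the EBV algorithm or DRONE code; the function name `replication_factor` and the input layout (a dict from edge to partition id) are assumptions made for this example.

```python
from collections import defaultdict


def replication_factor(edge_to_part):
    """Average number of partitions each vertex appears in.

    edge_to_part: dict mapping an edge (u, v) to its partition id
    (hypothetical input layout chosen for this sketch).
    """
    parts_of = defaultdict(set)  # vertex -> set of partitions holding a replica
    for (u, v), part in edge_to_part.items():
        parts_of[u].add(part)
        parts_of[v].add(part)
    if not parts_of:
        raise ValueError("edge assignment is empty")
    total_replicas = sum(len(parts) for parts in parts_of.values())
    return total_replicas / len(parts_of)


# Star graph: hub vertex 0 with four neighbours, edges split over two partitions.
# The high-degree hub is cut and replicated on both partitions.
assignment = {(0, 1): 0, (0, 2): 0, (0, 3): 1, (0, 4): 1}
rf = replication_factor(assignment)
```

A value of 1.0 means no vertex is replicated; larger values mean more replicas to keep synchronized, which is why a lower replication factor is generally associated with less communication.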
Pages: 463-474
Page count: 12