Hybrid Pulling/Pushing for I/O-Efficient Distributed and Iterative Graph Computing

被引:23
|
作者
Wang, Zhigang [1 ]
Gu, Yu [1 ]
Bao, Yubin [1 ]
Yu, Ge [1 ]
Yu, Jeffrey Xu [2 ]
机构
[1] Northeastern Univ, Shenyang, Liaoning, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
I/O-Efficient; Distributed Graph Computing; Push; Pull; FRAMEWORK;
D O I
10.1145/2882903.2882938
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Billion-node graphs are rapidly growing in size in many applications such as online social networks. Most graph algorithms generate a large number of messages during iterative computations. Vertex-centric distributed systems usually store graph data and message data on disk to improve scalability. Currently, these distributed systems with disk-resident data take a push-based approach to handle messages. This works well if few messages reside on disk. Otherwise, it is I/O-inefficient due to expensive random writes. By contrast, the existing memory-resident pull-based approach individually pulls messages for each vertex on demand. Although it can be used to avoid disk operations regarding messages, expensive I/O costs are incurred by random and frequent access to vertices. This paper proposes a hybrid solution to support switching between push and pull adaptively, to obtain optimal performance for distributed systems with disk-resident data in different scenarios. We first employ a new block-centric technique (b-pull) to improve the I/O-performance of pulling messages, although the iterative computation is vertex-centric. I/O costs of data accesses are shifted from the receiver side where messages are written/read by push to the sender side where graph data are read by b-pull. Graph data are organized by clustering vertices and edges to achieve high I/O efficiency in b-pull. Second, we design a seamless switching mechanism and a prominent performance prediction method to guarantee efficiency when switching between push and b-pull. We conduct extensive performance studies to confirm the effectiveness of our proposals over existing up-to-date solutions using a broad spectrum of real-world graphs.
引用
收藏
页码:479 / 494
页数:16
相关论文
共 50 条
  • [41] I/O-Efficient Algorithms for Degeneracy Computation on Massive Networks
    Li, Ronghua
    Song, Qiushuo
    Xiao, Xiaokui
    Qin, Lu
    Wang, Guoren
    Yu, Jeffrey
    Mao, Rui
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (07) : 3335 - 3348
  • [42] I/O-Efficient Algorithms for Topological Sort and Related Problems
    Cao, Nairen
    Fineman, Jeremy T.
    Russell, Katina
    Yang, Eugene
    [J]. ACM TRANSACTIONS ON ALGORITHMS, 2022, 18 (01)
  • [43] Worst-Case I/O-Efficient Skyline Algorithms
    Sheng, Cheng
    Tao, Yufei
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2012, 37 (04):
  • [44] I/O-Efficient Path Traversal in Succinct Planar Graphs
    Craig Dillabaugh
    Meng He
    Anil Maheshwari
    Norbert Zeh
    [J]. Algorithmica, 2017, 77 : 714 - 755
  • [45] Implementing I/O-efficient data structures using TPIE
    Arge, L
    Procopiuc, O
    Vitter, JS
    [J]. ALGORITHMS-ESA 2002, PROCEEDINGS, 2002, 2461 : 88 - 100
  • [46] I/O-efficient algorithms on near-planar graphs
    Haverkort, H
    Toma, L
    [J]. LATIN 2006: THEORETICAL INFORMATICS, 2006, 3887 : 580 - 591
  • [47] GraphWalker: An I/O-Efficient and Resource-Friendly Graph Analytic System for Fast and Scalable Random Walks
    Wang, Rui
    Li, Yongkun
    Xie, Hong
    Xu, Yinlong
    Lui, John C. S.
    [J]. PROCEEDINGS OF THE 2020 USENIX ANNUAL TECHNICAL CONFERENCE, 2020, : 559 - 571
  • [48] I/O-Efficient Dictionary Search with One Edit Error
    Chung, Chin-Wan
    Tao, Yufei
    Wang, Wei
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2014, 2014, 8799 : 191 - 202
  • [49] I/O-efficient computation of water flow across a terrain
    MADALGO, University of Aarhus, Aarhus, Denmark
    不详
    [J]. Proc Annu Symp Comput Geom, (403-412):
  • [50] An I/O-Efficient Disk-based Graph System for Scalable Second-Order RandomWalk of Large Graphs
    Li, Hongzheng
    Shao, Yingxia
    Du, Junping
    Cui, Bin
    Chen, Lei
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (08): : 1619 - 1631