LOSC: Efficient Out-of-Core Graph Processing with Locality-optimized Subgraph Construction

Authors
Xu, Xianghao [1]
Wang, Fang [1,4]
Jiang, Hong [2]
Cheng, Yongli [1,3]
Hua, Yu [1]
Feng, Dan [1]
Zhang, Yongxuan [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan, Hubei, Peoples R China
[2] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[3] FuZhou Univ, Coll Math & Comp Sci, Fuzhou, Fujian, Peoples R China
[4] Shenzhen Huazhong Univ Sci & Technol, Res Inst, Shenzhen, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
graph computing; out-of-core; subgraph construction; FRAMEWORK;
DOI
10.1145/3326285.3329069
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Big data applications increasingly rely on the analysis of large graphs. In recent years, a number of out-of-core graph processing systems have been proposed to process graphs with billions of edges on a single commodity computer by making efficient use of secondary storage (e.g., hard disk, SSD). Meanwhile, the vertex-centric computing model is widely used in graph processing thanks to its broad applicability and expressiveness. Unfortunately, when the vertex-centric model is implemented for out-of-core graph processing, the large number of random memory accesses required to construct subgraphs becomes a serious performance bottleneck: it substantially weakens cache access locality and leaves users waiting a long time for results. In this paper, we propose an efficient out-of-core graph processing system, LOSC, that substantially reduces the overhead of subgraph construction without sacrificing the underlying vertex-centric computing model. LOSC uses a locality-optimized subgraph construction scheme that significantly improves the in-memory data access locality of the subgraph construction phase. Furthermore, LOSC adopts a compact edge storage format and lightweight vertex replication to reduce I/O traffic and improve computation efficiency. Extensive evaluation results show that LOSC is 6.9x and 3.5x faster than GraphChi and GridGraph, respectively, two state-of-the-art out-of-core systems.
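For readers unfamiliar with the terms in the abstract, the following is a minimal sketch of the general idea of building per-vertex subgraphs from a compact edge layout. It is not LOSC's actual storage format or construction scheme, which this record does not describe; it uses a generic CSR-style (compressed sparse row) layout grouped by destination vertex, and the names `build_csr`, `in_neighbors`, and `vertex_centric_step` are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple


def build_csr(num_vertices: int, edges: Sequence[Tuple[int, int]]):
    """Pack (src, dst) edges into two flat arrays grouped by destination.

    Generic CSR-style layout (an assumption for illustration only):
    the in-edges of vertex v occupy sources[offsets[v]:offsets[v + 1]].
    """
    offsets = [0] * (num_vertices + 1)
    for _, dst in edges:              # count in-edges per destination
        offsets[dst + 1] += 1
    for v in range(num_vertices):     # prefix sum turns counts into offsets
        offsets[v + 1] += offsets[v]

    sources = [0] * len(edges)
    cursor = offsets[:-1].copy()      # next free slot for each destination
    for src, dst in edges:
        sources[cursor[dst]] = src
        cursor[dst] += 1
    return offsets, sources


def in_neighbors(offsets: List[int], sources: List[int], v: int) -> List[int]:
    """Return the in-neighbors of v as one contiguous slice."""
    return sources[offsets[v]:offsets[v + 1]]


def vertex_centric_step(offsets: List[int], sources: List[int],
                        values: List[float],
                        update: Callable[[int, List[float]], float]) -> List[float]:
    """Run one vertex-centric pass: each vertex reads its in-neighbors' values."""
    n = len(offsets) - 1
    return [update(v, [values[u] for u in in_neighbors(offsets, sources, v)])
            for v in range(n)]


if __name__ == "__main__":
    offsets, sources = build_csr(3, [(0, 1), (2, 1), (1, 2), (0, 2)])
    # Each vertex sums the values of its in-neighbors.
    print(vertex_centric_step(offsets, sources, [1.0, 2.0, 3.0],
                              lambda v, ins: sum(ins)))
```

In this layout a vertex's edge list is read sequentially, but the lookups into `values` are still scattered across memory, which is the kind of random access the abstract identifies as the bottleneck during subgraph construction.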
Pages: 10