MultiLogVC: Efficient Out-of-Core Graph Processing Framework for Flash Storage

被引:3
|
作者
Matam, Kiran Kumar [1 ]
Hashemi, Hanieh [2 ]
Annavaram, Murali [2 ]
机构
[1] Facebook Inc, Menlo Pk, CA 94025 USA
[2] Univ Southern Calif, Elect Engn Dept, Los Angeles, CA 90007 USA
关键词
out-of-core graph processing; graph analytics; SSD storage systems; log storage;
D O I
10.1109/IPDPS49936.2021.00033
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph analytics are at the heart of a broad range of applications such as drug discovery, page ranking, transportation systems, and recommendation models. When graph size exceeds the available memory size in a computing node, out-of-core graph processing is needed. For the widely used out-of-core graph processing systems. the graphs are stored and accessed from a long latency SSD storage, which becomes a significant performance bottleneck. To tackle this long latency this work exploits the key insight that that nearly all graph algorithms have a dynamically varying number of active vertices that must be processed in each iteration. However, existing graph processing frameworks, such as GraphChi, load the entire graph in each iteration even if a small fraction of the graph is active. This limitation is due to the structure of the graph storage used by these systems. In this work, we propose to use a compressed sparse row (CSR) based graph storage that is more amenable for selectively loading only a few active vertices in each iteration. However, CSR based graph processing suffers from random update propagation to many target vertices. To solve this challenge, we propose to use a multi-log update mechanism that logs updates separately, rather than directly update the active edges and vertices in a graph. The multi-log system maintains a separate log per each vertex interval (a group of vertices). This separation enables efficient processing of all updates bound to each vertex interval by just loading the corresponding log. Further, by logging all the updates associated with a vertex interval in one contiguous log this approach reduces read amplification since all the pages in the log will be processed in the next iteration without wasted page reads. Over the current state of the art out-of-core graph processing framework, our evaluation results show that the MultiLogVC framework improves performance by up to 17.84x, 1.19x, 1.65x, 1.38x, 3.15x, and 6.00x for the widely used breadth-first search, pagerank, community detection, graph coloring, maximal independent set, and random-walk applications, respectively.
引用
收藏
页码:245 / 255
页数:11
相关论文
共 50 条
  • [21] CLIP: A Disk I/O Focused Parallel Out-of-Core Graph Processing System
    Ai, Zhiyuan
    Zhang, Mingxing
    Wu, Yongwei
    Qian, Xuehai
    Chen, Kang
    Zheng, Weimin
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (01) : 45 - 62
  • [22] LOSC: A locality-optimized subgraph construction scheme for out-of-core graph processing
    Xu, Xianghao
    Wang, Fang
    Jiang, Hong
    Cheng, Yongli
    Hua, Yu
    Feng, Dan
    Zhang, Yongxuan
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 172 : 51 - 68
  • [23] An efficient algorithm for out-of-core matrix transposition
    Suh, J
    Prasanna, VK
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2002, 51 (04) : 420 - 438
  • [24] A parallel out-of-core multifrontal method: Storage of factors on disk and analysis of models for an out-of-core active memory
    Agullo, Emmanuel
    Guermouche, Abdou
    L'Excellent, Jean-Yves
    [J]. PARALLEL COMPUTING, 2008, 34 (6-8) : 296 - 317
  • [25] Efficient parallel out-of-core isosurface extraction
    Zhang, HJ
    Newman, TS
    [J]. PVG 2003 PROCEEDINGS, 2003, : 9 - 16
  • [26] An Algebraic Framework for Out-of-Core Hierarchical Segmentation Algorithms
    Cousty, Jean
    Perret, Benjamin
    Phelippeau, Harold
    Carneiro, Stela
    Kamlay, Pierre
    Buzer, Lilian
    [J]. DISCRETE GEOMETRY AND MATHEMATICAL MORPHOLOGY, DGMM 2021, 2021, 12708 : 378 - 390
  • [27] A Framework to Transform In-Core GPU Algorithms to Out-of-Core Algorithms
    Harada, Takahiro
    [J]. PROCEEDINGS I3D 2016: 20TH ACM SIGGRAPH SYMPOSIUM ON INTERACTIVE 3D GRAPHICS AND GAMES, 2016, : 179 - 180
  • [28] Squeezing out All the Value of Loaded Data: An Out-of-core Graph Processing System with Reduced Disk I/O
    Ai, Zhiyuan
    Zhang, Mingxing
    Wu, Yongwei
    Qian, Xuehai
    Chen, Kang
    Zheng, Weimin
    [J]. 2017 USENIX ANNUAL TECHNICAL CONFERENCE (USENIX ATC '17), 2017, : 125 - 137
  • [29] Design of an Efficient Out-of-Core Read Alignment Algorithm
    Konagurthu, Arun S.
    Allison, Lloyd
    Conway, Thomas
    Beresford-Smith, Bryan
    Zobel, Justin
    [J]. ALGORITHMS IN BIOINFORMATICS, 2010, 6293 : 189 - 201
  • [30] Efficient view-dependent out-of-core visualization
    Guthe, M
    Borodin, P
    Klein, R
    [J]. FOURTH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND ITS APPLICATIONS IN INDUSTRY, 2004, 5444 : 428 - 438