LOSC: A locality-optimized subgraph construction scheme for out-of-core graph processing

被引:0
|
作者
Xu, Xianghao [1 ,2 ]
Wang, Fang [2 ]
Jiang, Hong [3 ]
Cheng, Yongli [4 ,5 ]
Hua, Yu [2 ]
Feng, Dan [2 ]
Zhang, Yongxuan [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan, Peoples R China
[3] Univ Texas Arlington, Dept Comp Sci Engn, Arlington, TX USA
[4] FuZhou Univ, Coll Math & Comp Sci, Fuzhou, Peoples R China
[5] Zhejiang lab, Hangzhou, Peoples R China
关键词
Graph processing; Out-of-core; Subgraph construction;
D O I
10.1016/j.jpdc.2022.10.005
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Big data applications increasingly rely on the analysis of large graphs. In recent years, a number of out-of-core graph processing systems have been proposed to process graphs with billions of edges on just one commodity computer, by efficiently using the secondary storage (e.g., hard disk, SSD). Unfortunately, these graph processing systems continue to suffer from poor performance, despite of many solutions proposed to address the disk I/O bottleneck problem, a commonly recognized root cause. However, our experimental results show that another root cause of the poor performance is the subgraph construction phase of graph processing, which induces a large number of random memory accesses that substantially weaken cache access locality and thus greatly degrade the performance. In this paper, we propose an efficient out-of-core graph processing system, LOSC, to substantially reduce the overheads of subgraph construction. LOSC proposes a locality-optimized subgraph construction scheme that significantly improves the in-memory data access locality of the subgraph construction phase. Furthermore, LOSC adopts a compact edge storage format and a lightweight replication of vertices to reduce I/O traffic and improve computation efficiency. Extensive evaluation results show that LOSC is respectively 9.4x and 5.1x faster than GraphChi and GridGraph, two representative out-of-core systems. In addition, LOSC outperforms other state-of-art out-of-core graph processing systems including FlashGraph, GraphZ, G-Store and NXGraph. For example, LOSC can be up to 6.9x faster than FlashGraph.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:51 / 68
页数:18
相关论文
共 47 条
  • [1] LOSC: Efficient Out-of-Core Graph Processing with Locality-optimized Subgraph Construction
    Xu, Xianghao
    Wang, Fang
    Jiang, Hong
    Cheng, Yongli
    Hua, Yu
    Feng, Dan
    Zhang, Yongxuan
    [J]. PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS 2019), 2019,
  • [2] FOG: A Fast Out-of-Core Graph Processing Framework
    Zhiyuan Shao
    Jian He
    Huiming Lv
    Hai Jin
    [J]. International Journal of Parallel Programming, 2017, 45 : 1259 - 1272
  • [3] FOG: A Fast Out-of-Core Graph Processing Framework
    Shao, Zhiyuan
    He, Jian
    Lv, Huiming
    Jin, Hai
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (06) : 1259 - 1272
  • [4] BlockGraphChi: Enabling Block Update in Out-of-Core Graph Processing
    Shao, Zhiyuan
    Mei, Zhenjie
    Ding, Xiaofeng
    Jin, Hai
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 47 (04) : 668 - 685
  • [5] BlockGraphChi: Enabling Block Update in Out-of-Core Graph Processing
    Zhiyuan Shao
    Zhenjie Mei
    Xiaofeng Ding
    Hai Jin
    [J]. International Journal of Parallel Programming, 2019, 47 : 668 - 685
  • [6] OMRGx: Programmable and Transparent Out-of-Core Graph Partitioning and Processing
    Kaur, Gurneet
    Gupta, Rajiv
    [J]. PROCEEDINGS OF THE 2023 ACM SIGPLAN INTERNATIONAL SYMPOSIUM ON MEMORY MANAGEMENT, ISMM 2023, 2023, : 137 - 149
  • [7] MultiLogVC: Efficient Out-of-Core Graph Processing Framework for Flash Storage
    Matam, Kiran Kumar
    Hashemi, Hanieh
    Annavaram, Murali
    [J]. 2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 245 - 255
  • [8] Competition-Based Adaptive Caching for Out-of-core Graph Processing
    Myung, Kihyeon
    Kim, Hwajung
    Lee, Yunjae
    Yeom, HeonYoung
    [J]. 21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 31 - 40
  • [9] SOWalker: An I/O-Optimized Out-of-Core Graph Processing System for Second-Order RandomWalks
    Wu, Yutong
    Shi, Zhan
    Huang, Shicai
    Tian, Zhipeng
    Zuo, Pengwei
    Fang, Peng
    Wang, Fang
    Feng, Dan
    [J]. PROCEEDINGS OF THE 2023 USENIX ANNUAL TECHNICAL CONFERENCE, 2023, : 87 - 100
  • [10] GraphSD: A State and Dependency aware Out-of-Core Graph Processing System
    Xu, Xianghao
    Jiang, Hong
    Wang, Fang
    Cheng, Yongli
    Fang, Peng
    [J]. 51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022, 2022,