A disk I/O optimized system for concurrent graph processing jobs

被引:0
|
作者
Xianghao Xu
Fang Wang
Hong Jiang
Yongli Cheng
Dan Feng
Peng Fang
机构
[1] Nanjing University of Science and Technology,School of Computer Science and Engineering
[2] Huazhong University of Science and Technology,Wuhan National Laboratory for Optoelectronics
[3] University of Texas at Arlington,Department of Computer Science & Engineering
[4] Fuzhou University,College of Computer and Data Science
[5] Zhejiang Lab,undefined
来源
关键词
graph processing; disk I/O; concurrent jobs;
D O I
暂无
中图分类号
学科分类号
摘要
In order to analyze and process the large graphs with high cost efficiency, researchers have developed a number of out-of-core graph processing systems in recent years based on just one commodity computer. On the other hand, with the rapidly growing need of analyzing graphs in the real-world, graph processing systems have to efficiently handle massive concurrent graph processing (CGP) jobs. Unfortunately, due to the inherent design for single graph processing job, existing out-of-core graph processing systems usually incur unnecessary data accesses and severe competition of I/O bandwidth when handling the CGP jobs. In this paper, we propose GraphCP, a disk I/O optimized out-of-core graph processing system that efficiently supports the processing of CGP jobs. GraphCP proposes a benefit-aware sharing execution model to share the I/O access and processing of graph data among the CGP jobs and adaptively schedule the graph data loading based on the states of vertices, which efficiently overcomes above challenges faced by existing out-of-core graph processing systems. Moreover, GraphCP adopts a dependency-based future-vertex updating model so as to reduce disk I/Os in the future iterations. In addition, GraphCP organizes the graph data with a Source-Sorted Sub-Block graph representation for better processing capacity and I/O access locality. Extensive evaluation results show that GraphCP is 20.5× and 8.9× faster than two out-of-core graph processing systems GridGraph and GraphZ, and 3.5× and 1.7× faster than two state-of-art concurrent graph processing systems Seraph and GraphSO.
引用
收藏
相关论文
共 50 条
  • [1] A disk I/O optimized system for concurrent graph processing jobs
    Xu, Xianghao
    Wang, Fang
    Jiang, Hong
    Cheng, Yongli
    Feng, Dan
    Fang, Peng
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (03)
  • [2] CGraph: A Distributed Storage and Processing System for Concurrent Iterative Graph Analysis Jobs
    Zhang, Yu
    Zhao, Jin
    Liao, Xiaofei
    Jin, Hai
    Gu, Lin
    Liu, Haikun
    He, Bingsheng
    He, Ligang
    ACM TRANSACTIONS ON STORAGE, 2019, 15 (02)
  • [3] CLIP: A Disk I/O Focused Parallel Out-of-Core Graph Processing System
    Ai, Zhiyuan
    Zhang, Mingxing
    Wu, Yongwei
    Qian, Xuehai
    Chen, Kang
    Zheng, Weimin
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (01) : 45 - 62
  • [4] GraphCP: An I/O-Efficient Concurrent Graph Processing Framework
    Xu, Xianghao
    Wang, Fang
    Jiang, Hong
    Cheng, Yongli
    Feng, Dan
    Zhang, Yongxuan
    Fang, Peng
    2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
  • [5] SOWalker: An I/O-Optimized Out-of-Core Graph Processing System for Second-Order RandomWalks
    Wu, Yutong
    Shi, Zhan
    Huang, Shicai
    Tian, Zhipeng
    Zuo, Pengwei
    Fang, Peng
    Wang, Fang
    Feng, Dan
    PROCEEDINGS OF THE 2023 USENIX ANNUAL TECHNICAL CONFERENCE, 2023, : 87 - 100
  • [6] Efficient buffering for concurrent disk and tape I/O
    Myllymaki, J
    Livny, M
    PERFORMANCE EVALUATION, 1996, 27-8 : 453 - 471
  • [7] Squeezing out All the Value of Loaded Data: An Out-of-core Graph Processing System with Reduced Disk I/O
    Ai, Zhiyuan
    Zhang, Mingxing
    Wu, Yongwei
    Qian, Xuehai
    Chen, Kang
    Zheng, Weimin
    2017 USENIX ANNUAL TECHNICAL CONFERENCE (USENIX ATC '17), 2017, : 125 - 137
  • [8] Krill: A Compiler and Runtime System for Concurrent Graph Processing
    Chen, Hongzheng
    Shen, Minghua
    Xiao, Nong
    Lu, Yutong
    SC21: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2021,
  • [9] Redio: Accelerating Disk-Based Graph Processing by Reducing Disk I/Os
    Wu, Chengwen
    Zhang, Guangyan
    Wang, Yang
    Jiang, Xinyang
    Zheng, Weimin
    IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (03) : 414 - 425
  • [10] Load the Edges You Need: A Generic I/O Optimization for Disk-based Graph Processing
    Vora, Keval
    Xu, Guoqing
    Gupta, Rajiv
    PROCEEDINGS OF USENIX ATC '16: 2016 USENIX ANNUAL TECHNICAL CONFERENCE, 2016, : 507 - 522