A disk I/O optimized system for concurrent graph processing jobs

被引：0

作者：

Xianghao Xu

Fang Wang

Hong Jiang

Yongli Cheng

Dan Feng

Peng Fang

机构：

[1] Nanjing University of Science and Technology,School of Computer Science and Engineering

[2] Huazhong University of Science and Technology,Wuhan National Laboratory for Optoelectronics

[3] University of Texas at Arlington,Department of Computer Science & Engineering

[4] Fuzhou University,College of Computer and Data Science

[5] Zhejiang Lab,undefined

来源：

Frontiers of Computer Science | 2024年 / 18卷

关键词：

graph processing; disk I/O; concurrent jobs;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In order to analyze and process the large graphs with high cost efficiency, researchers have developed a number of out-of-core graph processing systems in recent years based on just one commodity computer. On the other hand, with the rapidly growing need of analyzing graphs in the real-world, graph processing systems have to efficiently handle massive concurrent graph processing (CGP) jobs. Unfortunately, due to the inherent design for single graph processing job, existing out-of-core graph processing systems usually incur unnecessary data accesses and severe competition of I/O bandwidth when handling the CGP jobs. In this paper, we propose GraphCP, a disk I/O optimized out-of-core graph processing system that efficiently supports the processing of CGP jobs. GraphCP proposes a benefit-aware sharing execution model to share the I/O access and processing of graph data among the CGP jobs and adaptively schedule the graph data loading based on the states of vertices, which efficiently overcomes above challenges faced by existing out-of-core graph processing systems. Moreover, GraphCP adopts a dependency-based future-vertex updating model so as to reduce disk I/Os in the future iterations. In addition, GraphCP organizes the graph data with a Source-Sorted Sub-Block graph representation for better processing capacity and I/O access locality. Extensive evaluation results show that GraphCP is 20.5× and 8.9× faster than two out-of-core graph processing systems GridGraph and GraphZ, and 3.5× and 1.7× faster than two state-of-art concurrent graph processing systems Seraph and GraphSO.

引用

共 50 条

[1] A disk I/O optimized system for concurrent graph processing jobs
Xu, Xianghao
Wang, Fang
Jiang, Hong
Cheng, Yongli
Feng, Dan
Fang, Peng
FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (03)
[2] CGraph: A Distributed Storage and Processing System for Concurrent Iterative Graph Analysis Jobs
Zhang, Yu
Zhao, Jin
Liao, Xiaofei
Jin, Hai
Gu, Lin
Liu, Haikun
He, Bingsheng
He, Ligang
ACM TRANSACTIONS ON STORAGE, 2019, 15 (02)
[3] CLIP: A Disk I/O Focused Parallel Out-of-Core Graph Processing System
Ai, Zhiyuan
Zhang, Mingxing
Wu, Yongwei
Qian, Xuehai
Chen, Kang
Zheng, Weimin
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (01) : 45 - 62
[4] GraphCP: An I/O-Efficient Concurrent Graph Processing Framework
Xu, Xianghao
Wang, Fang
Jiang, Hong
Cheng, Yongli
Feng, Dan
Zhang, Yongxuan
Fang, Peng
2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
[5] SOWalker: An I/O-Optimized Out-of-Core Graph Processing System for Second-Order RandomWalks
Wu, Yutong
Shi, Zhan
Huang, Shicai
Tian, Zhipeng
Zuo, Pengwei
Fang, Peng
Wang, Fang
Feng, Dan
PROCEEDINGS OF THE 2023 USENIX ANNUAL TECHNICAL CONFERENCE, 2023, : 87 - 100
[6] Efficient buffering for concurrent disk and tape I/O
Myllymaki, J
Livny, M
PERFORMANCE EVALUATION, 1996, 27-8 : 453 - 471
[7] Squeezing out All the Value of Loaded Data: An Out-of-core Graph Processing System with Reduced Disk I/O
Ai, Zhiyuan
Zhang, Mingxing
Wu, Yongwei
Qian, Xuehai
Chen, Kang
Zheng, Weimin
2017 USENIX ANNUAL TECHNICAL CONFERENCE (USENIX ATC '17), 2017, : 125 - 137
[8] Krill: A Compiler and Runtime System for Concurrent Graph Processing
Chen, Hongzheng
Shen, Minghua
Xiao, Nong
Lu, Yutong
SC21: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2021,
[9] Redio: Accelerating Disk-Based Graph Processing by Reducing Disk I/Os
Wu, Chengwen
Zhang, Guangyan
Wang, Yang
Jiang, Xinyang
Zheng, Weimin
IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (03) : 414 - 425
[10] Load the Edges You Need: A Generic I/O Optimization for Disk-based Graph Processing
Vora, Keval
Xu, Guoqing
Gupta, Rajiv
PROCEEDINGS OF USENIX ATC '16: 2016 USENIX ANNUAL TECHNICAL CONFERENCE, 2016, : 507 - 522

← 1 2 3 4 5 →