A disk I/O optimized system for concurrent graph processing jobs

被引：0

作者：

Xu, Xianghao ^{[1
,2
]}

Wang, Fang ^{[2
]}

Jiang, Hong ^{[3
]}

Cheng, Yongli ^{[4
,5
]}

Feng, Dan ^{[2
]}

Fang, Peng ^{[2
]}

机构：

[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China

[2] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China

[3] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA

[4] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China

[5] Zhejiang Lab, Hangzhou 311121, Peoples R China

来源：

FRONTIERS OF COMPUTER SCIENCE | 2024年 / 18卷 / 03期

基金：

中国国家自然科学基金;

关键词：

graph processing; disk I/O; concurrent jobs; FRAMEWORK;

D O I：

10.1007/s11704-023-2361-0

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In order to analyze and process the large graphs with high cost efficiency, researchers have developed a number of out-of-core graph processing systems in recent years based on just one commodity computer. On the other hand, with the rapidly growing need of analyzing graphs in the real-world, graph processing systems have to efficiently handle massive concurrent graph processing (CGP) jobs. Unfortunately, due to the inherent design for single graph processing job, existing out-of-core graph processing systems usually incur unnecessary data accesses and severe competition of I/O bandwidth when handling the CGP jobs. In this paper, we propose GraphCP, a disk I/O optimized out-of-core graph processing system that efficiently supports the processing of CGP jobs. GraphCP proposes a benefit-aware sharing execution model to share the I/O access and processing of graph data among the CGP jobs and adaptively schedule the graph data loading based on the states of vertices, which efficiently overcomes above challenges faced by existing out-of-core graph processing systems. Moreover, GraphCP adopts a dependency-based future-vertex updating model so as to reduce disk I/Os in the future iterations. In addition, GraphCP organizes the graph data with a Source-Sorted Sub-Block graph representation for better processing capacity and I/O access locality. Extensive evaluation results show that GraphCP is 20.5x and 8.9x faster than two out-of-core graph processing systems GridGraph and GraphZ, and 3.5x and 1.7x faster than two state-of-art concurrent graph processing systems Seraph and GraphSO.

引用

页数：17

共 50 条

[21] Optimizing the number of processing nodes and I/O nodes in a shared disk parallel database system: SPAX
Park, YK
Jin, S
Cho, SH
Lee, JS
HIGH PERFORMANCE COMPUTING ON THE INFORMATION SUPERHIGHWAY - HPC ASIA '97, PROCEEDINGS, 1997, : 667 - 671
[22] Glign: Taming Misaligned Graph Traversals in Concurrent Graph Processing
Yin, Xizhe
Zhao, Zhijia
Gupta, Rajiv
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 1, ASPLOS 2023, 2023, : 78 - 92
[23] A high resolution disk I/O trace system
Huang, Tao
Xu, Teng
Lu, Xianliang
Operating Systems Review (ACM), 2001, 35 (04): : 82 - 87
[24] Improving disk I/O performance in a virtualized system
Li, Dingding
Jin, Hai
Liao, Xiaofei
Zhang, Yu
Zhou, Bingbing
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2013, 79 (02) : 187 - 200
[25] Asymptotic dimension and the disk graph I
Hamenstaedt, Ursula
JOURNAL OF TOPOLOGY, 2019, 12 (02) : 658 - 673
[26] OPTIMIZED QUANTIZATION IN DISTRIBUTED GRAPH SIGNAL PROCESSING
Nobre, Isabela Cunha Maia
Frossard, Pascal
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5376 - 5380
[27] ScaleG: A Distributed Disk-Based System for Vertex-Centric Graph Processing
Wang, Xubo
Wen, Dong
Qin, Lu
Chang, Lijun
Zhang, Ying
Zhang, Wenjie
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 2019 - 2033
[28] Scheduling of jobs in a hypercube processing system
Khosla, I
Bhattacharya, S
Tsai, WT
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1996, 47 (05) : 626 - 639
[29] Scheduling of jobs in a hypercube processing system
Univ of Minnesota, United States
J Oper Res Soc, 5 (626-639):
[30] An I/O-Efficient Disk-based Graph System for Scalable Second-Order RandomWalk of Large Graphs
Li, Hongzheng
Shao, Yingxia
Du, Junping
Cui, Bin
Chen, Lei
PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (08): : 1619 - 1631

← 1 2 3 4 5 →