A disk I/O optimized system for concurrent graph processing jobs

被引:0
|
作者
Xu, Xianghao [1 ,2 ]
Wang, Fang [2 ]
Jiang, Hong [3 ]
Cheng, Yongli [4 ,5 ]
Feng, Dan [2 ]
Fang, Peng [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China
[3] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[4] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
[5] Zhejiang Lab, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
graph processing; disk I/O; concurrent jobs; FRAMEWORK;
D O I
10.1007/s11704-023-2361-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to analyze and process the large graphs with high cost efficiency, researchers have developed a number of out-of-core graph processing systems in recent years based on just one commodity computer. On the other hand, with the rapidly growing need of analyzing graphs in the real-world, graph processing systems have to efficiently handle massive concurrent graph processing (CGP) jobs. Unfortunately, due to the inherent design for single graph processing job, existing out-of-core graph processing systems usually incur unnecessary data accesses and severe competition of I/O bandwidth when handling the CGP jobs. In this paper, we propose GraphCP, a disk I/O optimized out-of-core graph processing system that efficiently supports the processing of CGP jobs. GraphCP proposes a benefit-aware sharing execution model to share the I/O access and processing of graph data among the CGP jobs and adaptively schedule the graph data loading based on the states of vertices, which efficiently overcomes above challenges faced by existing out-of-core graph processing systems. Moreover, GraphCP adopts a dependency-based future-vertex updating model so as to reduce disk I/Os in the future iterations. In addition, GraphCP organizes the graph data with a Source-Sorted Sub-Block graph representation for better processing capacity and I/O access locality. Extensive evaluation results show that GraphCP is 20.5x and 8.9x faster than two out-of-core graph processing systems GridGraph and GraphZ, and 3.5x and 1.7x faster than two state-of-art concurrent graph processing systems Seraph and GraphSO.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Optimizing the number of processing nodes and I/O nodes in a shared disk parallel database system: SPAX
    Park, YK
    Jin, S
    Cho, SH
    Lee, JS
    HIGH PERFORMANCE COMPUTING ON THE INFORMATION SUPERHIGHWAY - HPC ASIA '97, PROCEEDINGS, 1997, : 667 - 671
  • [22] Glign: Taming Misaligned Graph Traversals in Concurrent Graph Processing
    Yin, Xizhe
    Zhao, Zhijia
    Gupta, Rajiv
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 1, ASPLOS 2023, 2023, : 78 - 92
  • [23] A high resolution disk I/O trace system
    Huang, Tao
    Xu, Teng
    Lu, Xianliang
    Operating Systems Review (ACM), 2001, 35 (04): : 82 - 87
  • [24] Improving disk I/O performance in a virtualized system
    Li, Dingding
    Jin, Hai
    Liao, Xiaofei
    Zhang, Yu
    Zhou, Bingbing
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2013, 79 (02) : 187 - 200
  • [25] Asymptotic dimension and the disk graph I
    Hamenstaedt, Ursula
    JOURNAL OF TOPOLOGY, 2019, 12 (02) : 658 - 673
  • [26] OPTIMIZED QUANTIZATION IN DISTRIBUTED GRAPH SIGNAL PROCESSING
    Nobre, Isabela Cunha Maia
    Frossard, Pascal
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5376 - 5380
  • [27] ScaleG: A Distributed Disk-Based System for Vertex-Centric Graph Processing
    Wang, Xubo
    Wen, Dong
    Qin, Lu
    Chang, Lijun
    Zhang, Ying
    Zhang, Wenjie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 2019 - 2033
  • [28] Scheduling of jobs in a hypercube processing system
    Khosla, I
    Bhattacharya, S
    Tsai, WT
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1996, 47 (05) : 626 - 639
  • [29] Scheduling of jobs in a hypercube processing system
    Univ of Minnesota, United States
    J Oper Res Soc, 5 (626-639):
  • [30] An I/O-Efficient Disk-based Graph System for Scalable Second-Order RandomWalk of Large Graphs
    Li, Hongzheng
    Shao, Yingxia
    Du, Junping
    Cui, Bin
    Chen, Lei
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (08): : 1619 - 1631