A disk I/O optimized system for concurrent graph processing jobs

被引:0
|
作者
Xianghao Xu
Fang Wang
Hong Jiang
Yongli Cheng
Dan Feng
Peng Fang
机构
[1] Nanjing University of Science and Technology,School of Computer Science and Engineering
[2] Huazhong University of Science and Technology,Wuhan National Laboratory for Optoelectronics
[3] University of Texas at Arlington,Department of Computer Science & Engineering
[4] Fuzhou University,College of Computer and Data Science
[5] Zhejiang Lab,undefined
来源
关键词
graph processing; disk I/O; concurrent jobs;
D O I
暂无
中图分类号
学科分类号
摘要
In order to analyze and process the large graphs with high cost efficiency, researchers have developed a number of out-of-core graph processing systems in recent years based on just one commodity computer. On the other hand, with the rapidly growing need of analyzing graphs in the real-world, graph processing systems have to efficiently handle massive concurrent graph processing (CGP) jobs. Unfortunately, due to the inherent design for single graph processing job, existing out-of-core graph processing systems usually incur unnecessary data accesses and severe competition of I/O bandwidth when handling the CGP jobs. In this paper, we propose GraphCP, a disk I/O optimized out-of-core graph processing system that efficiently supports the processing of CGP jobs. GraphCP proposes a benefit-aware sharing execution model to share the I/O access and processing of graph data among the CGP jobs and adaptively schedule the graph data loading based on the states of vertices, which efficiently overcomes above challenges faced by existing out-of-core graph processing systems. Moreover, GraphCP adopts a dependency-based future-vertex updating model so as to reduce disk I/Os in the future iterations. In addition, GraphCP organizes the graph data with a Source-Sorted Sub-Block graph representation for better processing capacity and I/O access locality. Extensive evaluation results show that GraphCP is 20.5× and 8.9× faster than two out-of-core graph processing systems GridGraph and GraphZ, and 3.5× and 1.7× faster than two state-of-art concurrent graph processing systems Seraph and GraphSO.
引用
收藏
相关论文
共 50 条
  • [21] Optimizing the number of processing nodes and I/O nodes in a shared disk parallel database system: SPAX
    Park, YK
    Jin, S
    Cho, SH
    Lee, JS
    HIGH PERFORMANCE COMPUTING ON THE INFORMATION SUPERHIGHWAY - HPC ASIA '97, PROCEEDINGS, 1997, : 667 - 671
  • [22] Glign: Taming Misaligned Graph Traversals in Concurrent Graph Processing
    Yin, Xizhe
    Zhao, Zhijia
    Gupta, Rajiv
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 1, ASPLOS 2023, 2023, : 78 - 92
  • [23] A high resolution disk I/O trace system
    Huang, Tao
    Xu, Teng
    Lu, Xianliang
    Operating Systems Review (ACM), 2001, 35 (04): : 82 - 87
  • [24] Improving disk I/O performance in a virtualized system
    Li, Dingding
    Jin, Hai
    Liao, Xiaofei
    Zhang, Yu
    Zhou, Bingbing
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2013, 79 (02) : 187 - 200
  • [25] Asymptotic dimension and the disk graph I
    Hamenstaedt, Ursula
    JOURNAL OF TOPOLOGY, 2019, 12 (02) : 658 - 673
  • [26] OPTIMIZED QUANTIZATION IN DISTRIBUTED GRAPH SIGNAL PROCESSING
    Nobre, Isabela Cunha Maia
    Frossard, Pascal
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5376 - 5380
  • [27] ScaleG: A Distributed Disk-Based System for Vertex-Centric Graph Processing
    Wang, Xubo
    Wen, Dong
    Qin, Lu
    Chang, Lijun
    Zhang, Ying
    Zhang, Wenjie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 2019 - 2033
  • [28] Scheduling of jobs in a hypercube processing system
    Khosla, I
    Bhattacharya, S
    Tsai, WT
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1996, 47 (05) : 626 - 639
  • [29] Scheduling of jobs in a hypercube processing system
    Univ of Minnesota, United States
    J Oper Res Soc, 5 (626-639):
  • [30] An I/O-Efficient Disk-based Graph System for Scalable Second-Order RandomWalk of Large Graphs
    Li, Hongzheng
    Shao, Yingxia
    Du, Junping
    Cui, Bin
    Chen, Lei
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (08): : 1619 - 1631