A disk I/O optimized system for concurrent graph processing jobs

Cited by: 0
Authors
Xu, Xianghao [1, 2]
Wang, Fang [2]
Jiang, Hong [3]
Cheng, Yongli [4, 5]
Feng, Dan [2]
Fang, Peng [2]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China
[3] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[4] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
[5] Zhejiang Lab, Hangzhou 311121, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
graph processing; disk I/O; concurrent jobs; framework
DOI
10.1007/s11704-023-2361-0
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
To analyze and process large graphs cost-efficiently, researchers have in recent years developed a number of out-of-core graph processing systems that run on a single commodity computer. Meanwhile, with the rapidly growing need to analyze real-world graphs, graph processing systems must efficiently handle massive numbers of concurrent graph processing (CGP) jobs. Unfortunately, because they are inherently designed for a single graph processing job, existing out-of-core graph processing systems usually incur unnecessary data accesses and severe competition for I/O bandwidth when handling CGP jobs. In this paper, we propose GraphCP, a disk I/O optimized out-of-core graph processing system that efficiently supports the processing of CGP jobs. GraphCP proposes a benefit-aware sharing execution model that shares the I/O access and processing of graph data among the CGP jobs and adaptively schedules the loading of graph data based on the states of vertices, which efficiently overcomes the above challenges faced by existing out-of-core graph processing systems. Moreover, GraphCP adopts a dependency-based future-vertex updating model to reduce disk I/Os in future iterations. In addition, GraphCP organizes the graph data with a Source-Sorted Sub-Block graph representation for better processing capacity and I/O access locality. Extensive evaluation results show that GraphCP is 20.5x and 8.9x faster than two out-of-core graph processing systems, GridGraph and GraphZ, and 3.5x and 1.7x faster than two state-of-the-art concurrent graph processing systems, Seraph and GraphSO.
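The general idea of sharing graph data loads among concurrent jobs, and skipping data that no job currently needs, can be illustrated with a small sketch. This is not GraphCP's implementation: the function `plan_shared_loads`, the block layout, and the job names are all hypothetical, and the rule "a job needs a block only if it has an active source vertex in it" is an assumption made for illustration.

```python
def plan_shared_loads(blocks, jobs):
    """Plan block loads for one iteration (illustrative only).

    blocks: dict block_id -> set of source vertex ids stored in that block
    jobs:   dict job_name -> set of vertex ids currently active in that job
    Returns (shared, naive_loads): shared maps each block that must be read
    to the jobs that consume it; naive_loads counts the reads that would
    occur if every job loaded its own blocks independently.
    """
    shared = {}
    naive_loads = 0
    for block_id, sources in blocks.items():
        # Assumed rule: a job needs a block only if one of its active
        # vertices is a source vertex stored in that block.
        needing = [name for name, active in jobs.items() if active & sources]
        naive_loads += len(needing)
        if needing:
            # One disk read serves every job that needs this block.
            shared[block_id] = needing
    return shared, naive_loads


# Hypothetical data: three blocks and three concurrent jobs.
blocks = {0: {0, 1}, 1: {2, 3}, 2: {4, 5}}
jobs = {"pagerank": {0, 1, 2, 3}, "bfs": {2}, "wcc": {0, 3}}
shared, naive = plan_shared_loads(blocks, jobs)
print(len(shared), naive)  # prints "2 5"
```

In this toy input, independent loading would issue five block reads, while shared loading issues two, and block 2 is skipped entirely because no job has an active vertex in it.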
Pages: 17