Improving reading performance by file prefetching mechanism in distributed cache systems

Authors
Gui, Jing [1]
Wang, Yongbin [1]
Shuai, Wuyue [1]
Affiliations
[1] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
Keywords
Alluxio; cache systems; cost-benefit analysis; file prefetching; WaveNet; access prediction
DOI
10.1002/cpe.8215
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Distributed cache systems are utilized to enhance I/O performance between computing applications and storage systems. However, the traditional file access predictors employed in these cache systems are only suitable for workloads with simple file access patterns, rendering them inadequate for the complex access patterns found in big data computing scenarios. In this article, we propose a file access predictor (DFAP) based on WaveNet, which has exhibited promising results in file access tasks when compared to other baseline models. Cache systems are often constrained by limited cache space due to cost, cluster size, and other factors. In big data scenarios, cached data and prefetched data often compete for limited space. To address this issue, we introduce a cache prefetching algorithm (CBAP) for cache systems, which is based on cost-benefit analysis to improve cache utilization. Furthermore, we implement a novel file prefetching framework on Alluxio, which accelerates computing jobs by up to 18%.
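The abstract does not spell out how CBAP weighs cached data against prefetched data, so the following is only a minimal illustrative sketch of the general idea of cost-benefit prefetch admission, not the authors' algorithm. Every name in it (`FileInfo`, `access_prob`, `miss_penalty_s`, `should_prefetch`) is a hypothetical introduced for illustration. It assumes a predictor supplies an access probability per file, and it admits a prefetch candidate only when its expected time saving exceeds the expected saving lost by evicting cached files to make room.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class FileInfo:
    name: str
    size_mb: float          # assumed to be positive
    access_prob: float      # hypothetical predictor output: chance of access soon
    miss_penalty_s: float   # extra seconds to read from remote storage vs. cache


def expected_benefit(f: FileInfo) -> float:
    """Expected seconds saved by having this file in the cache."""
    return f.access_prob * f.miss_penalty_s


def should_prefetch(candidate: FileInfo,
                    cached: List[FileInfo],
                    free_mb: float) -> Tuple[bool, List[FileInfo]]:
    """Return (decision, victims) for one prefetch candidate.

    Illustrative rule (an assumption, not taken from the paper):
    evict the cached files with the lowest benefit per MB until the
    candidate fits, and prefetch only if the candidate's expected
    benefit exceeds the total benefit of the evicted files.
    """
    if candidate.size_mb <= free_mb:
        # Free space is available, so nothing has to be evicted.
        return expected_benefit(candidate) > 0, []

    victims: List[FileInfo] = []
    reclaimed = free_mb
    lost = 0.0
    for f in sorted(cached, key=lambda f: expected_benefit(f) / f.size_mb):
        if reclaimed >= candidate.size_mb:
            break
        victims.append(f)
        reclaimed += f.size_mb
        lost += expected_benefit(f)

    if reclaimed < candidate.size_mb:
        return False, []   # the candidate cannot fit even with an empty cache
    if expected_benefit(candidate) > lost:
        return True, victims
    return False, []


cached = [FileInfo("a", 100, 0.1, 2.0), FileInfo("b", 100, 0.9, 2.0)]
small = FileInfo("c", 100, 0.5, 2.0)
large = FileInfo("d", 200, 0.5, 2.0)

ok_small, victims_small = should_prefetch(small, cached, free_mb=0)
ok_large, victims_large = should_prefetch(large, cached, free_mb=0)
```

In this toy setup, file `c` would displace only the rarely used file `a`, while file `d` would require evicting both cached files and is rejected. A real system would also need to account for transfer cost and prediction error, which this sketch ignores.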
Pages: 22