Improving reading performance by file prefetching mechanism in distributed cache systems

被引:0
|
作者
Gui, Jing [1 ]
Wang, Yongbin [1 ]
Shuai, Wuyue [1 ]
机构
[1] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
关键词
Alluxio; cache systems; cost-benefit analysis; file prefetching; WaveNet; ACCESS PREDICTION;
D O I
10.1002/cpe.8215
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Distributed cache systems are utilized to enhance I/O performance between computing applications and storage systems. However, the traditional file access predictors employed in these cache systems are only suitable for workloads with simple file access patterns, rendering them inadequate for the complex access patterns found in big data computing scenarios. In this article, we propose a file access predictor (DFAP) based on WaveNet, which has exhibited promising results in file access tasks when compared to other baseline models. Cache systems are often constrained by limited cache space due to cost, cluster size, and other factors. In big data scenarios, cached data and prefetched data often compete for limited space. To address this issue, we introduce a cache prefetching algorithm (CBAP) for cache systems, which is based on cost-benefit analysis to improve cache utilization. Furthermore, we implement a novel file prefetching framework on Alluxio, which accelerates computing jobs by up to 18%.
引用
收藏
页数:22
相关论文
共 50 条
  • [32] Design considerations of high performance data cache with prefetching
    Chi, CH
    Yuan, YL
    [J]. EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 1243 - 1250
  • [33] An intelligent cache system with hardware prefetching for high performance
    Lee, JH
    Jeong, SW
    Kim, SD
    Weems, CC
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2003, 52 (05) : 607 - 616
  • [34] Adaptive Dual-Cache Scheme with dynamic prefetching scheme in parallel file system
    Kim, CY
    Cho, JH
    Seo, DW
    [J]. CLUSTER 2000: IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, PROCEEDINGS, 2000, : 361 - 362
  • [35] APS: adaptable prefetching scheme to different running environments for concurrent read streams in distributed file systems
    Lee, Sangmin
    Hyun, Soon J.
    Kim, Hong-Yeon
    Kim, Young-Kyun
    [J]. JOURNAL OF SUPERCOMPUTING, 2018, 74 (06): : 2870 - 2902
  • [36] Modeling of Distributed File Systems for Practical Performance Analysis
    Wu, Yongwei
    Ye, Feng
    Chen, Kang
    Zheng, Weimin
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (01) : 156 - 166
  • [37] PRACTICAL PREFETCHING TECHNIQUES FOR MULTIPROCESSOR FILE-SYSTEMS
    KOTZ, D
    ELLIS, CS
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 1993, 1 (01) : 33 - 51
  • [38] EEPC: Energy-Efficient Persistent Cache Scheme for Mobile Distributed File Systems
    Li, Hang
    Li, Wentong
    Lv, Yina
    Liu, Jialin
    Yang, Long
    Shi, Liang
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (09): : 15998 - 16008
  • [39] Performance increase mechanisms for parallel and distributed file systems
    Universidad Politecnica de Madrid, Boadilla del Monte Madrid, Spain
    [J]. Parallel Comput, 4-5 (525-542):
  • [40] PERFORMANCE RELIABILITY ISSUES IN DISTRIBUTED FILE-SYSTEMS
    HAC, A
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 1986, 6 (03) : 219 - 224