A Lookahead Read Cache: Improving Read Performance for Deduplication Backup Storage

被引:16
|
作者
Park, Dongchul [1 ]
Fan, Ziqi [1 ]
Nam, Young Jin [1 ]
Du, David H. C. [1 ]
机构
[1] Univ Minnesota Twin Cities, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
deduplication; dedupe; read cache; backup;
D O I
10.1007/s11390-017-1680-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data deduplication (dedupe for short) is a special data compression technique. It has been widely adopted to save backup time as well as storage space, particularly in backup storage systems. Therefore, most dedupe research has primarily focused on improving dedupe write performance. However, backup storage dedupe read performance is also a crucial problem for storage recovery. This paper designs a new dedupe storage read cache for backup applications that improves read performance by exploiting a special characteristic: the read sequence is the same as the write sequence. Consequently, for better cache utilization, by looking ahead for future references within a moving window, it evicts victims from the cache having the smallest future access. Moreover, to further improve read cache performance, it maintains a small log buffer to judiciously cache future access data chunks. Extensive experiments with real-world backup workloads demonstrate that the proposed read cache scheme improves read performance by up to 64.3%.
引用
收藏
页码:26 / 40
页数:15
相关论文
共 50 条
  • [1] A Lookahead Read Cache: Improving Read Performance for Deduplication Backup Storage
    Dongchul Park
    Ziqi Fan
    Young Jin Nam
    David H. C. Du
    [J]. Journal of Computer Science and Technology, 2017, 32 : 26 - 40
  • [2] Endurable SSD-Based Read Cache for Improving the Performance of Selective Restore from Deduplication Systems
    Jian Liu
    Yun-Peng Chai
    Xiao Qin
    Yao-Hong Liu
    [J]. Journal of Computer Science and Technology, 2018, 33 : 58 - 78
  • [3] Endurable SSD-Based Read Cache for Improving the Performance of Selective Restore from Deduplication Systems
    Liu, Jian
    Chai, Yun-Peng
    Qin, Xiao
    Liu, Yao-Hong
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2018, 33 (01) : 58 - 78
  • [4] Improving read performance with BP-DAGs for storage-efficient file backup
    Yang, Tianming
    Zhang, Jing
    Hao, Ningbo
    [J]. Open Electrical and Electronic Engineering Journal, 2013, 7 (01): : 90 - 97
  • [5] Improving Cache Performance Using Read-Write Partitioning
    Khan, Samira
    Alameldeen, Alaa R.
    Wilkerson, Chris
    Mutlu, Onur
    Jimenez, Daniel A.
    [J]. 2014 20TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA-20), 2014, : 452 - 463
  • [6] Read-Performance Optimization for Deduplication-Based Storage Systems in the Cloud
    Mao, Bo
    Jiang, Hong
    Wu, Suzhen
    Fu, Yinjin
    Tian, Lei
    [J]. ACM TRANSACTIONS ON STORAGE, 2014, 10 (02)
  • [7] A Read-leveling Data Distribution Scheme for Promoting Read Performance in SSDs with Deduplication
    Lu, Mengting
    Wang, Fang
    Feng, Dan
    Hu, Yuchong
    [J]. PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,
  • [8] Reducing Fragmentation for In-line Deduplication Backup Storage via Exploiting Backup History and Cache Knowledge
    Fu, Min
    Feng, Dan
    Hua, Yu
    He, Xubin
    Chen, Zuoning
    Liu, Jingning
    Xia, Wen
    Huang, Fangting
    Liu, Qing
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (03) : 855 - 868
  • [9] Improving Read Performance of SSDs via Balanced Redirected Read
    Liang, Jie
    Xu, Yinlong
    Sun, Dongdong
    Wu, Si
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON NETWORKING ARCHITECTURE AND STORAGE (NAS), 2016,
  • [10] Improving the Performance of Deduplication-Based Storage Cache via Content-Driven Cache Management Methods
    Tan, Yujuan
    Xu, Congcong
    Xie, Jing
    Yan, Zhichao
    Jiang, Hong
    Srisa-an, Witawas
    Chen, Xianzhang
    Liu, Duo
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (01) : 214 - 228