PLC-Cache: Endurable SSD Cache for Deduplication-based Primary Storage

被引:0
|
作者
Liu, Jian [1 ,2 ,3 ]
Chai, Yunpeng [1 ,2 ]
Qin, Xiao [4 ]
Xiao, Yuan [2 ]
机构
[1] MOE, Key Lab Data Engn & Knowledge Engn, Beijing, Peoples R China
[2] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[3] Natl Comp Syst Engn Res Inst China, Beijing, Peoples R China
[4] Auburn Univ, Samuel Ginn Coll Engn, Dept Comp Sci & Software Engn, Auburn, AL USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Deduplication; SSD; Cache; Endurance; Primary Storage;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data deduplication techniques improve cost efficiency by dramatically reducing space needs of storage systems. SSD-based data cache has been adopted to remedy the declining I/O performance induced by deduplication operations in the latency-sensitive primary storage. Unfortunately, frequent data updates caused by classical cache algorithms (e. g., FIFO, LRU, and LFU) inevitably slow down SSDs' I/O processing speed while significantly shortening SSDs' lifetime. To address this problem, we propose a new approach - PLC-Cache - to greatly improve the I/O performance as well as write durability of SSDs. PLC-Cache is conducive to amplifying the proportion of the Popular and Long-term Cached (PLC) data, which is infrequently written and kept in SSD cache in a long time period to catalyze cache hits, in an entire SSD written data set. PLC-Cache advocates a two-phase approach. First, non-popular data are ruled out from being written into SSDs. Second, PLC-Cache makes an effort to convert SSD written data into PLC-data as much as possible. Our experimental results based on a practical deduplication system indicate that compared with the existing caching schemes, PLC-Cache shortens data access latency by an average of 23.4%. Importantly, PLC-Cache improves the lifetime of SSD-based caches by reducing the amount of data written to SSDs by a factor of 15.7.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Improving the Performance of Deduplication-Based Storage Cache via Content-Driven Cache Management Methods
    Tan, Yujuan
    Xu, Congcong
    Xie, Jing
    Yan, Zhichao
    Jiang, Hong
    Srisa-an, Witawas
    Chen, Xianzhang
    Liu, Duo
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (01) : 214 - 228
  • [2] Endurable SSD-Based Read Cache for Improving the Performance of Selective Restore from Deduplication Systems
    Liu, Jian
    Chai, Yun-Peng
    Qin, Xiao
    Liu, Yao-Hong
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2018, 33 (01) : 58 - 78
  • [3] Endurable SSD-Based Read Cache for Improving the Performance of Selective Restore from Deduplication Systems
    Jian Liu
    Yun-Peng Chai
    Xiao Qin
    Yao-Hong Liu
    [J]. Journal of Computer Science and Technology, 2018, 33 : 58 - 78
  • [4] Improving RAID Performance Using an Endurable SSD Cache
    Li, Chu
    Feng, Dan
    Hua, Yu
    Wang, Fang
    [J]. PROCEEDINGS 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - ICPP 2016, 2016, : 396 - 405
  • [5] A high-performance and endurable SSD cache for parity-based RAID
    Chu Li
    Dan Feng
    Yu Hua
    Fang Wang
    [J]. Frontiers of Computer Science, 2019, 13 : 16 - 34
  • [6] A high-performance and endurable SSD cache for parity-based RAID
    Li, Chu
    Feng, Dan
    Hua, Yu
    Wang, Fang
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2019, 13 (01) : 16 - 34
  • [7] DM-Pages: Improving Energy Efficiency of Disk Storage Systems and Cache Performance Using Deduplication-based Mixed Pages
    Si, Lei
    Pang, Shujie
    Deng, Yuhui
    Zhu, Weiheng
    Zhou, Yi
    Zhu, Yifeng
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (16)
  • [8] NCache: A cache algorithm of SSD in storage system
    Yang, Heng
    [J]. Metallurgical and Mining Industry, 2015, 7 (08): : 456 - 462
  • [9] CareDedup: Cache-aware Deduplication for Reading Performance Optimization in Primary Storage
    Lin, Bin
    Li, Shanshan
    Liao, Xiangke
    Liu, Xiaodong
    Zhang, Jing
    Jia, Zhouyang
    [J]. 2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016), 2016, : 1 - 10
  • [10] A Duplication-Aware SSD-Based Cache Architecture for Primary Storage in Virtualization Environment
    Chen, Xian
    Chen, Wenzhi
    Lu, Zhongyong
    Long, Peng
    Yang, Shuiqiao
    Wang, Zonghui
    [J]. IEEE SYSTEMS JOURNAL, 2017, 11 (04): : 2578 - 2589