EAD: elasticity aware deduplication manager for datacenters with multi-tier storage systems

Cited by: 10
Authors
Yang, Zhengyu [1]
Wang, Yufeng [2]
Bhamini, Janki [1]
Tan, Chiu C. [3]
Mi, Ningfang [1]
Affiliations
[1] Northeastern Univ, 360 Huntington Ave, Boston, MA 02115 USA
[2] Temple Univ, 1801 N Broad St, Philadelphia, PA 19122 USA
[3] Temple Univ, Dept Comp & Informat Sci, 1801 N Broad St, Philadelphia, PA 19122 USA
Keywords
Deduplication estimation; Scalability; Migration; Cloud storage systems; Fusion disk; Adaptive dynamical sampling; Cluster computing; Cloud computing
DOI
10.1007/s10586-018-2141-z
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The popularity of Big Data applications puts pressure on storage systems to scale efficiently to meet demand. At the same time, new developments such as solid-state drives have changed the traditional storage hierarchy. Cloud storage systems are transitioning towards a hybrid architecture consisting of large amounts of memory, solid-state disks (SSDs), and traditional magnetic hard disks (HDs). This paper presents elasticity aware deduplication (EAD), a data deduplication framework designed for multi-tier cloud storage architectures consisting of SSDs and HDs. EAD dynamically adjusts the deduplication parameters at runtime in order to improve performance. Experimental results indicate that EAD detects more than 98% of all duplicate data while consuming less than 5% of the expected memory space. Additionally, EAD saves approximately 74% of the overall I/O access cost compared to the traditional design.
Pages: 1561-1579
Number of pages: 19
Related papers
50 in total
  • [1] EAD: elasticity aware deduplication manager for datacenters with multi-tier storage systems
    Zhengyu Yang
    Yufeng Wang
    Janki Bhamini
    Chiu C. Tan
    Ningfang Mi
    [J]. Cluster Computing, 2018, 21 : 1561 - 1579
  • [2] Workload-Aware Placement of Multi-Tier Applications in Virtualized Datacenters
    RahimiZadeh, Keyvan
    AnaLoui, Morteza
    Kabiri, Peyman
    Javadi, Bahman
    [J]. COMPUTER JOURNAL, 2017, 60 (02) : 210 - 239
  • [3] Improving recoverability in multi-tier storage systems
    Aguilera, Marcos K.
    Keeton, Kimberly
    Merchant, Arif
    Muniswamy-Reddy, Kiran-Kumar
    Uysal, Mustafa
    [J]. 37TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2007 : 677 - +
  • [4] Enabling cost-aware and adaptive elasticity of multi-tier cloud applications
    Han, Rui
    Ghanem, Moustafa M.
    Guo, Li
    Guo, Yike
    Osmond, Michelle
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 32 : 82 - 98
  • [5] pWebDAV: a multi-tier storage system
    Filippidis, Christos
    Cotronis, Yiannis
    [J]. 2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), 2018 : 643 - 650
  • [6] Multi-tier Shuttle-based Storage and Retrieval Systems
    Lerher, Tone
    [J]. FME TRANSACTIONS, 2016, 44 (03) : 285 - 290
  • [7] Adaptive multi-tier intelligent data manager for Exascale
    Carretero, Jesus
    Garcia-Blas, Javier
    Aldinucci, Marco
    Besnard, Jean Baptiste
    Acquaviva, Jean-Thomas
    Brinkmann, Andre
    Vef, Marc-Andre
    Jeannot, Emmanuel
    Miranda, Alberto
    Nou, Ramon
    Riedel, Morris
    Torquati, Massimo
    Wolf, Felix
    [J]. PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2023, CF 2023, 2023 : 285 - 290
  • [8] Optimal disk storage allocation for multi-tier storage system
    Shi, Haixiang
    Arumugam, Rajesh Vellore
    Foh, Chuan Heng
    Khaing, Kyawt Kyawt
    [J]. 2012 DIGEST ASIA-PACIFIC MAGNETIC RECORDING CONFERENCE (APMRC), 2012
  • [9] Towards a fast multi-tier storage system simulator
    San-Lucas, Cesar
    Abad, Cristina L.
    [J]. 2016 IEEE ECUADOR TECHNICAL CHAPTERS MEETING (ETCM), 2016
  • [10] Predicting failures in multi-tier distributed systems
    Mariani, Leonardo
    Pezze, Mauro
    Riganelli, Oliviero
    Xin, Rui
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 161