Aurora: Adaptive Block Replication in Distributed File Systems

被引:10
|
作者
Zhang, Qi [1 ]
Zhang, Sai Qian [1 ]
Leon-Garcia, Alberto [1 ]
Boutaba, Raouf [2 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 1A1, Canada
[2] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
关键词
D O I
10.1109/ICDCS.2015.52
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed file systems such as Google File System and Hadoop Distributed File System have been used to store large volumes of data in Cloud data centers. These systems divide data sets in blocks of fixed size and replicate them over multiple machines to achieve both reliability and efficiency. Recent studies have shown that data blocks tend to have a wide disparity in data popularity. In this context, the naive block replication schemes used by these systems often cause an uneven load distribution across machines, which reduces the overall I/O throughput of the system. While many replication algorithms have been proposed, existing solutions have not carefully studied the placement of data blocks that balances the load across machines, while ensuring node and rack-level reliability requirements are satisfied. In this paper, we study the dynamic data replication problem with the goal of balancing machine load while ensuring machine and rack-level reliability requirements are met. We propose several local search algorithms that provide constant approximation guarantees, yet simple and practical for implementation. We further present Aurora, a dynamic block placement mechanism that implements these algorithms in the Hadoop Distributed File System with minimal overhead. Through experiments using workload traces from Yahoo! and Facebook, we show Aurora reduces machine load imbalance by up to 26.9% compared to existing solutions, while satisfying node and rack-level reliability requirements.
引用
收藏
页码:442 / 451
页数:10
相关论文
共 50 条
  • [1] Adaptive Replica Synchronization for Distributed File Systems
    Liao, Jianwei
    Li, Li
    Chen, Huaidong
    Liu, Xiaoyan
    [J]. IEEE SYSTEMS JOURNAL, 2015, 9 (03): : 865 - 877
  • [2] Block Placement in Distributed File Systems Based on Block Access Frequency
    Liao, Jianwei
    Cai, Zhigang
    Trahay, Francois
    Peng, Xiaoning
    [J]. IEEE ACCESS, 2018, 6 : 38411 - 38420
  • [3] Cost Based Approach to Block Placement for Distributed File Systems
    Srinivasan, Lakshminarayanan
    Varma, Vasudeva
    [J]. 2014 INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD), 2014, : 132 - 138
  • [4] ALGORITHMS FOR FILE REPLICATION IN A DISTRIBUTED SYSTEM
    HAC, A
    JIN, XW
    SOO, JH
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 1991, 14 (03) : 173 - 181
  • [5] ALDM: Adaptive Loading Data Migration in Distributed File Systems
    Tan, Zhipeng
    Zhou, Wei
    Feng, Dan
    Zhang, Wenhua
    [J]. IEEE TRANSACTIONS ON MAGNETICS, 2013, 49 (06) : 2645 - 2652
  • [6] Block I/O Scheduling on Storage Servers of Distributed File Systems
    Liao, Jianwei
    Yin, Dong
    Peng, Xiaoning
    [J]. JOURNAL OF GRID COMPUTING, 2018, 16 (02) : 299 - 316
  • [7] Block I/O Scheduling on Storage Servers of Distributed File Systems
    Jianwei Liao
    Dong Yin
    Xiaoning Peng
    [J]. Journal of Grid Computing, 2018, 16 : 299 - 316
  • [8] An Efficient and Adaptive Decentralized File Replication Algorithm in P2P File Sharing Systems
    Shen, Haiying
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2010, 21 (06) : 827 - 840
  • [9] Distributed file systems and distributed memory
    Doeppner, TW
    [J]. ACM COMPUTING SURVEYS, 1996, 28 (01) : 229 - 231
  • [10] EAD: An Efficient and Adaptive Decentralized File Replication Algorithm in P2P File Sharing Systems
    Shen, Haiying
    [J]. P2P'08: EIGHTH INTERNATIONAL CONFERENCE ON PEER-TO-PEER COMPUTING, PROCEEDINGS, 2008, : 99 - 108