FileScale: Fast and Elastic Metadata Management for Distributed File Systems

Authors
Liao, Gang [1]
Abadi, Daniel J. [2]
Affiliations
[1] ByteDance Infrastruct Syst Lab, San Jose, CA 95050 USA
[2] Univ Maryland, College Pk, MD USA
Funding
National Science Foundation (USA)
Keywords
Distributed File System; Metadata Management; Elastic Computing; Distributed Database
DOI
10.1145/3620678.3624784
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
File systems that store metadata on a single machine or via a shared-disk abstraction face scalability challenges, especially when they must manage billions of files. Recent work has shown that employing a shared-nothing distributed database system (DDBMS) for metadata storage can alleviate these scalability challenges without compromising high-availability guarantees. However, for small-scale deployments, where metadata can fit in memory on a single machine, these DDBMS-based systems typically perform an order of magnitude worse than systems that store metadata in memory on a single machine. This has limited the impact of these distributed database approaches, since they are currently applicable only to file systems of extreme scale. This paper describes FileScale, a three-tier architecture that incorporates a DDBMS as part of a comprehensive approach to file system metadata management. In contrast to previous approaches, FileScale performs comparably to the single-machine architecture at small scale, while enabling linear scalability as the file system metadata grows.
Pages: 459-474
Page count: 16