StageFS: A Parallel File System Optimizing Metadata Performance for SSD Based Clusters

被引:0
|
作者
Wu, Huijun [1 ]
Zhu, Liming [1 ]
Wu, Dongyao [1 ]
Lu, Kai [2 ]
Li, Gen [2 ]
机构
[1] Univ New South Wales, Data61, CSIRO, Kensington, NSW, Australia
[2] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
基金
美国国家科学基金会;
关键词
parallel file system; metadata; LSM-tree; small file;
D O I
10.1109/TrustCom.2016.328
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Parallel file systems are important infrastructures for both cloud and high performance computing. The performance of metadata operations is critical to achieve high scalability in parallel file systems. Nevertheless, traditional parallel file systems are lack of scalable metadata service. To alleviate these problems, some previous research distributes metadata to separated large-scale clusters and uses write-optimized techniques like log-structured merge tree (LSM-tree) to store metadata. However, LSM-tree design does not consider the features of solid state drive devices (SSD) which are widely deployed in modern parallel computing systems. The design of using LSM-trees to store metadata has not explored the potential benefits of SSD devices. In this paper, we present StageFS, which is a parallel file system optimized for SSD based clusters. StageFS stores both the metadata and small files in LSM-trees for fast indexing. For larger files, the file blocks are separately stored to reduce the write amplifications. In addition, the parallel I/O feature of SSD devices is used to improve the performance of accessing directories and large files. To avoid frequent small writes, StageFS uses buffering to better utilize the bandwidth of SSD devices. Experimental results show that StageFS provides better performance in metadata operations (up to 21.28x) and small file access (1.92x to two orders of magnitude) compared with Ceph and HDFS.
引用
收藏
页码:2147 / 2152
页数:6
相关论文
共 50 条
  • [41] A Novel Metadata Management Architecture Based on Service Separation in Cluster File System
    Zhang, Junwei
    Zhang, Jingliang
    Zhang, Jiangang
    Han, Xiaoming
    Xu, Lu
    2009 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT 2009), 2009, : 110 - 115
  • [42] Research on the Metadata Storage Mode and Efficiency of Distributed File System Based on HGML
    Miao Fang
    Cheng Fu-chao
    Yang Wen-hui
    Tan Li
    ADVANCES IN SCIENCE AND ENGINEERING, PTS 1 AND 2, 2011, 40-41 : 221 - 227
  • [43] A parallel file system based on spatial information object
    Huang, KY
    Li, GQ
    Liu, DS
    Zhang, WY
    NETWORK AND PARALLEL COMPUTING, PROCEEDINGS, 2005, 3779 : 153 - 162
  • [44] Efficient Logging of Metadata Using NVRAM for NAND Flash based File System
    Lee, Chul
    Lim, Seung-Ho
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (01) : 86 - 94
  • [45] Efficient Logging of Metadata using NVRAM for NAND Flash based File System
    Lee, Chul
    Lim, Seung-Ho
    2012 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2012, : 453 - +
  • [46] The Research and Implementation of Metadata Cache Backup Technology Based on CEPH File System
    Zhan, Ling
    Fang, Xieyun
    Li, Duping
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2016), 2016, : 72 - 77
  • [47] PASS - A MULTIUSER PARALLEL FILE SYSTEM BASED ON MICROCOMPUTERS
    MILLER, LL
    INGLETT, SR
    HURSON, AR
    JOURNAL OF SYSTEMS AND SOFTWARE, 1992, 19 (01) : 75 - 83
  • [48] Optimizing Energy and Performance for Server-Class File System Workloads
    Sehgal, Priya
    Tarasov, Vasily
    Zadok, Erez
    ACM TRANSACTIONS ON STORAGE, 2010, 6 (03)
  • [49] hybridFS: Integrating NAND Flash-Based SSD and HDD for Hybrid File System
    Suk, Jinsun
    No, Jaechun
    NEW ASPECTS OF SYSTEMS THEORY AND SCIENTIFIC COMPUTATION, 2010, : 178 - +
  • [50] High-Performance Metadata Integrity Protection in the WAFL Copy-on-Write File System
    Kumar, Harendra
    Patel, Yuvraj
    Kesavan, Ram
    Makam, Sumith
    PROCEEDINGS OF FAST '17: 15TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2017, : 197 - 211