StageFS: A Parallel File System Optimizing Metadata Performance for SSD Based Clusters

被引:0
|
作者
Wu, Huijun [1 ]
Zhu, Liming [1 ]
Wu, Dongyao [1 ]
Lu, Kai [2 ]
Li, Gen [2 ]
机构
[1] Univ New South Wales, Data61, CSIRO, Kensington, NSW, Australia
[2] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
基金
美国国家科学基金会;
关键词
parallel file system; metadata; LSM-tree; small file;
D O I
10.1109/TrustCom.2016.328
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Parallel file systems are important infrastructures for both cloud and high performance computing. The performance of metadata operations is critical to achieve high scalability in parallel file systems. Nevertheless, traditional parallel file systems are lack of scalable metadata service. To alleviate these problems, some previous research distributes metadata to separated large-scale clusters and uses write-optimized techniques like log-structured merge tree (LSM-tree) to store metadata. However, LSM-tree design does not consider the features of solid state drive devices (SSD) which are widely deployed in modern parallel computing systems. The design of using LSM-trees to store metadata has not explored the potential benefits of SSD devices. In this paper, we present StageFS, which is a parallel file system optimized for SSD based clusters. StageFS stores both the metadata and small files in LSM-trees for fast indexing. For larger files, the file blocks are separately stored to reduce the write amplifications. In addition, the parallel I/O feature of SSD devices is used to improve the performance of accessing directories and large files. To avoid frequent small writes, StageFS uses buffering to better utilize the bandwidth of SSD devices. Experimental results show that StageFS provides better performance in metadata operations (up to 21.28x) and small file access (1.92x to two orders of magnitude) compared with Ceph and HDFS.
引用
收藏
页码:2147 / 2152
页数:6
相关论文
共 50 条
  • [21] ADAPTIVE TRADEOFF IN METADATA-BASED SMALL FILE OPTIMIZATIONS FOR A CLUSTER FILE SYSTEM
    Li, Xiuqiao
    Dong, Bin
    Xiao, Limin
    Ruan, Li
    INTERNATIONAL JOURNAL OF NUMERICAL ANALYSIS AND MODELING, 2012, 9 (02) : 289 - 303
  • [22] Erratum to: ONFS: a hierarchical hybrid file system based on memory, SSD, and HDD for high performance computers
    Xin Liu
    Yu-tong Lu
    Jie Yu
    Peng-fei Wang
    Jie-ting Wu
    Ying Lu
    Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 308 - 308
  • [23] Optimizing Read and Write Performance Based on Deep Understanding of SSD
    Liu, Xin
    Lu, Yutong
    Yu, Jie
    Lu, Ying
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 2607 - 2616
  • [24] Efficient and Consistent NVMM Cache for SSD-Based File System
    Chen, Youmin
    Lu, Youyou
    Chen, Pei
    Shu, Jiwu
    IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (08) : 1147 - 1158
  • [25] Optimizing Performance for Open-Channel SSD in Cloud Storage System
    Zhang, Xiaoyi
    Zhu, Feng
    Li, Shu
    Wang, Kun
    Xu, Wei
    Xu, Dengcai
    2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 902 - 911
  • [26] Optimizations based on hints in a parallel file system
    Pérez, MS
    Sánchez, A
    Robles, V
    Peña, JM
    Pérez, F
    COMPUTATIONAL SCIENCE - ICCS 2004, PT 3, PROCEEDINGS, 2004, 3038 : 347 - 354
  • [27] Design and evaluation of a high performance parallel file system
    Ou, L
    He, XB
    Scott, SL
    Xu, ZY
    Fang, YC
    LCN 2005: 30TH CONFERENCE ON LOCAL COMPUTER NETWORKS, PROCEEDINGS, 2005, : 100 - 107
  • [28] A Windows-based parallel file system
    Yeh, Lungpin
    Sun, Juei-Ting
    Hung, Sheng-Kai
    Hsu, Yarsun
    HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2007, 4782 : 7 - 18
  • [29] IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion
    Ren, Kai
    Zheng, Qing
    Patil, Swapnil
    Gibson, Garth
    SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, : 237 - 248
  • [30] Volume based metadata isolation in Blue Whale Cluster File System
    Zhang, Jingliang
    Si, Chengxiang
    Jia, Yajun
    Zhang, Jiangang
    Han, Xiaoming
    Xu, Lu
    HPCC: 2009 11TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2009, : 654 - +