StageFS: A Parallel File System Optimizing Metadata Performance for SSD Based Clusters

被引:0
|
作者
Wu, Huijun [1 ]
Zhu, Liming [1 ]
Wu, Dongyao [1 ]
Lu, Kai [2 ]
Li, Gen [2 ]
机构
[1] Univ New South Wales, Data61, CSIRO, Kensington, NSW, Australia
[2] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
基金
美国国家科学基金会;
关键词
parallel file system; metadata; LSM-tree; small file;
D O I
10.1109/TrustCom.2016.328
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Parallel file systems are important infrastructures for both cloud and high performance computing. The performance of metadata operations is critical to achieve high scalability in parallel file systems. Nevertheless, traditional parallel file systems are lack of scalable metadata service. To alleviate these problems, some previous research distributes metadata to separated large-scale clusters and uses write-optimized techniques like log-structured merge tree (LSM-tree) to store metadata. However, LSM-tree design does not consider the features of solid state drive devices (SSD) which are widely deployed in modern parallel computing systems. The design of using LSM-trees to store metadata has not explored the potential benefits of SSD devices. In this paper, we present StageFS, which is a parallel file system optimized for SSD based clusters. StageFS stores both the metadata and small files in LSM-trees for fast indexing. For larger files, the file blocks are separately stored to reduce the write amplifications. In addition, the parallel I/O feature of SSD devices is used to improve the performance of accessing directories and large files. To avoid frequent small writes, StageFS uses buffering to better utilize the bandwidth of SSD devices. Experimental results show that StageFS provides better performance in metadata operations (up to 21.28x) and small file access (1.92x to two orders of magnitude) compared with Ceph and HDFS.
引用
收藏
页码:2147 / 2152
页数:6
相关论文
共 50 条
  • [31] A Zones-Based Metadata Management Method for Distributed File System
    Xie, Xiaowei
    Yang, Yu
    Lu, Yueming
    TRUSTWORTHY COMPUTING AND SERVICES, 2014, 426 : 169 - 175
  • [32] Optimizing the Ceph Distributed File System for High Performance Computing
    Jeong, Kisik
    Duffy, Carl
    Kim, Jin-Soo
    Lee, Joonwon
    2019 27TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP), 2019, : 446 - 451
  • [33] URFS: A User-space Raw File System based on NVMe SSD
    Tu, Yaofeng
    Han, Yinjun
    Chen, Zhenghua
    Chen, Zhengguang
    Chen, Bing
    2020 IEEE 26TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2020, : 494 - 501
  • [34] Implementation and performance of a parallel file system for high performance distributed applications
    Ligon, WB
    Ross, RB
    PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 1996, : 471 - 480
  • [35] The Composite-File File System: Decoupling One-to-One Mapping of Files and Metadata for Better Performance
    Zhang, Shuanglong
    Roy, Robert
    Rumancik, Leah
    Wang, An-I Andy
    ACM TRANSACTIONS ON STORAGE, 2020, 16 (01)
  • [36] The Composite-file File System: Decoupling the One-to-one Mapping of Files and Metadata for Better Performance
    Zhang, Shuanglong
    Catanese, Helen
    Wang, An-I Andy
    14TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES (FAST '16), 2016, : 15 - 22
  • [37] Dynamic file prefetching scheme based on file access patterns in VIA-based parallel file system
    Lee, YY
    Kim, CY
    Seo, DW
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2002, E85D (04) : 714 - 721
  • [38] Characterizing Concurrency Control Performance for the PIOUS Parallel File System
    Moyer, S. A.
    Sunderam, V. S.
    Journal of Parallel and Distributed Computing, 38 (01):
  • [39] Characterizing concurrency control performance for the PIOUS parallel file system
    Moyer, SA
    Sunderam, VS
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1996, 38 (01) : 81 - 91
  • [40] Performance evaluation and relative predictive model of parallel file system
    Zhao T.-Z.
    Dong S.-B.
    Verdi M.
    See S.
    Ruan Jian Xue Bao/Journal of Software, 2011, 22 (09): : 2206 - 2221