StageFS: A Parallel File System Optimizing Metadata Performance for SSD Based Clusters

被引:0
|
作者
Wu, Huijun [1 ]
Zhu, Liming [1 ]
Wu, Dongyao [1 ]
Lu, Kai [2 ]
Li, Gen [2 ]
机构
[1] Univ New South Wales, Data61, CSIRO, Kensington, NSW, Australia
[2] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
基金
美国国家科学基金会;
关键词
parallel file system; metadata; LSM-tree; small file;
D O I
10.1109/TrustCom.2016.328
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Parallel file systems are important infrastructures for both cloud and high performance computing. The performance of metadata operations is critical to achieve high scalability in parallel file systems. Nevertheless, traditional parallel file systems are lack of scalable metadata service. To alleviate these problems, some previous research distributes metadata to separated large-scale clusters and uses write-optimized techniques like log-structured merge tree (LSM-tree) to store metadata. However, LSM-tree design does not consider the features of solid state drive devices (SSD) which are widely deployed in modern parallel computing systems. The design of using LSM-trees to store metadata has not explored the potential benefits of SSD devices. In this paper, we present StageFS, which is a parallel file system optimized for SSD based clusters. StageFS stores both the metadata and small files in LSM-trees for fast indexing. For larger files, the file blocks are separately stored to reduce the write amplifications. In addition, the parallel I/O feature of SSD devices is used to improve the performance of accessing directories and large files. To avoid frequent small writes, StageFS uses buffering to better utilize the bandwidth of SSD devices. Experimental results show that StageFS provides better performance in metadata operations (up to 21.28x) and small file access (1.92x to two orders of magnitude) compared with Ceph and HDFS.
引用
收藏
页码:2147 / 2152
页数:6
相关论文
共 50 条
  • [1] Optimizing a hybrid SSD/HDD HPC storage system based on file size distributions
    Welch, Brent
    Noer, Geoffrey
    2013 IEEE 29TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2013,
  • [2] Distributed Metadata Management for Exascale Parallel File System
    Yamamoto, Keiji
    Hori, Atushi
    Ishikawa, Yutaka
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1438 - 1438
  • [3] A flexible multiagent parallel file system for clusters
    Pérez, MS
    Carretero, J
    García, F
    Peña, JM
    Robles, V
    COMPUTATIONAL SCIENCE - ICCS 2003, PT IV, PROCEEDINGS, 2003, 2660 : 248 - 256
  • [4] PVFS: A parallel file system for Linux clusters
    Carns, PH
    Ligon, WB
    Ross, RB
    Thakur, R
    USENIX ASSOCIATION PROCEEDINGS OF THE 4TH ANNUAL LINUX SHOWCASE AND CONFERENCE, ATLANTA, 2000, : 317 - 327
  • [5] masFS: File System Based on Memory and SSD in Compute Nodes for High Performance Computers
    Liu, Xin
    Lu, Ying
    Lu, Yutong
    Wu, Chunjia
    Wu, Jieting
    2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 569 - 576
  • [6] MAPFS:: A flexible multiagent parallel file system for clusters
    Pérez, MS
    Carretero, J
    García, F
    Peña, JM
    Robles, V
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2006, 22 (05): : 620 - 632
  • [7] ENHANCING PERFORMANCE IN A PARALLEL FILE SYSTEM
    MILLER, LL
    INGLETT, SR
    MICROPROCESSING AND MICROPROGRAMMING, 1994, 40 (04): : 261 - 274
  • [8] Cooperation model of a multiagent parallel file system for clusters
    Pérez, MS
    Sánchez, A
    Robles, V
    Peña, JM
    Abawajy, J
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID - CCGRID 2004, 2004, : 595 - 601
  • [9] Using asynchronous writes on metadata to improve file system performance
    Feng, LC
    Chang, RC
    JOURNAL OF SYSTEMS AND SOFTWARE, 1996, 35 (01) : 43 - 54
  • [10] Data investigation based on XFS file system metadata
    Park, Yongmin
    Chang, Hyunsoo
    Shon, Taeshik
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (22) : 14721 - 14743