PSA: A Performance and Space-Aware Data Layout Scheme for Hybrid Parallel File Systems

被引:5
|
作者
He, Shuibing [1 ,2 ]
Liu, Yan [2 ]
Sun, Xian-He [2 ]
机构
[1] Wuhan Univ, Sch Comp, Wuhan, Hubei, Peoples R China
[2] IIT, Dept Comp Sci, Chicago, IL 60616 USA
关键词
Parallel I/O System; Parallel File system; Data Layout; Solid State Drive;
D O I
10.1109/DISCS.2014.10
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The underlying storage of hybrid parallel file systems (PFS) is composed of both SSD-based file servers (SServer) and HDD-based file servers (HServer). Unlike a traditional HServer, an SServer consistently provides improved storage performance but lacks storage space. However, most current data layout schemes do not consider the differences in performance and space between heterogeneous servers, and may significantly degrade the performance of the hybrid PFSs. In this paper, we propose PSA, a novel data layout scheme, which maximizes the hybrid PFSs performance by applying adaptive varied-size file stripes. PSA dispatches data on heterogeneous file servers not only based on storage performance but also storage space. We have implemented PSA within OrangeFS, a popular parallel file system in the HPC domain. Our extensive experiments using a representative benchmark show that PSA provides superior I/O throughput than the default and performance-aware file data layout schemes.
引用
收藏
页码:41 / 48
页数:8
相关论文
共 50 条
  • [1] Enhancing hybrid parallel file system through performance and space-aware data layout
    He, Shuibing
    Liu, Yan
    Wang, Yang
    Sun, Xian-He
    Huang, Chuanhe
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2016, 30 (04): : 396 - 410
  • [2] A Migratory Heterogeneity-Aware Data Layout Scheme for Parallel File Systems
    He, Shuibing
    Sun, Xian-He
    Wang, Yang
    Xu, Chengzhong
    [J]. 2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 1133 - 1142
  • [3] HAS: Heterogeneity-Aware Selective Layout Scheme for Parallel File Systems on Hybrid Servers
    He, Shuibing
    Sun, Xian-He
    Haider, Adnan
    [J]. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 613 - 622
  • [4] Performance-Aware Data Placement in Hybrid Parallel File Systems
    He, Shuibing
    Sun, Xian-He
    Feng, Bo
    Feng, Kun
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT I, 2014, 8630 : 563 - 576
  • [5] A Heterogeneity-Aware Region-Level Data Layout for Hybrid Parallel File Systems
    He, Shuibing
    Sun, Xian-He
    Wang, Yang
    Kougkas, Antonis
    Haider, Adnan
    [J]. 2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 340 - 349
  • [6] Space-Aware Data Integration for Ocean Observing Systems
    Li, Longzhuang
    Nalluri, Anil Kumar
    Ai, Lirong
    [J]. 2011 2ND INTERNATIONAL CONFERENCE ON CHALLENGES IN ENVIRONMENTAL SCIENCE AND COMPUTER ENGINEERING (CESCE 2011), VOL 11, PT A, 2011, 11 : 285 - 290
  • [7] Boosting Parallel File System Performance via Heterogeneity-Aware Selective Data Layout
    He, Shuibing
    Wang, Yang
    Sun, Xian-He
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (09) : 2492 - 2505
  • [8] GLOBAL AND LOCAL SYNCHRONIZATION IN PARALLEL SPACE-AWARE APPLICATIONS
    Cicirelli, Franco
    Forestiero, Agostino
    Giordano, Andrea
    Mastroianni, Carlo
    Razumchik, Rostislav
    [J]. 32ND EUROPEAN CONFERENCE ON MODELLING AND SIMULATION (ECMS 2018), 2018, : 491 - 497
  • [9] A Cost-intelligent Application-specific Data Layout Scheme for Parallel File Systems
    Song, Huaiming
    Yin, Yanlong
    Chen, Yong
    Sun, Xian-He
    [J]. HPDC 11: PROCEEDINGS OF THE 20TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2011, : 37 - 48
  • [10] Parallel execution of space-aware applications in a Cloud environment
    Cicirelli, Franco
    Forestiero, Agostino
    Giordano, Andrea
    Mastroianni, Carlo
    Spezzano, Giandomenico
    [J]. 2016 24TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP), 2016, : 686 - 693