Parallel File System Analysis Through Application I/O Tracing

被引:16
|
作者
Wright, S. A. [1 ]
Hammond, S. D. [2 ]
Pennycook, S. J. [1 ]
Bird, R. F. [1 ]
Herdman, J. A. [3 ]
Miller, I. [3 ]
Vadgama, A. [3 ]
Bhalerao, A. [1 ]
Jarvis, S. A. [1 ]
机构
[1] Univ Warwick, Dept Comp Sci, Coventry CV4 7AL, W Midlands, England
[2] Sandia Natl Labs, Ctr Res Comp, Albuquerque, NM 87185 USA
[3] UK Atom Weap Estab, Supercomp Solut Ctr, Reading, Berks, England
来源
COMPUTER JOURNAL | 2013年 / 56卷 / 02期
基金
美国能源部;
关键词
high performance computing; input/output; MPI; checkpointing; file systems; FLASH; CODE; MPI;
D O I
10.1093/comjnl/bxs044
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Input/Output (I/O) operations can represent a significant proportion of the run-time of parallel scientific computing applications. Although there have been several advances in file format libraries, file system design and I/O hardware, a growing divergence exists between the performance of parallel file systems and the compute clusters that they support. In this paper, we document the design and application of the RIOT I/O toolkit (RIOT) being developed at the University of Warwick with our industrial partners at the Atomic Weapons Establishment and Sandia National Laboratories. We use the toolkit to assess the performance of three industry-standard I/O benchmarks on three contrasting supercomputers, ranging from a mid-sized commodity cluster to a large-scale proprietary IBM BlueGene/P system. RIOT provides a powerful framework in which to analyse I/O and parallel file system behaviour-we demonstrate, for example, the large file locking overhead of IBM's General Parallel File System, which can consume nearly 30% of the total write time in the FLASH-IO benchmark. Through I/O trace analysis, we also assess the performance of HDF-5 in its default configuration, identifying a bottleneck created by the use of suboptimal Message Passing Interface hints. Furthermore, we investigate the performance gains attributed to the Parallel Log-structured File System (PLFS) being developed by EMC Corporation and the Los Alamos National Laboratory. Our evaluation of PLFS involves two high-performance computing systems with contrasting I/O backplanes and illustrates the varied improvements to I/O that result from the deployment of PLFS (ranging from up to 25x speed-up in I/O performance on a large I/O installation to 2x speed-up on the much smaller installation at the University of Warwick).
引用
收藏
页码:141 / 155
页数:15
相关论文
共 50 条
  • [1] Benchmarking Parallel File System Sensitiveness to I/O patterns
    Chasapis, Konstantinos
    Vet, Jean-Yves
    Acquaviva, Jean-Thomas
    [J]. 2019 IEEE 27TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2019), 2019, : 427 - 428
  • [2] UFCR: An efficient I/O method for parallel file system
    Huo, Yanmei
    Ju, Jiubin
    Hu, Liang
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2006, : 223 - +
  • [3] Performance analysis of parallel file system for I/O load balancing in distributed applications
    Deepalakshmi, S
    Baswaraj, K
    Rao, SSSP
    Jagdish, BS
    Sonavane, DD
    Dhekne, PS
    Kaura, HK
    [J]. PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 713 - 720
  • [4] Distributed parallel file system for I/O intensive parallel computing on clusters
    Domínguez-Domínguez, S
    Buenabad-Chávez, J
    [J]. 2004 1st International Conference on Electrical and Electronics Engineering (ICEEE), 2004, : 194 - 199
  • [5] Enabling dynamic file I/O path selection at runtime for parallel file system
    Xiuqiao Li
    Limin Xiao
    Meikang Qiu
    Bin Dong
    Li Ruan
    [J]. The Journal of Supercomputing, 2014, 68 : 996 - 1021
  • [6] Enabling dynamic file I/O path selection at runtime for parallel file system
    Li, Xiuqiao
    Xiao, Limin
    Qiu, Meikang
    Dong, Bin
    Ruan, Li
    [J]. JOURNAL OF SUPERCOMPUTING, 2014, 68 (02): : 996 - 1021
  • [7] FILE CONCEPTS FOR PARALLEL I/O
    CROCKETT, TW
    [J]. PROCEEDINGS : SUPERCOMPUTING 89, 1989, : 574 - 579
  • [8] A File Assignment Strategy for Parallel I/O System with Minimum I/O Contention Probability
    Dong, Bin
    Li, Xiuqiao
    Xiao, Limin
    Ruan, Li
    [J]. GRID AND DISTRIBUTED COMPUTING, 2011, 261 : 445 - 454
  • [9] Parallel I/O Architecture Modelling Based on File System Counters
    El Sayed, Salem
    Bolten, Matthias
    Pleiter, Dirk
    [J]. HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2016 INTERNATIONAL WORKSHOPS, 2016, 9945 : 627 - 637
  • [10] Using file system counters in modelling parallel I/O architectures
    El Sayed, Salem
    Bolten, Matthias
    Pleiter, Dirk
    [J]. OPERATING SYSTEMS REVIEW, 2016, 50 (02) : 37 - 46