Making the case for reforming the I/O software stack of extreme-scale systems

被引:5
|
作者
Isaila, Florin [1 ]
Garcia, Javier [2 ]
Carretero, Jesus [2 ]
Ross, Rob [1 ]
Kimpe, Dries [1 ]
机构
[1] Argonne Natl Lab, 9700 S Cass Ave, Argonne, IL 60439 USA
[2] Univ Carlos III, Getafe, Spain
关键词
Storage; I/O software stack; Data locality; Energy efficiency; Resilience;
D O I
10.1016/j.advengsoft.2016.07.003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ever-increasing data needs of scientific and engineering applications require novel approaches to managing and exploring huge amounts of information in order to advance scientific discovery. In order to achieve this goal, one of the main priorities of the international scientific community is addressing the challenges of performing scientific computing on exascale machines within the next decade. Exascale platforms likely will be characterized by a three to four orders of magnitude increase in concurrency, a substantially larger storage capacity, and a deepening of the storage hierarchy. The current development model of independently applying optimizations at each layer of the system I/O software stack will not scale to the new levels of concurrency, storage hierarchy, and capacity. In this article we discuss the current development model for the I/O software stack of high-performance computing platforms. We identify the challenges of improving scalability, performance, energy efficiency, and resilience of the I/O software stack, while accessing a deepening hierarchy of volatile and nonvolatile storage. We advocate for radical new approaches to reforming the I/O software stack in order to advance toward exascale. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:26 / 31
页数:6
相关论文
共 50 条
  • [31] A Multi-Faceted Approach to Job Placement for Improved Performance on Extreme-Scale Systems
    Zimmer, Christopher
    Gupta, Saurabh
    Atchley, Scott
    Vazhkudai, Sudharshan S.
    Albing, Carl
    SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 1015 - 1025
  • [32] A Practical Approach to Reconciling Availability, Performance, and Capacity in Provisioning Extreme-scale Storage Systems
    Wan, Lipeng
    Wang, Feiyi
    Oral, Sarp
    Tiwari, Devesh
    Vazhkudai, Sudharshan S.
    Cao, Qing
    PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2015,
  • [33] LFTI: A New Performance Metric for Assessing Interconnect Designs for Extreme-Scale HPC Systems
    Yuan, Xin
    Mahapatra, Santosh
    Lang, Michael
    Pakin, Scott
    2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [34] A Case Study in Using Massively Parallel Simulation for Extreme-Scale Torus Network Codesign
    Mubarak, Misbah
    Carothers, Christopher D.
    Ross, Robert B.
    Carns, Philip
    SIGSIM-PADS'14: PROCEEDINGS OF THE 2014 ACM CONFERENCE ON SIGSIM PRINCIPLES OF ADVANCED DISCRETE SIMULATION, 2014, : 27 - 38
  • [35] SharP Unified Memory Allocator: An Intent-Based Memory Allocator for Extreme-Scale Systems
    Aderholdt, Ferrol
    Venkata, Manjunath Gorentla
    Parchman, Zachary W.
    EURO-PAR 2018: PARALLEL PROCESSING, 2018, 11014 : 533 - 545
  • [36] LIBI: A framework for bootstrapping extreme scale software systems
    Goehner, J. D.
    Arnold, D. C.
    Ahn, D. H.
    Lee, G. L.
    de Supinski, B. R.
    LeGendre, M. P.
    Miller, B. P.
    Schulz, M.
    PARALLEL COMPUTING, 2013, 39 (03) : 167 - 176
  • [37] Modeling and Simulation of Extreme-Scale Fat-Tree Networks for HPC Systems and Data Centers
    Liu, Ning
    Haider, Adnan
    Jin, Dong
    Sun, Xian-He
    ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2017, 27 (02):
  • [38] Visualization and parallel I/O at extreme scale
    Ross, R. B.
    Peterka, T.
    Shen, H-W
    Hong, Y.
    Ma, K-L
    Yu, H.
    Moreland, K.
    SCIDAC 2008: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2008, 125
  • [39] Lazy Checkpointing : Exploiting Temporal Locality in Failures to Mitigate Checkpointing Overheads on Extreme-Scale Systems
    Tiwari, Devesh
    Gupta, Saurabh
    Vazhkudai, Sudharshan S.
    2014 44TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2014, : 25 - 36
  • [40] Performance characterization of irregular I/O at the extreme scale
    Herbein, S.
    McDaniel, S.
    Podhorszki, N.
    Logan, J.
    Klasky, S.
    Taufer, M.
    PARALLEL COMPUTING, 2016, 51 : 17 - 36