Making the case for reforming the I/O software stack of extreme-scale systems

被引：5

作者：

Isaila, Florin ^{[1
]}

Garcia, Javier ^{[2
]}

Carretero, Jesus ^{[2
]}

Ross, Rob ^{[1
]}

Kimpe, Dries ^{[1
]}

机构：

[1] Argonne Natl Lab, 9700 S Cass Ave, Argonne, IL 60439 USA

[2] Univ Carlos III, Getafe, Spain

来源：

ADVANCES IN ENGINEERING SOFTWARE | 2017年 / 111卷

关键词：

Storage; I/O software stack; Data locality; Energy efficiency; Resilience;

D O I：

10.1016/j.advengsoft.2016.07.003

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The ever-increasing data needs of scientific and engineering applications require novel approaches to managing and exploring huge amounts of information in order to advance scientific discovery. In order to achieve this goal, one of the main priorities of the international scientific community is addressing the challenges of performing scientific computing on exascale machines within the next decade. Exascale platforms likely will be characterized by a three to four orders of magnitude increase in concurrency, a substantially larger storage capacity, and a deepening of the storage hierarchy. The current development model of independently applying optimizations at each layer of the system I/O software stack will not scale to the new levels of concurrency, storage hierarchy, and capacity. In this article we discuss the current development model for the I/O software stack of high-performance computing platforms. We identify the challenges of improving scalability, performance, energy efficiency, and resilience of the I/O software stack, while accessing a deepening hierarchy of volatile and nonvolatile storage. We advocate for radical new approaches to reforming the I/O software stack in order to advance toward exascale. (C) 2017 Elsevier Ltd. All rights reserved.

引用

页码：26 / 31

页数：6

共 50 条

[31] A Multi-Faceted Approach to Job Placement for Improved Performance on Extreme-Scale Systems
Zimmer, Christopher
Gupta, Saurabh
Atchley, Scott
Vazhkudai, Sudharshan S.
Albing, Carl
SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 1015 - 1025
[32] A Practical Approach to Reconciling Availability, Performance, and Capacity in Provisioning Extreme-scale Storage Systems
Wan, Lipeng
Wang, Feiyi
Oral, Sarp
Tiwari, Devesh
Vazhkudai, Sudharshan S.
Cao, Qing
PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2015,
[33] LFTI: A New Performance Metric for Assessing Interconnect Designs for Extreme-Scale HPC Systems
Yuan, Xin
Mahapatra, Santosh
Lang, Michael
Pakin, Scott
2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
[34] A Case Study in Using Massively Parallel Simulation for Extreme-Scale Torus Network Codesign
Mubarak, Misbah
Carothers, Christopher D.
Ross, Robert B.
Carns, Philip
SIGSIM-PADS'14: PROCEEDINGS OF THE 2014 ACM CONFERENCE ON SIGSIM PRINCIPLES OF ADVANCED DISCRETE SIMULATION, 2014, : 27 - 38
[35] SharP Unified Memory Allocator: An Intent-Based Memory Allocator for Extreme-Scale Systems
Aderholdt, Ferrol
Venkata, Manjunath Gorentla
Parchman, Zachary W.
EURO-PAR 2018: PARALLEL PROCESSING, 2018, 11014 : 533 - 545
[36] LIBI: A framework for bootstrapping extreme scale software systems
Goehner, J. D.
Arnold, D. C.
Ahn, D. H.
Lee, G. L.
de Supinski, B. R.
LeGendre, M. P.
Miller, B. P.
Schulz, M.
PARALLEL COMPUTING, 2013, 39 (03) : 167 - 176
[37] Modeling and Simulation of Extreme-Scale Fat-Tree Networks for HPC Systems and Data Centers
Liu, Ning
Haider, Adnan
Jin, Dong
Sun, Xian-He
ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2017, 27 (02):
[38] Visualization and parallel I/O at extreme scale
Ross, R. B.
Peterka, T.
Shen, H-W
Hong, Y.
Ma, K-L
Yu, H.
Moreland, K.
SCIDAC 2008: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2008, 125
[39] Lazy Checkpointing : Exploiting Temporal Locality in Failures to Mitigate Checkpointing Overheads on Extreme-Scale Systems
Tiwari, Devesh
Gupta, Saurabh
Vazhkudai, Sudharshan S.
2014 44TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2014, : 25 - 36
[40] Performance characterization of irregular I/O at the extreme scale
Herbein, S.
McDaniel, S.
Podhorszki, N.
Logan, J.
Klasky, S.
Taufer, M.
PARALLEL COMPUTING, 2016, 51 : 17 - 36

← 1 2 3 4 5 →