Making the case for reforming the I/O software stack of extreme-scale systems

Cited by: 5
Authors
Isaila, Florin [1]
Garcia, Javier [2]
Carretero, Jesus [2]
Ross, Rob [1]
Kimpe, Dries [1]
Affiliations
[1] Argonne Natl Lab, 9700 S Cass Ave, Argonne, IL 60439 USA
[2] Univ Carlos III, Getafe, Spain
Keywords
Storage; I/O software stack; Data locality; Energy efficiency; Resilience;
DOI
10.1016/j.advengsoft.2016.07.003
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
The ever-increasing data needs of scientific and engineering applications require novel approaches to managing and exploring huge amounts of information in order to advance scientific discovery. In order to achieve this goal, one of the main priorities of the international scientific community is addressing the challenges of performing scientific computing on exascale machines within the next decade. Exascale platforms likely will be characterized by a three to four orders of magnitude increase in concurrency, a substantially larger storage capacity, and a deepening of the storage hierarchy. The current development model of independently applying optimizations at each layer of the system I/O software stack will not scale to the new levels of concurrency, storage hierarchy, and capacity. In this article we discuss the current development model for the I/O software stack of high-performance computing platforms. We identify the challenges of improving scalability, performance, energy efficiency, and resilience of the I/O software stack, while accessing a deepening hierarchy of volatile and nonvolatile storage. We advocate for radical new approaches to reforming the I/O software stack in order to advance toward exascale. (C) 2017 Elsevier Ltd. All rights reserved.
Pages: 26-31
Number of pages: 6