Mimir: Extending I/O Interfaces to Express User Intent for Complex Workloads in HPC

被引:0
|
作者
Devarajan, Hariharan [1 ]
Mohror, Kathryn [1 ]
机构
[1] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
关键词
Mimir; user intent; I/O extension; multi-stage intent; Intent Infrastructure;
D O I
10.1109/IPDPS54959.2023.00027
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The complexity of data management in HPC systems stems from the diversity in I/O behavior exhibited by new workloads, multistage workflows, and the presence of multitiered storage systems. This complexity is managed by the storage systems, which provide user-level configurations to allow the tuning of workload I/O within the system. However, these configurations are difficult to set by users who lack expertise in I/O subsystems. We propose a paradigm change in which users specify the intent of I/O operations and storage systems automatically set various configurations based on the supplied intent. To this end, we developed the Mimir infrastructure to assist users in passing I/O intent to the underlying storage system. We demonstrate several use cases that map user-defined intents to storage configurations that lead to optimized I/O. In this study, we make three observations. First, I/O intents should be applied to each level of the I/O storage stack, from HDF5 to MPI-IO to POSIX, and integrated using lightweight adaptors in the existing stack. Second, the Mimir infrastructure supports up to 400M Ops/sec throughput of intents in the system, with a low memory overhead of 6.85KB per node. Third, intents assist in configuring a hierarchical cache to preload I/O, buffer in a node-local device, and store data in a global cache to optimize I/O workloads by 2.33x, 4x, and 2.1x, respectively. Our Mimir infrastructure optimizes complex large-scale workflows by up to 4x better I/O performance on the Lassen supercomputer by using automatically derived I/O intents.
引用
收藏
页码:178 / 188
页数:11
相关论文
共 11 条
  • [1] Extracting and characterizing I/O behavior of HPC workloads
    Devarajan, Hariharan
    Mohror, Kathryn
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2022), 2022, : 243 - 255
  • [2] An I/O Analysis of HPC Workloads on CephFS and Lustre
    Chiusole, Alberto
    Cozzini, Stefano
    van der Ster, Daniel
    Lamanna, Massimo
    Giuliani, Graziano
    [J]. HIGH PERFORMANCE COMPUTING: ISC HIGH PERFORMANCE 2019 INTERNATIONAL WORKSHOPS, 2020, 11887 : 300 - 316
  • [3] Replicating HPC I/O Workloads With Proxy Applications
    Dickson, James
    Wright, Steven
    Maheswaran, Satheesh
    Herdman, Andy
    Miller, Mark C.
    Jarvis, Stephen
    [J]. PROCEEDINGS OF PDSW-DISCS 2016 - 1ST JOINT INTERNATIONAL WORKSHOP ON PARALLEL DATA STORAGE AND DATA INTENSIVE SCALABLE COMPUTING SYSTEMS, 2016, : 13 - 18
  • [4] Detecting I/O Access Patterns of HPC Workloads at Runtime
    Bez, Jean Luca
    Boito, Francieli Zanon
    Nou, Ramon
    Miranda, Alberto
    Cortes, Toni
    Navaux, Philippe O. A.
    [J]. 2019 31ST INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2019), 2019, : 80 - 87
  • [5] Parallel I/O Evaluation Techniques and Emerging HPC Workloads: A Perspective
    Neuwirth, Sarah
    Paul, Arnab K.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2021), 2021, : 671 - 679
  • [6] Characterizing I/O Workloads of HPC Applications Through Online Analysis
    Dong, Wenrui
    Liu, Guangming
    Yu, Jie
    Zuo, You
    [J]. 2015 IEEE 34TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2015,
  • [7] Characterizing Machine Learning I/O Workloads on Leadership Scale HPC Systems
    Paul, Arnab K.
    Karimi, Ahmad Maroof
    Wang, Feiyi
    [J]. 29TH INTERNATIONAL SYMPOSIUM ON THE MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2021), 2021, : 198 - 205
  • [8] Does Varying BeeGFS Configuration Affect the I/O Performance of HPC Workloads?
    Borkar, Arnav
    Tony, Joel
    Vamsi, Hari K. N.
    Barman, Tushar
    Bhisikar, Yash
    Sreenath, T. M.
    Paul, Arnab K.
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING WORKSHOPS, CLUSTER WORKSHOPS, 2023, : 5 - 7
  • [9] Arbitration Policies for On-Demand User-Level I/O Forwarding on HPC Platforms
    Bez, Jean Luca
    Miranda, Alberto
    Nou, Ramon
    Boito, Francieli Zanon
    Cortes, Toni
    Navaux, Philippe
    [J]. 2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 577 - 586
  • [10] uMMAP-IO: User-level Memory-mapped I/O for HPC
    Rivas-Gomez, Sergio
    Fanfarillo, Alessandro
    Valat, Sebastien
    Laferriere, Christophe
    Couvee, Philippe
    Narasimhamurthy, Sai
    Markidis, Stefano
    [J]. 2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC), 2019, : 363 - 372