Extracting and characterizing I/O behavior of HPC workloads

被引:0
|
作者
Devarajan, Hariharan [1 ]
Mohror, Kathryn [1 ]
机构
[1] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
关键词
I/O characterization; workflow I/O; Montage; Simulation; LBANN; Workload-aware; Pegasus; DATA-COMPRESSION; PEGASUS;
D O I
10.1109/CLUSTER51413.2022.00037
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
System administrators set default storage-system configuration parameters with the goal of providing high performance for their system's I/O workloads. However, this generalized configuration can lead to suboptimal I/O performance for individual workloads. Users can provide parameter settings to the storage system to obtain better performance for individual applications, but it can be very challenging to determine which parameters to set and to what values. This problem is further exacerbated by the increased complexity of modern storage systems. In this work, we move towards solving this problem by providing a systematic categorization of workload-related information that users or middleware libraries can pass to the storage system to optimize I/O performance for specific workloads. We study applications and workflows from different scientific domains to cover a broad range of HPC use cases. Through our categorization, we find that a) workload features differ based on the hardware, software, and data components involved in the execution of workloads and b) multiple workload features together drive I/O optimizations. The methodology proposed in this work optimizes complex scientific workloads by 2.2x-8x, using workload-aware I/O optimizations. Using the proposed methodology, users can pragmatically characterize their workload, and this characterization can assist the storage system in configuring itself to optimize I/O performance for individual workloads in HPC systems.
引用
收藏
页码:243 / 255
页数:13
相关论文
共 50 条
  • [1] Characterizing I/O Workloads of HPC Applications Through Online Analysis
    Dong, Wenrui
    Liu, Guangming
    Yu, Jie
    Zuo, You
    [J]. 2015 IEEE 34TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2015,
  • [2] Characterizing Machine Learning I/O Workloads on Leadership Scale HPC Systems
    Paul, Arnab K.
    Karimi, Ahmad Maroof
    Wang, Feiyi
    [J]. 29TH INTERNATIONAL SYMPOSIUM ON THE MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2021), 2021, : 198 - 205
  • [3] I/O Behavior Characterizing and Predicting of Virtualization Workloads
    Hu, Yanyan
    Long, Xiang
    Zhang, Jiong
    [J]. JOURNAL OF COMPUTERS, 2012, 7 (07) : 1712 - 1725
  • [4] An I/O Analysis of HPC Workloads on CephFS and Lustre
    Chiusole, Alberto
    Cozzini, Stefano
    van der Ster, Daniel
    Lamanna, Massimo
    Giuliani, Graziano
    [J]. HIGH PERFORMANCE COMPUTING: ISC HIGH PERFORMANCE 2019 INTERNATIONAL WORKSHOPS, 2020, 11887 : 300 - 316
  • [5] Replicating HPC I/O Workloads With Proxy Applications
    Dickson, James
    Wright, Steven
    Maheswaran, Satheesh
    Herdman, Andy
    Miller, Mark C.
    Jarvis, Stephen
    [J]. PROCEEDINGS OF PDSW-DISCS 2016 - 1ST JOINT INTERNATIONAL WORKSHOP ON PARALLEL DATA STORAGE AND DATA INTENSIVE SCALABLE COMPUTING SYSTEMS, 2016, : 13 - 18
  • [6] Detecting I/O Access Patterns of HPC Workloads at Runtime
    Bez, Jean Luca
    Boito, Francieli Zanon
    Nou, Ramon
    Miranda, Alberto
    Cortes, Toni
    Navaux, Philippe O. A.
    [J]. 2019 31ST INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2019), 2019, : 80 - 87
  • [7] Parallel I/O Evaluation Techniques and Emerging HPC Workloads: A Perspective
    Neuwirth, Sarah
    Paul, Arnab K.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2021), 2021, : 671 - 679
  • [8] Characterizing Deep-Learning I/O Workloads in TensorFlow
    Chien, Steven W. D.
    Markidis, Stefano
    Sishtla, Chaitanya Prasad
    Santos, Luis
    Herman, Pawel
    Narasimhamurthy, Sai
    Laure, Erwin
    [J]. PROCEEDINGS OF 2018 IEEE/ACM 3RD JOINT INTERNATIONAL WORKSHOP ON PARALLEL DATA STORAGE & DATA INTENSIVE SCALABLE COMPUTING SYSTEMS (PDSW-DISCS), 2018, : 54 - 63
  • [9] Does Varying BeeGFS Configuration Affect the I/O Performance of HPC Workloads?
    Borkar, Arnav
    Tony, Joel
    Vamsi, Hari K. N.
    Barman, Tushar
    Bhisikar, Yash
    Sreenath, T. M.
    Paul, Arnab K.
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING WORKSHOPS, CLUSTER WORKSHOPS, 2023, : 5 - 7
  • [10] Mimir: Extending I/O Interfaces to Express User Intent for Complex Workloads in HPC
    Devarajan, Hariharan
    Mohror, Kathryn
    [J]. 2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS, 2023, : 178 - 188