Scalable I/O aggregation for asynchronous multi-level checkpointing

Authors
Gossman M.J. [1]
Nicolae B. [2]
Calhoun J.C. [1]
Affiliations
[1] Holcombe Department of Electrical and Computer Engineering, Clemson University, Clemson, 29631, SC
[2] Mathematical and Computer Science Division, Argonne National Laboratory, Lemont, 22222, IL
Funding
National Science Foundation (USA)
Keywords
Asynchronous I/O; Checkpoint-restart; Distributed I/O aggregation
DOI
10.1016/j.future.2024.06.003
Abstract
Checkpointing distributed HPC applications is a common I/O pattern with many use cases: resilience, job management, reproducibility, and revisiting previous intermediate results. The pattern is difficult at scale, because a large number of processes must capture massive amounts of data and write them persistently to shared storage (e.g., a parallel file system), which is subject to I/O bottlenecks due to limited I/O bandwidth under concurrency. In addition to I/O performance and scalability considerations, users often impose limits on the number of files or objects that can be used to capture the checkpoints. For example, users may need to move checkpoints between HPC systems or parallel file systems, which is inefficient for a large number of files, or may need to use the checkpoints in workflows that expect related objects to be grouped together. As a consequence, I/O aggregation is often used to reduce the number of files and objects persisted to shared storage so that it is much lower than the number of processes. However, I/O aggregation is challenging for two reasons: (1) if more than one process writes checkpointing data to the same file, the additional I/O contention amplifies the I/O bottlenecks; (2) scalable state-of-the-art checkpointing techniques are asynchronous and rely on multi-level techniques that capture the data structures to local storage or memory and then flush them to shared storage in the background, which competes for resources (I/O, memory, network bandwidth) with the application running in the foreground. State-of-the-art approaches have addressed the problem of I/O aggregation for synchronous checkpointing but are insufficient for asynchronous checkpointing. To fill this gap, we contribute a novel I/O aggregation strategy that operates efficiently in the background to complement asynchronous checkpoint-restart (C/R).
Specifically, we explore how to (1) develop a network of efficient, thread-safe I/O proxies that persist data via limited-size write buffers, (2) prioritize remote data (from non-proxy processes) and local data on I/O proxies to minimize write overhead, and (3) load-balance flushing on I/O proxies. We analyze the trade-offs of developing such strategies and discuss their performance impact on large-scale micro-benchmarks, as well as on a real HPC application (HACC). © 2024 Elsevier B.V.
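To make the idea of an I/O proxy with a limited-size write buffer more concrete, the following is a minimal single-node sketch in Python. It is not the authors' implementation. The class `IOProxy`, its methods, the `max_chunks` parameter, and the `REMOTE`/`LOCAL` constants are hypothetical names introduced for illustration only. The sketch arbitrarily flushes remote chunks before local ones; the abstract does not state which ordering the authors chose. It also assumes one chunk per rank and uses an in-memory sink in place of a shared file.

```python
import io
import queue
import threading

# Assumed policy for this sketch only: lower value is flushed first.
REMOTE, LOCAL = 0, 1


class IOProxy:
    """Hypothetical aggregator: gathers chunks from several ranks and
    appends them to a single sink from a background thread."""

    def __init__(self, sink, max_chunks=4):
        self.sink = sink
        # Bounded queue models the limited-size write buffer:
        # submit() blocks when max_chunks chunks are already pending.
        self.pending = queue.PriorityQueue(maxsize=max_chunks)
        self.index = {}  # rank -> (offset, length) inside the sink
        self._seq = 0
        self._lock = threading.Lock()
        self._worker = threading.Thread(target=self._flush_loop, daemon=True)
        self._worker.start()

    def submit(self, rank, data, priority):
        """Thread-safe: may be called concurrently by many producers."""
        with self._lock:
            seq = self._seq  # unique tie-breaker keeps FIFO order per priority
            self._seq += 1
        self.pending.put((priority, seq, rank, data))

    def _flush_loop(self):
        # Background flush: drains the buffer while producers keep running.
        while True:
            _, _, rank, data = self.pending.get()
            if rank is None:  # shutdown sentinel
                self.pending.task_done()
                break
            offset = self.sink.tell()
            self.sink.write(data)
            self.index[rank] = (offset, len(data))
            self.pending.task_done()

    def close(self):
        self.pending.join()  # wait until every submitted chunk is written
        self.pending.put((float("inf"), -1, None, b""))
        self._worker.join()


def demo():
    """Aggregate four per-rank chunks into one in-memory 'file'."""
    sink = io.BytesIO()
    proxy = IOProxy(sink, max_chunks=2)
    for rank in range(4):
        kind = LOCAL if rank == 0 else REMOTE  # rank 0 plays the proxy's own data
        proxy.submit(rank, bytes([65 + rank]) * (rank + 1), kind)
    proxy.close()
    return sink.getvalue(), proxy.index
```

Calling `demo()` returns the aggregated bytes together with an index that maps each rank to its offset and length, which is what a restart would need to locate its own data inside the shared file. The order in which chunks land in the sink depends on thread timing, so only the index, not the layout, should be relied on. Load balancing across several proxies, the third point in the abstract, is outside the scope of this sketch.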
Pages: 420-432
Number of pages: 12