Runway: In-transit Data Compression on Heterogeneous HPC Systems

被引:1
|
作者
Ravi, John [1 ]
Byna, Suren [2 ]
Becchi, Michela [1 ]
机构
[1] North Carolina State Univ, Raleigh, NC 27695 USA
[2] Ohio State Univ, Columbus, OH USA
基金
美国国家科学基金会;
关键词
Object Data Management; In-transit Computation; Heterogeneous Resources;
D O I
10.1109/CCGRID57682.2023.00030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To alleviate bottlenecks in storing and accessing data on high-performance computing (HPC) systems, I/O libraries are enabling computation while data is in-transit, such as HDF5 filters. For scientific applications that commonly use floatingpoint data, error-bounded lossy compression methods are a critical technique to significantly reduce the storage and bandwidth requirements. Thus far, deciding when and where to schedule in-transit data transformations, such as compression, has been outside the scope of I/O libraries. In this paper, we introduce Runway, a runtime framework that enables computation on in-transit data with an object storage abstraction. Runway is designed to be extensible to execute userdefined functions at runtime. In this effort, we focus on studying methods to offload data compression operations to available processing units based on latency and throughput. We compare the performance of running compression on multi-core CPUs, as well as offloading it to a GPU and a Data Processing Unit (DPU). We implement a state-of-the-art error-bounded lossy compression algorithm, SZ3, as a Runway function with a variant optimized for DPUs. We propose dynamic modeling to guide scheduling decisions for in-transit data compression. We evaluate Runway using four scientific datasets from the SDRBench benchmark suite on a the Perlmutter supercomputer at NERSC.
引用
收藏
页码:229 / 239
页数:11
相关论文
共 50 条
  • [1] Runway: In-transit Data Compression on Heterogeneous HPC Systems
    Ravi, John
    Byna, Suren
    Becchi, Michela
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW, 2023, : 340 - 342
  • [2] Accelerating In-transit Isosurface Generation With Topology Preserving Compression
    Li, Yanliang
    Chen, Jieyang
    2024 IEEE 20TH INTERNATIONAL CONFERENCE ON E-SCIENCE, E-SCIENCE 2024, 2024,
  • [3] Opportunistic Query Execution on SmartNICs for Analyzing In-Transit Data
    Liu, Jianshen
    Maltzahn, Carlos
    Ulmer, Craig
    2023 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE, HPEC, 2023,
  • [4] A Data Structure for Planning Based Workload Management of Heterogeneous HPC Systems
    Keller, Axel
    JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, JSSPP 2017, 2018, 10773 : 132 - 151
  • [5] Experiments with in-transit processing for data intensive Grid workflows
    Bhat, Virai
    Parashar, Manish
    Klasky, Scott
    2007 8TH IEEE/ACM INTERNATIONAL CONFERENCE ON GRID COMPUTING, 2007, : 130 - +
  • [6] Benefits of In-Transit Management Systems through Addition of Admixture
    Straka, Jason
    Klaus, Stephen P.
    Zhu, Junfeng
    Gentile, Pete A.
    Tregger, Nathan A.
    ACI MATERIALS JOURNAL, 2021, 118 (06) : 291 - 299
  • [7] Managing Heterogeneous Resources in HPC Systems
    Agosta, Giovanni
    Fornaciari, William
    Massari, Giuseppe
    Pupykina, Anna
    Reghenzani, Federico
    Zanella, Michele
    PARMA-DITAM 2018: 9TH WORKSHOP ON PARALLEL PROGRAMMING AND RUNTIME MANAGEMENT TECHNIQUES FOR MANY-CORE ARCHITECTURES AND 7TH WORKSHOP ON DESIGN TOOLS AND ARCHITECTURES FOR MULTICORE EMBEDDED COMPUTING PLATFORMS, 2018, : 7 - 12
  • [8] On the Privacy Enhancement of In-Transit Health Data Inspection: A Preliminary Study
    Sancho, Jorge
    Mikkelsen, Gert Laessoe
    Lindstrom, Jonas
    Garcia, Jose
    Alesanco, Alvaro
    XV MEDITERRANEAN CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING AND COMPUTING - MEDICON 2019, 2020, 76 : 855 - 860
  • [9] A Method for Combining Heterogeneous Workflows in HPC Systems
    Lyakhovets, D. S.
    Baranov, A. V.
    Konstantinov, P. A.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2024, 45 (10) : 5111 - 5125
  • [10] Towards RBF Interpolation on Heterogeneous HPC Systems
    Haase, Gundolf
    Martin, Dirk
    Offner, Guenter
    LARGE-SCALE SCIENTIFIC COMPUTING, LSSC 2015, 2015, 9374 : 182 - 190