Runway: In-transit Data Compression on Heterogeneous HPC Systems

Cited by: 1
Authors
Ravi, John [1]
Byna, Suren [2]
Becchi, Michela [1]
Affiliations
[1] North Carolina State Univ, Raleigh, NC 27695 USA
[2] Ohio State Univ, Columbus, OH USA
Funding
National Science Foundation (USA);
Keywords
Object Data Management; In-transit Computation; Heterogeneous Resources;
DOI
10.1109/CCGRID57682.2023.00030
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
To alleviate bottlenecks in storing and accessing data on high-performance computing (HPC) systems, I/O libraries are enabling computation on data while it is in transit, for example through HDF5 filters. For scientific applications that commonly use floating-point data, error-bounded lossy compression is a critical technique for significantly reducing storage and bandwidth requirements. Thus far, deciding when and where to schedule in-transit data transformations, such as compression, has been outside the scope of I/O libraries. In this paper, we introduce Runway, a runtime framework that enables computation on in-transit data with an object storage abstraction. Runway is designed to be extensible so that it can execute user-defined functions at runtime. In this effort, we focus on studying methods to offload data compression operations to available processing units based on latency and throughput. We compare the performance of running compression on multi-core CPUs, as well as offloading it to a GPU and a Data Processing Unit (DPU). We implement a state-of-the-art error-bounded lossy compression algorithm, SZ3, as a Runway function, with a variant optimized for DPUs. We propose dynamic modeling to guide scheduling decisions for in-transit data compression. We evaluate Runway using four scientific datasets from the SDRBench benchmark suite on the Perlmutter supercomputer at NERSC.
Pages: 229-239
Page count: 11