Runway: In-transit Data Compression on Heterogeneous HPC Systems

Authors
Ravi, John [1]
Byna, Suren [2]
Becchi, Michela [1]
Affiliations
[1] North Carolina State Univ, Raleigh, NC 27695 USA
[2] Ohio State Univ, Columbus, OH USA
Source
2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID | 2023
Funding
National Science Foundation (USA)
Keywords
Object Data Management; In-transit Computation; Heterogeneous Resources
DOI
10.1109/CCGRID57682.2023.00030
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To alleviate bottlenecks in storing and accessing data on high-performance computing (HPC) systems, I/O libraries are enabling computation on data while it is in transit, for example through HDF5 filters. For scientific applications that commonly use floating-point data, error-bounded lossy compression is a critical technique for significantly reducing storage and bandwidth requirements. Thus far, deciding when and where to schedule in-transit data transformations, such as compression, has been outside the scope of I/O libraries. In this paper, we introduce Runway, a runtime framework that enables computation on in-transit data with an object storage abstraction. Runway is designed to be extensible so that it can execute user-defined functions at runtime. In this effort, we focus on studying methods to offload data compression operations to available processing units based on latency and throughput. We compare the performance of running compression on multi-core CPUs with that of offloading it to a GPU and to a Data Processing Unit (DPU). We implement a state-of-the-art error-bounded lossy compression algorithm, SZ3, as a Runway function with a variant optimized for DPUs. We propose dynamic modeling to guide scheduling decisions for in-transit data compression. We evaluate Runway using four scientific datasets from the SDRBench benchmark suite on the Perlmutter supercomputer at NERSC.
Pages: 229-239
Page count: 11
Related papers
50 in total
  • [41] Tiered data management system: Accelerating data processing on HPC systems
    Cheng, Peng
    Lu, Yutong
    Du, Yunfei
    Chen, Zhiguang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 101 : 894 - 908
  • [42] Efficient Data Access Strategies for Hadoop and Spark on HPC Cluster with Heterogeneous Storage
    Islam, Nusrat Sharmin
    Wasi-ur-Rahman, Md.
    Lu, Xiaoyi
    Panda, Dhabaleswar K.
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 223 - 232
  • [43] Using Multidimensional Solvers for Optimal Data Partitioning on Dedicated Heterogeneous HPC Platforms
    Rychkov, Vladimir
    Clarke, David
    Lastovetsky, Alexey
    PARALLEL COMPUTING TECHNOLOGIES, 2011, 6873 : 332 - +
  • [44] Implications of Satellitosis or In-transit Metastasis in Cutaneous Squamous Cell Carcinoma A Prognostic Omission in Cancer Staging Systems
    Smile, Timothy D.
    Ruiz, Emily S.
    Kus, Kylee J. B.
    Murad, Fadi
    Wei, Wei
    Xiong, David D.
    Vidimos, Allison T.
    Schmults, Chrysalyne D.
    Koyfman, Shlomo A.
    JAMA DERMATOLOGY, 2022, 158 (04) : 390 - 394
  • [45] Optimizing Industrial Applications for Heterogeneous HPC Systems: The OPTIMA Project Intermediate stage
    Theodoropoulos, Dimitris
    Michel, Olivier
    Malakonakis, Pavlos
    Georgopoulos, Konstantinos
    Isotton, Giovanni
    Pnevmatikatos, Dionisios
    Papaefstathiou, Ioannis
    Perna, Gino
    Zanotti, Marisa
    Miliadis, Panagiotis
    Mpakos, Panagiotis
    Alverti, Chloe
    Ioannou, Aggelos
    Engelen, Max
    Kahira, Albert Njoroge
    Mavroidis, Iakovos
    2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
  • [46] Containerizing HPC Applications on Heterogeneous Systems for Centralized Resource Management: A Case Study
    Kien Trung Pham
    Gia-Bao Vo-Cong
    Minh Thanh Chung
    Nam Thoai
    2018 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND APPLICATIONS (ACOMP), 2018, : 104 - 111
  • [47] Early Results of Mapping Industrial Applications on Heterogeneous HPC Systems - The OPTIMA Project
    Theodoropoulos, Dimitris
    Pekridis, Giorgos
    Miliadis, Panagiotis
    Alverti, Chloe
    Mpakos, Panagiotis
    Pnevmatikatos, Dionisios
    Malakonakis, Pavlos
    Georgopoulos, Konstantinos
    Mavroidis, Iakovos
    Perna, Gino
    Zanotti, Marisa
    Isotton, Giovanni
    Engelen, Max
    Ioannou, Aggelos
    Papaefstathiou, Ioannis
    Kahira, Albert Njoroge
    Herten, Andreas
    PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2023, CF 2023, 2023, : 304 - 308
  • [48] Exploiting Hierarchical Parallelism and Reusability in Tensor Kernel Processing on Heterogeneous HPC Systems
    Chen, Yuedan
    Xiao, Guoqing
    Ozsu, M. Tamer
    Tang, Zhuo
    Zomaya, Albert Y.
    Li, Kenli
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2522 - 2535
  • [49] The HPC-DAG Task Model for Heterogeneous Real-Time Systems
    Houssam-Eddine, Zahaf
    Capodieci, Nicola
    Cavicchioli, Roberto
    Lipari, Giuseppe
    Bertogna, Marko
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (10) : 1747 - 1761
  • [50] Topology-Aware Mapping Techniques for Heterogeneous HPC Systems: A Systematic Survey
    Alotaibi, Saad B.
    Alboraei, Fathy
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (10) : 371 - 376