cuZ-Checker: A GPU-Based Ultra-Fast Assessment System for Lossy Compressions

被引:2
|
作者
Yu, Xiaodong [1 ]
Di, Sheng [1 ]
Gok, Ali Murat [2 ]
Tao, Dingwen [3 ]
Cappello, Franck [1 ]
机构
[1] Argonne Natl Lab, Lemont, IL 60439 USA
[2] Cerebras Syst, Los Altos, CA USA
[3] Washington State Univ, Pullman, WA 99164 USA
基金
美国国家科学基金会;
关键词
lossy compression; GPU; performance optimization; quality evaluation; SSIM;
D O I
10.1109/Cluster48925.2021.00065
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Lossy compression is becoming an indispensable technique for the success of today's extreme-scale high-performance computing projects that produce vast volumes of data during scientific simulations or instrument data acquisitions. Comprehensively understanding the compression quality and performance of different lossy compressors is critical to selecting the best-fit compressors and using them properly and efficiently in practice. A few lossy compression assessment tools (e.g., Z-checker) have been developed, but none of them support the execution in a GPU environment. This is a significant gap because many recent extreme-scale applications and lossy compressors (e.g., cuSZ) can run entirely within GPUs. In this work, we develop an efficient lossy compression measuring system (called cuZ-Checker) on the GPU platform, which aims to perform the lossy compression quality and performance assessment completely within the GPU environment. Our contribution is threefold. (1) We develop a novel GPU-based lossy compression measuring framework using a computation pattern-based design approach. This approach classifies the computing-intensive metrics into three categories based on their patterns which creates large opportunities for kernel fusion and data reuse. (2) For each pattern in cuZ-Checker, we develop a CUDA kernel and provide fine-grained optimizations to boost its performance. (3) We thoroughly evaluate our cuZ-checker on a V100 GPU using four real-world scientific application datasets. Experiments show that cuZ-Checker can significantly accelerate the overall lossy compression assessment performance by 23X similar to 31X compared with the OpenMP-based multithreading CPU performance. To the best of our knowledge, this is the first lossy compression measuring system designed for GPU devices.
引用
收藏
页码:307 / 319
页数:13
相关论文
共 32 条
  • [31] The Use of Ultra-Fast Gas Chromatography for Fingerprinting-Based Classification of Zweigelt and Rondo Wines with Regard to Grape Variety and Type of Malolactic Fermentation Combined with Greenness and Practicality Assessment
    Stoj, Anna
    Wojnowski, Wojciech
    Plotka-Wasylka, Justyna
    Czernecki, Tomasz
    Kapusta, Ireneusz Tomasz
    MOLECULES, 2024, 29 (19):
  • [32] Ultra-fast preparation of multifunctional conductive hydrogels with high mechanical strength, self-healing and self-adhesive properties based on Tara Tannin-Fe3+ dynamic redox system for strain sensors applications
    Liu, Jiachang
    Bao, Song
    Ling, Qiangjun
    Fan, Xin
    Gu, Haibin
    POLYMER, 2022, 240