cuZ-Checker: A GPU-Based Ultra-Fast Assessment System for Lossy Compressions

Cited by: 2
Authors
Yu, Xiaodong [1]
Di, Sheng [1]
Gok, Ali Murat [2]
Tao, Dingwen [3]
Cappello, Franck [1]
Affiliations
[1] Argonne Natl Lab, Lemont, IL 60439 USA
[2] Cerebras Syst, Los Altos, CA USA
[3] Washington State Univ, Pullman, WA 99164 USA
Funding
U.S. National Science Foundation
Keywords
lossy compression; GPU; performance optimization; quality evaluation; SSIM
DOI
10.1109/Cluster48925.2021.00065
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Lossy compression is becoming an indispensable technique for the success of today's extreme-scale high-performance computing projects, which produce vast volumes of data during scientific simulations or instrument data acquisitions. Comprehensively understanding the compression quality and performance of different lossy compressors is critical to selecting the best-fit compressors and using them properly and efficiently in practice. A few lossy compression assessment tools (e.g., Z-checker) have been developed, but none of them supports execution in a GPU environment. This is a significant gap because many recent extreme-scale applications and lossy compressors (e.g., cuSZ) can run entirely within GPUs. In this work, we develop an efficient lossy compression measuring system (called cuZ-Checker) on the GPU platform, which aims to perform the lossy compression quality and performance assessment completely within the GPU environment. Our contribution is threefold. (1) We develop a novel GPU-based lossy compression measuring framework using a computation pattern-based design approach. This approach classifies the computing-intensive metrics into three categories based on their patterns, which creates large opportunities for kernel fusion and data reuse. (2) For each pattern in cuZ-Checker, we develop a CUDA kernel and provide fine-grained optimizations to boost its performance. (3) We thoroughly evaluate cuZ-Checker on a V100 GPU using four real-world scientific application datasets. Experiments show that cuZ-Checker can significantly accelerate the overall lossy compression assessment performance by 23X to 31X compared with the OpenMP-based multithreading CPU performance. To the best of our knowledge, this is the first lossy compression measuring system designed for GPU devices.
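The abstract's point about kernel fusion and data reuse can be illustrated with a small CPU-side sketch. This is not the cuZ-Checker implementation and not CUDA: the abstract does not name its three pattern categories, so the example assumes a reduction-style pattern, and the function name `fused_error_metrics` and the choice of metrics (maximum absolute error, MSE, value-range PSNR) are hypothetical. It only shows how metrics that share an access pattern can be accumulated in one pass over the data instead of one pass per metric.

```python
import math


def fused_error_metrics(original, decompressed):
    """Accumulate several error metrics in a single pass (illustrative only).

    Assumption: all metrics here follow a reduction-style pattern, so each
    element pair is read once and reused for every accumulator.
    """
    if len(original) != len(decompressed) or len(original) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    lo = hi = original[0]      # running value range of the original data
    max_abs_err = 0.0          # running maximum absolute error
    sum_sq_err = 0.0           # running sum of squared errors

    for x, y in zip(original, decompressed):
        err = x - y            # computed once, reused by both error metrics
        lo = min(lo, x)
        hi = max(hi, x)
        max_abs_err = max(max_abs_err, abs(err))
        sum_sq_err += err * err

    mse = sum_sq_err / len(original)
    value_range = hi - lo
    if mse == 0.0:
        psnr = math.inf        # identical data: no error
    elif value_range == 0.0:
        psnr = math.nan        # constant data: range-based PSNR is undefined
    else:
        psnr = 20.0 * math.log10(value_range) - 10.0 * math.log10(mse)

    return {"max_abs_err": max_abs_err, "mse": mse, "psnr": psnr}


if __name__ == "__main__":
    metrics = fused_error_metrics([0.0, 1.0, 2.0, 4.0], [0.0, 1.5, 2.0, 3.5])
    print(metrics)
```

On a GPU, the same idea would correspond to a single kernel whose threads keep several partial accumulators and combine them in one reduction, rather than launching a separate kernel per metric; how cuZ-Checker actually organizes its kernels is described in the paper itself, not here.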
Pages: 307-319
Page count: 13