Massively Parallel Huffman Decoding on GPUs

被引:21
|
作者
Weissenberger, Andre [1 ]
Schmidt, Bertil [2 ]
机构
[1] Goethe Univ Frankfurt, D-60323 Frankfurt, Germany
[2] Johannes Gutenberg Univ Mainz, Inst Comp Sci, D-55128 Mainz, Germany
关键词
Data compression; GPUs; CUDA; Huffman Decoding;
D O I
10.1145/3225058.3225076
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data compression is a fundamental building block in a wide range of applications. Besides its intended purpose to save valuable storage on hard disks, compression can be utilized to increase the effective bandwidth to attached storage as realized by state-of-the-art file systems. In the foreseeing future, on-the-fly compression and decompression will gain utmost importance for the processing of data-intensive applications such as streamed Deep Learning tasks or Next Generation Sequencing pipelines, which establishes the need for fast parallel implementations. Huffman coding is an integral part of a number of compression methods. However, efficient parallel implementation of Huffman decompression is difficult due to inherent data dependencies (i.e. the location of a decoded symbol depends on its predecessors). In this paper, we present the first massively parallel decoder implementation that is compatible with Huffman's original method by taking advantage of the self-synchronization property of Huffman codes. Our performance evaluation on three different CUDA-enabled GPUs (TITAN V TITAN XP, GTX 1080) demonstrates speedups of over one order-of-magnitude compared to the state-of-art CPU-based Zstandard Huffman decoder. Our implementation is available at https://github.com/weissenberger/gpuhd.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] FAST DECODING OF THE HUFFMAN CODES
    SIEMINSKI, A
    INFORMATION PROCESSING LETTERS, 1988, 26 (05) : 237 - 241
  • [32] Parallel LDPC Decoding on GPUs Using a Stream-Based Computing Approach
    Gabriel Falcão
    Shinichi Yamagiwa
    Vitor Silva
    Leonel Sousa
    Journal of Computer Science and Technology, 2009, 24 : 913 - 924
  • [33] Parallel LDPC Decoding on GPUs Using a Stream-Based Computing Approach
    Falcao, Gabriel
    Yamagiwa, Shinichi
    Silva, Vitor
    Sousa, Leonel
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2009, 24 (05) : 913 - 924
  • [34] Parallel LDPC Decoding on GPUs Using a Stream-Based Computing Approach
    Gabriel Falco
    Shinichi Yamagiwa
    Vitor Silva
    Leonel Sousa
    JournalofComputerScience&Technology, 2009, 24 (05) : 913 - 924
  • [35] A fast algorithm for Huffman decoding based on a recursion Huffman tree
    Lin, Yih-Kai
    Huang, Shu-Chien
    Yang, Cheng-Hsing
    JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (04) : 974 - 980
  • [36] Efficient parallelization of SPH algorithm on modern multi-core CPUs and massively parallel GPUs
    Jagtap, Pravin
    Nasre, Rupesh
    Sanapala, V. S.
    Patnaik, B. S., V
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2021, 12 (06)
  • [37] Massively parallel neural circuits for stereoscopic color vision: Encoding, decoding and identification
    Lazar, Aurel A.
    Slutskiy, Yevgeniy B.
    Zhou, Yiyin
    NEURAL NETWORKS, 2015, 63 : 254 - 271
  • [38] Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model
    Robin P Smith
    Leila Taher
    Rupali P Patwardhan
    Mee J Kim
    Fumitaka Inoue
    Jay Shendure
    Ivan Ovcharenko
    Nadav Ahituv
    Nature Genetics, 2013, 45 : 1021 - 1028
  • [39] Massively Parallel Inverse Block-sorting Transforms for bzip2 Decompression on GPUs
    Weissenberger, Andre
    Schmidt, Bertil
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 856 - 865
  • [40] Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model
    Smith, Robin P.
    Taher, Leila
    Patwardhan, Rupali P.
    Kim, Mee J.
    Inoue, Fumitaka
    Shendure, Jay
    Ovcharenko, Ivan
    Ahituv, Nadav
    NATURE GENETICS, 2013, 45 (09) : 1021 - +