Massively Parallel Huffman Decoding on GPUs

被引：21

作者：

Weissenberger, Andre ^{[1
]}

Schmidt, Bertil ^{[2
]}

机构：

[1] Goethe Univ Frankfurt, D-60323 Frankfurt, Germany

[2] Johannes Gutenberg Univ Mainz, Inst Comp Sci, D-55128 Mainz, Germany

来源：

PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING | 2018年

关键词：

Data compression; GPUs; CUDA; Huffman Decoding;

D O I：

10.1145/3225058.3225076

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Data compression is a fundamental building block in a wide range of applications. Besides its intended purpose to save valuable storage on hard disks, compression can be utilized to increase the effective bandwidth to attached storage as realized by state-of-the-art file systems. In the foreseeing future, on-the-fly compression and decompression will gain utmost importance for the processing of data-intensive applications such as streamed Deep Learning tasks or Next Generation Sequencing pipelines, which establishes the need for fast parallel implementations. Huffman coding is an integral part of a number of compression methods. However, efficient parallel implementation of Huffman decompression is difficult due to inherent data dependencies (i.e. the location of a decoded symbol depends on its predecessors). In this paper, we present the first massively parallel decoder implementation that is compatible with Huffman's original method by taking advantage of the self-synchronization property of Huffman codes. Our performance evaluation on three different CUDA-enabled GPUs (TITAN V TITAN XP, GTX 1080) demonstrates speedups of over one order-of-magnitude compared to the state-of-art CPU-based Zstandard Huffman decoder. Our implementation is available at https://github.com/weissenberger/gpuhd.

引用

页数：10

共 50 条

[31] FAST DECODING OF THE HUFFMAN CODES
SIEMINSKI, A
INFORMATION PROCESSING LETTERS, 1988, 26 (05) : 237 - 241
[32] Parallel LDPC Decoding on GPUs Using a Stream-Based Computing Approach
Gabriel Falcão
Shinichi Yamagiwa
Vitor Silva
Leonel Sousa
Journal of Computer Science and Technology, 2009, 24 : 913 - 924
[33] Parallel LDPC Decoding on GPUs Using a Stream-Based Computing Approach
Falcao, Gabriel
Yamagiwa, Shinichi
Silva, Vitor
Sousa, Leonel
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2009, 24 (05) : 913 - 924
[34] Parallel LDPC Decoding on GPUs Using a Stream-Based Computing Approach
Gabriel Falco
Shinichi Yamagiwa
Vitor Silva
Leonel Sousa
JournalofComputerScience&Technology, 2009, 24 (05) : 913 - 924
[35] A fast algorithm for Huffman decoding based on a recursion Huffman tree
Lin, Yih-Kai
Huang, Shu-Chien
Yang, Cheng-Hsing
JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (04) : 974 - 980
[36] Efficient parallelization of SPH algorithm on modern multi-core CPUs and massively parallel GPUs
Jagtap, Pravin
Nasre, Rupesh
Sanapala, V. S.
Patnaik, B. S., V
INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2021, 12 (06)
[37] Massively parallel neural circuits for stereoscopic color vision: Encoding, decoding and identification
Lazar, Aurel A.
Slutskiy, Yevgeniy B.
Zhou, Yiyin
NEURAL NETWORKS, 2015, 63 : 254 - 271
[38] Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model
Robin P Smith
Leila Taher
Rupali P Patwardhan
Mee J Kim
Fumitaka Inoue
Jay Shendure
Ivan Ovcharenko
Nadav Ahituv
Nature Genetics, 2013, 45 : 1021 - 1028
[39] Massively Parallel Inverse Block-sorting Transforms for bzip2 Decompression on GPUs
Weissenberger, Andre
Schmidt, Bertil
53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 856 - 865
[40] Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model
Smith, Robin P.
Taher, Leila
Patwardhan, Rupali P.
Kim, Mee J.
Inoue, Fumitaka
Shendure, Jay
Ovcharenko, Ivan
Ahituv, Nadav
NATURE GENETICS, 2013, 45 (09) : 1021 - +

← 1 2 3 4 5 →