Efficient GPU and CPU-based LDPC decoders for long codewords

Cited by: 15
Authors
Grönroos, Stefan [1]
Nybom, Kristian [1]
Björkqvist, Jerker [1]
Affiliations
[1] Åbo Akademi University, FIN-20520 Turku, Finland
Keywords
DVB-T2; LDPC; SDR; CUDA; SSE; SIMD
DOI
10.1007/s10470-012-9895-7
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The next-generation DVB-T2, DVB-S2, and DVB-C2 standards for digital television broadcasting specify the use of low-density parity-check (LDPC) codes with codeword lengths of up to 64800 bits. Real-time decoding of these codes on general-purpose computing hardware is useful for completely software-defined receivers, as well as for testing and simulation purposes. Modern graphics processing units (GPUs) are capable of massively parallel computation and can in some cases, given carefully designed algorithms, outperform general-purpose central processing units (CPUs) by an order of magnitude or more. The main problem in decoding LDPC codes on GPU hardware is that LDPC decoding generates irregular memory accesses, which tend to carry heavy performance penalties on GPUs. Memory accesses can be efficiently parallelized by decoding several codewords in parallel, as well as by using appropriate data structures. In this article we present the algorithms and data structures that make log-domain decoding of the long LDPC codes specified by the DVB-T2 standard possible on a modern GPU at the high data rates required for television broadcasting. We also describe a similar decoder implemented on a general-purpose CPU, and show that high-performance LDPC decoders are also possible on modern multi-core CPUs.
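The idea of decoding several codewords in parallel with a suitable data layout can be illustrated with a small sketch. The abstract does not give the authors' actual data structures or check-node rule, so everything below is an assumption made for illustration: the function name `decode_batch` is hypothetical, the min-sum approximation stands in for whatever log-domain update the paper uses, and the parity-check matrix is a toy (7,4) Hamming code rather than a DVB-T2 matrix. The point it tries to show is the interleaved indexing `item * batch + w`, which places the same message of neighbouring codewords in neighbouring memory cells, the kind of layout that lets parallel threads or SIMD lanes read contiguous memory.

```python
import math

# Toy (7,4) Hamming parity-check matrix, used only for illustration.
# It is NOT one of the DVB-T2 LDPC matrices.
H = [
    [1, 1, 0, 1, 1, 0, 0],
    [1, 0, 1, 1, 0, 1, 0],
    [0, 1, 1, 1, 0, 0, 1],
]


def decode_batch(H, codeword_llrs, max_iters=10):
    """Min-sum decode several codewords at once (hypothetical helper).

    codeword_llrs[w][v] is the channel log-likelihood ratio of bit v in
    codeword w (positive favours bit 0). Returns one list of hard-decision
    bits per codeword.
    """
    n_vars = len(H[0])
    batch = len(codeword_llrs)
    # One edge per nonzero entry of H, plus per-check and per-variable edge lists.
    edges = [(c, v) for c, row in enumerate(H) for v, bit in enumerate(row) if bit]
    by_check = [[e for e, (c, _) in enumerate(edges) if c == k] for k in range(len(H))]
    by_var = [[e for e, (_, v) in enumerate(edges) if v == k] for k in range(n_vars)]

    # Interleaved layout: the value for item i of codeword w lives at
    # index i * batch + w, so the batch dimension is contiguous in memory.
    chan = [codeword_llrs[w][v] for v in range(n_vars) for w in range(batch)]
    v2c = [chan[v * batch + w] for (_, v) in edges for w in range(batch)]
    c2v = [0.0] * len(v2c)
    hard = [[0] * n_vars for _ in range(batch)]

    for _ in range(max_iters):
        # Check node update (min-sum): product of signs and minimum
        # magnitude over all other edges of the same check.
        for check_edges in by_check:
            for e in check_edges:
                for w in range(batch):  # innermost loop walks contiguous cells
                    sign, mag = 1.0, math.inf
                    for other in check_edges:
                        if other != e:
                            m = v2c[other * batch + w]
                            sign = -sign if m < 0 else sign
                            mag = min(mag, abs(m))
                    c2v[e * batch + w] = sign * mag

        # Variable node update: total belief, hard decision, extrinsic messages.
        for v in range(n_vars):
            for w in range(batch):
                total = chan[v * batch + w] + sum(c2v[e * batch + w] for e in by_var[v])
                hard[w][v] = 1 if total < 0 else 0
                for e in by_var[v]:
                    v2c[e * batch + w] = total - c2v[e * batch + w]

        # Stop early once every codeword satisfies every parity check.
        if all(sum(r * b for r, b in zip(row, bits)) % 2 == 0
               for bits in hard for row in H):
            break
    return hard


# Two received codewords, each with one weakly wrong bit.
received = [
    [-0.5, 2.0, 2.0, 2.0, 2.0, 2.0, 2.0],     # sent 0000000, bit 0 corrupted
    [-2.0, 2.0, 2.0, -0.5, -2.0, -2.0, 2.0],  # sent 1000110, bit 3 corrupted
]
decoded = decode_batch(H, received)
```

Pure Python gains no speed from this layout; the sketch only makes the indexing scheme concrete. On a GPU the loop over `w` would typically map to adjacent threads, and on a CPU to SIMD lanes, which is where contiguous access along the batch dimension would be expected to pay off.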
Pages: 583-595
Page count: 13
Published in: Analog Integrated Circuits and Signal Processing, 2012, 73: 583-595