GPU-assisted HEVC intra decoder

被引:8
|
作者
de Souza, Diego F. [1 ]
Ilic, Aleksandar [1 ]
Roma, Nuno [1 ]
Sousa, Leonel [1 ]
机构
[1] Univ Lisbon, INESC ID, IST, Rua Alves Redol 9, P-1000029 Lisbon, Portugal
关键词
HEVC; Intra decoder; GPU; Parallel processing; IMPLEMENTATION; TRANSFORM;
D O I
10.1007/s11554-015-0519-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The added encoding efficiency and visual quality offered by the High Efficiency Video Coding (HEVC) standard is attained at the cost of a significant computational complexity of both the encoder and the decoder. In particular, the considerable amount of intra prediction modes that are now considered by this standard, together with the increased complexity of the adopted block coding tree structures using a larger diversity of transforms imposes demanding computational efforts that can hardly be satisfied by current general-purpose processors to attain hard real-time requirements. Furthermore, the strict data dependencies that are imposed make parallelization a difficult and hardly efficient option with conventional approaches. To circumvent this adversity, this paper exploits Graphics Processing Units (GPUs) to accelerate the intra decoding procedure in HEVC, encompassing the most demanding modules of the decoder (i.e., de-quantization, inverse transform, intra prediction, deblocking filter, and sample adaptive offset). The presented approaches comprehensively exploit both coarse and fine-grained parallelization opportunities in an integrated perspective by re-designing the execution pattern of the involved modules, while simultaneously coping with their inherent computational complexity and strict data dependencies. As a result, the proposed parallelization, which is fully compliant with the HEVC standard, has shown to be a remarkable viable approach, being capable of satisfying hard real-time requirements by processing each Ultra HD 4 K intra frame in less than 25 ms (about 40 fps).
引用
收藏
页码:531 / 547
页数:17
相关论文
共 50 条
  • [1] GPU-assisted HEVC intra decoder
    Diego F. de Souza
    Aleksandar Ilic
    Nuno Roma
    Leonel Sousa
    [J]. Journal of Real-Time Image Processing, 2016, 12 : 531 - 547
  • [2] GPU-assisted malware
    Vasiliadis, Giorgos
    Polychronakis, Michalis
    Ioannidis, Sotiris
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2015, 14 (03) : 289 - 297
  • [3] GPU-assisted malware
    Giorgos Vasiliadis
    Michalis Polychronakis
    Sotiris Ioannidis
    [J]. International Journal of Information Security, 2015, 14 : 289 - 297
  • [4] GPU-Assisted Buffer Management
    Zhong, Jianlong
    He, Bingsheng
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 362 - 371
  • [5] Collocating CPU-only jobs with GPU-assisted jobs on GPU-assisted HPC
    Wu, Jiadong
    Hong, Bo
    [J]. PROCEEDINGS OF THE 2013 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID 2013), 2013, : 418 - 425
  • [6] GPU-Assisted Memory Expansion
    Srinuan, Pisacha
    Sigdel, Purushottam
    Yuan, Xu
    Peng, Lu
    Darby, Paul
    Aucoin, Christopher
    Tzeng, Nian-Feng
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE AND STORAGE (NAS), 2021, : 132 - 139
  • [7] GPU-Assisted Simulations of SDM Systems
    Uvarov, Alexander
    Karelin, Nikolay
    Koltchanov, Igor
    Richter, Andre
    Louchet, Hadrien
    Shkred, Gena
    [J]. 2017 19TH INTERNATIONAL CONFERENCE ON TRANSPARENT OPTICAL NETWORKS (ICTON), 2017,
  • [8] GPU-assisted ray casting of large scenes
    Balciunas, Daniel A.
    Dulley, Lucas P.
    Zuffo, Marcelo K.
    [J]. RT 06: IEEE SYMPOSIUM ON INTERACTIVE RAY TRACING 2006, PROCEEDINGS, 2006, : 95 - +
  • [9] A GPU-Assisted Personal Video Organizing System
    Mohiuddin, K. Wasif
    Narayanan, P. J.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [10] A Reliable and Secure GPU-Assisted File System
    Lin, Shang-Chieh
    Liao, Yu-Cheng
    Hsu, Yarsun
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT I, 2014, 8630 : 71 - 84