An Optimized Parallel IDCT on Graphics Processing Units

Authors
Wang, Biao [1]
Alvarez-Mesa, Mauricio [1]
Chi, Chi Ching [1]
Juurlink, Ben [1]
Affiliations
[1] Tech Univ Berlin, Berlin, Germany
Keywords
IDCT; GPU; H.264; OpenCL; parallel programming
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
In this paper we present an implementation of the H.264/AVC Inverse Discrete Cosine Transform (IDCT) optimized for Graphics Processing Units (GPUs) using OpenCL. Because most IDCT input coefficients in real videos are zero-valued, we create a new compacted data representation that enables several optimizations. Experimental evaluations on different GPUs show average speedups of 1.7x to 7.4x over an optimized single-threaded SIMD CPU version.
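The idea of skipping zero-valued input can be illustrated with a minimal sketch. The abstract does not describe the paper's actual compacted representation, so everything here is an assumption made for illustration: the helper names `idct4x4`, `compact`, and `idct_sparse` are hypothetical, compaction is done per 4x4 block rather than per coefficient, and the code is sequential Python rather than OpenCL. The transform itself is the standard H.264 4x4 inverse integer transform applied to already dequantized coefficients.

```python
def idct4x4(block):
    """H.264-style 4x4 inverse integer transform.

    `block` is a list of 16 dequantized coefficients in row-major order.
    Returns 16 residual values after the final (x + 32) >> 6 rounding.
    """
    def butterfly(a, b, c, d):
        # One-dimensional inverse transform on four values.
        e0 = a + c
        e1 = a - c
        e2 = (b >> 1) - d
        e3 = b + (d >> 1)
        return e0 + e3, e1 + e2, e1 - e2, e0 - e3

    # Horizontal pass over each row.
    tmp = [0] * 16
    for r in range(4):
        tmp[4 * r:4 * r + 4] = butterfly(*block[4 * r:4 * r + 4])

    # Vertical pass over each column, followed by rounding.
    out = [0] * 16
    for c in range(4):
        col = butterfly(tmp[c], tmp[4 + c], tmp[8 + c], tmp[12 + c])
        for r in range(4):
            out[4 * r + c] = (col[r] + 32) >> 6
    return out


def compact(blocks):
    """Assumed compaction scheme: keep only blocks with a non-zero coefficient.

    Returns the original indices of the kept blocks and the packed blocks.
    """
    indices = [i for i, b in enumerate(blocks) if any(b)]
    return indices, [blocks[i] for i in indices]


def idct_sparse(blocks):
    """Transform only the non-zero blocks; all-zero blocks yield zero residuals."""
    indices, packed = compact(blocks)
    out = [[0] * 16 for _ in blocks]
    for i, b in zip(indices, packed):
        out[i] = idct4x4(b)  # on a GPU, each packed block could map to a work-item group
    return out


# Usage: three blocks, of which only the middle one carries a DC coefficient.
zero = [0] * 16
dc_only = [64] + [0] * 15
residuals = idct_sparse([zero, dc_only, zero])
```

In this sketch the work done is proportional to the number of non-zero blocks rather than to the total number of blocks, which is the general property a compacted representation is meant to expose. The result matches running `idct4x4` on every block, since an all-zero block transforms to all zeros.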
Pages: 155-164
Number of pages: 10