A Efficient Parallel Deblocking Filter Based on GPU: Implementation and Optimization

被引:0
|
作者
Su, Huayou [1 ]
Zhang, Chunyuan [1 ]
Chai, Jun [1 ]
Yang, Qianming [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha, Hunan, Peoples R China
关键词
Deblocking filter; parallel processing; GPU; H.264/AVC;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The deblocking filter represents one of the most time consuming tasks of the H.264/AVC standard. Due to its characteristics of data dependencies and frequent memory access, it poses an arduous challenge to mapping the algorithm onto massively parallel architecture efficiently. In this paper, a novel parallel deblocking filter is proposed based on GPU, which weaken the dependencies between MBs by rearrange the filter orders of boundaries. We implemented the proposed algorithm on GPU and optimized the program through three strategies, including kernel combination, reusing the intermediate data and optimizing data representation. Experimental results show that applying the proposed parallel method supports real-time processing throughput for 1080p at 450fps. We have also observed 3.78x and 16.68x speedup for comprehensive optimization parallel deblocking filter on two-core processor and the state-of-the-art GPU-based implementation, respectively.
引用
收藏
页码:280 / 285
页数:6
相关论文
共 50 条
  • [1] A parallel implementation of deblocking filter based on video array architecture for HEVC
    Jiang, Lin
    Yang, Qian
    Zhu, Yun
    Deng, JunYong
    [J]. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2016,
  • [2] EFFICIENT DEBLOCKING FILTER IMPLEMENTATION ON RECONFIGURABLE PROCESSOR
    Maiti, Kausik
    Pasupuleti, Sirish K.
    Gadde, Raj N.
    Lee, SangJo
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1050 - 1054
  • [3] An efficient hardware implementation for deblocking filter of AVS decoder
    Huang You-wen
    [J]. 2011 2ND INTERNATIONAL CONFERENCE ON CHALLENGES IN ENVIRONMENTAL SCIENCE AND COMPUTER ENGINEERING (CESCE 2011), VOL 11, PT A, 2011, 11 : 505 - 510
  • [4] Parallel Deblocking Filtering Algorithm on GPU
    Qian, Zhou
    Jiao, Long
    Hao, Zhang
    Lei, Lei
    Zhang, Jiashu
    [J]. TENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2018), 2018, 10806
  • [5] A Hardware-Efficient Parallel Architecture for HEVC Deblocking Filter
    Ayadi, Lella Aicha
    Boubakri, Wided
    Loukil, Hassen
    Masmoudi, Nouri
    [J]. 2019 16TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2019, : 669 - 673
  • [6] A Parallel and Area-Efficient Architecture for Deblocking Filter and Adaptive Loop Filter
    Du, Juan
    Yu, Lu
    [J]. 2011 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2011, : 945 - 948
  • [7] Implementation and optimization of the wideband matched filter on the GPU
    Zhou, Hang
    Cai, Zhiming
    Wang, Ximin
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2015, 42 (03): : 135 - 140
  • [8] Design and Implementation of Efficient Streaming Deblocking and SAO Filter for HEVC Decoder
    Baldev, Swamy
    Shukla, Kaustubh
    Gogoi, Sushanta
    Rathore, Pradeep Kumar
    Peesapati, Rangababu
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2018, 64 (01) : 127 - 135
  • [9] DSP implementation of deblocking filter for AVS
    Yang, Zhigang
    Gao, Wen
    Liu, Yan
    Zhao, Debin
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 3001 - 3004
  • [10] Efficient Parallel Framework for HEVC Deblocking Filter on Many-core Platform
    Yan, Chenggang
    Zhang, Yongdong
    Dai, Feng
    Li, Liang
    [J]. 2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 530 - 530