Sparse/DCT (S/DCT) Two-Layered Representation of Prediction Residuals for Video Coding

被引:24
|
作者
Kang, Je-Won [1 ,2 ]
Gabbouj, Moncef [3 ]
Kuo, C. -C. Jay [4 ,5 ]
机构
[1] Qualcomm Technol Inc, Multimedia R&D, San Diego, CA 92121 USA
[2] Qualcomm Technol Inc, Standard Team, San Diego, CA 92121 USA
[3] Tampere Univ Technol, Dept Signal Proc, Tampere 33720, Finland
[4] Univ So Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90089 USA
[5] Univ So Calif, Signal & Image Proc Inst, Los Angeles, CA 90089 USA
关键词
rho domain rate model; discrete cosine transform (DCT); high efficiency video coding (HEVC); multilayered coding; overcomplete dictionary based video coding; residual coding; sparse representation; IMAGE; ALGORITHM;
D O I
10.1109/TIP.2013.2256917
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a cascaded sparse/DCT (S/DCT) two-layer representation of prediction residuals, and implement this idea on top of the state-of-the-art high efficiency video coding (HEVC) standard. First, a dictionary is adaptively trained to contain featured patterns of residual signals so that a high portion of energy in a structured residual can be efficiently coded via sparse coding. It is observed that the sparse representation alone is less effective in the R-D performance due to the side information overhead at higher bit rates. To overcome this problem, the DCT representation is cascaded at the second stage. It is applied to the remaining signal to improve coding efficiency. The two representations successfully complement each other. It is demonstrated by experimental results that the proposed algorithm outperforms the HEVC reference codec HM5.0 in the Common Test Condition.
引用
收藏
页码:2711 / 2722
页数:12
相关论文
共 50 条
  • [31] A video coding algorithm based on image warping and nonrectangular DCT coding
    Chou, YM
    Hang, HM
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '97, PTS 1-2, 1997, 3024 : 176 - 187
  • [32] Sparse Representation Approach to Inverse Halftoning in Terms of DCT Dictionary
    Ohta, Yuhri
    Aida, Toshiaki
    2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 1377 - 1380
  • [33] A multimodal fusion method for Alzheimer's disease based on DCT convolutional sparse representation
    Zhang, Guo
    Nie, Xixi
    Liu, Bangtao
    Yuan, Hong
    Li, Jin
    Sun, Weiwei
    Huang, Shixin
    FRONTIERS IN NEUROSCIENCE, 2023, 16
  • [34] Two-layered coding in evolvable hardware
    Fan, Yuanyuan
    Li, Yuanxiang
    Tu, Hang
    Yan, Xuesong
    PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 728 - 731
  • [35] Zero-Quantized Inter DCT Coefficient Prediction for Real-Time Video Coding
    Li, Jin
    Gabbouj, Moncef
    Takala, Jarmo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (02) : 249 - 259
  • [36] MODE DEPENDENT DCT/DST FOR INTRA PREDICTION IN BLOCK-BASED IMAGE/VIDEO CODING
    Saxena, Ankur
    Fernandes, Felix C.
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1685 - 1688
  • [37] Managing drift in DCT-based scalable video coding
    Reibman, AR
    Bottou, L
    DCC 2001: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2001, : 351 - 360
  • [38] A LOW-RATE VIDEO CODING BASED ON DCT VQ
    MAENG, J
    HEIN, D
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING IV, PTS 1-3, 1989, 1199 : 267 - 273
  • [39] Sparse Coding of Intra Prediction Residuals for Screen Content Coding
    Schimpf, Michael G.
    Ling, Nam
    Shi, Yunhui
    Liu, Ying
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [40] Buffer control of DCT-based intrafield video coding
    Lou, Shengqiang
    Huangpu, Kan
    Zhou, Liangzhu
    Wan, Jianwei
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 20 (04): : 59 - 64