Sparse/DCT (S/DCT) Two-Layered Representation of Prediction Residuals for Video Coding

被引：24

作者：

Kang, Je-Won ^{[1
,2
]}

Gabbouj, Moncef ^{[3
]}

Kuo, C. -C. Jay ^{[4
,5
]}

机构：

[1] Qualcomm Technol Inc, Multimedia R&D, San Diego, CA 92121 USA

[2] Qualcomm Technol Inc, Standard Team, San Diego, CA 92121 USA

[3] Tampere Univ Technol, Dept Signal Proc, Tampere 33720, Finland

[4] Univ So Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90089 USA

[5] Univ So Calif, Signal & Image Proc Inst, Los Angeles, CA 90089 USA

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2013年 / 22卷 / 07期

关键词：

rho domain rate model; discrete cosine transform (DCT); high efficiency video coding (HEVC); multilayered coding; overcomplete dictionary based video coding; residual coding; sparse representation; IMAGE; ALGORITHM;

D O I：

10.1109/TIP.2013.2256917

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a cascaded sparse/DCT (S/DCT) two-layer representation of prediction residuals, and implement this idea on top of the state-of-the-art high efficiency video coding (HEVC) standard. First, a dictionary is adaptively trained to contain featured patterns of residual signals so that a high portion of energy in a structured residual can be efficiently coded via sparse coding. It is observed that the sparse representation alone is less effective in the R-D performance due to the side information overhead at higher bit rates. To overcome this problem, the DCT representation is cascaded at the second stage. It is applied to the remaining signal to improve coding efficiency. The two representations successfully complement each other. It is demonstrated by experimental results that the proposed algorithm outperforms the HEVC reference codec HM5.0 in the Common Test Condition.

引用

页码：2711 / 2722

页数：12

共 50 条

[31] A video coding algorithm based on image warping and nonrectangular DCT coding
Chou, YM
Hang, HM
VISUAL COMMUNICATIONS AND IMAGE PROCESSING '97, PTS 1-2, 1997, 3024 : 176 - 187
[32] Sparse Representation Approach to Inverse Halftoning in Terms of DCT Dictionary
Ohta, Yuhri
Aida, Toshiaki
2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 1377 - 1380
[33] A multimodal fusion method for Alzheimer's disease based on DCT convolutional sparse representation
Zhang, Guo
Nie, Xixi
Liu, Bangtao
Yuan, Hong
Li, Jin
Sun, Weiwei
Huang, Shixin
FRONTIERS IN NEUROSCIENCE, 2023, 16
[34] Two-layered coding in evolvable hardware
Fan, Yuanyuan
Li, Yuanxiang
Tu, Hang
Yan, Xuesong
PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 728 - 731
[35] Zero-Quantized Inter DCT Coefficient Prediction for Real-Time Video Coding
Li, Jin
Gabbouj, Moncef
Takala, Jarmo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (02) : 249 - 259
[36] MODE DEPENDENT DCT/DST FOR INTRA PREDICTION IN BLOCK-BASED IMAGE/VIDEO CODING
Saxena, Ankur
Fernandes, Felix C.
2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1685 - 1688
[37] Managing drift in DCT-based scalable video coding
Reibman, AR
Bottou, L
DCC 2001: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2001, : 351 - 360
[38] A LOW-RATE VIDEO CODING BASED ON DCT VQ
MAENG, J
HEIN, D
VISUAL COMMUNICATIONS AND IMAGE PROCESSING IV, PTS 1-3, 1989, 1199 : 267 - 273
[39] Sparse Coding of Intra Prediction Residuals for Screen Content Coding
Schimpf, Michael G.
Ling, Nam
Shi, Yunhui
Liu, Ying
2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
[40] Buffer control of DCT-based intrafield video coding
Lou, Shengqiang
Huangpu, Kan
Zhou, Liangzhu
Wan, Jianwei
Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 20 (04): : 59 - 64

← 1 2 3 4 5 →