Deep Frame Prediction for Video Coding

Cited by: 46

Authors:

Choi, Hyomin [1]

Bajic, Ivan V. [1]

Affiliations:

[1] Simon Fraser Univ, Sch Engn Sci, Burnaby, BC V5A 1S6, Canada

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2020 / Vol. 30 / No. 07

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC);

Keywords:

Video compression; frame prediction; texture prediction; deep neural network (DNN); deep learning; DESIGN;

DOI:

10.1109/TCSVT.2019.2924657

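Chinese Library Classification (CLC) codes:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification codes: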

0808 ; 0809 ;

Abstract:

We propose a novel frame prediction method using a deep neural network (DNN), with the goal of improving video coding efficiency. The proposed DNN makes use of decoded frames, at both the encoder and decoder, to predict the textures of the current coding block. Unlike conventional inter-prediction, the proposed method does not require any motion information to be transferred between the encoder and the decoder. Still, both uni-directional and bi-directional predictions are possible using the proposed DNN, which is enabled by the use of a temporal index channel in addition to the color channels. In this paper, we developed a jointly trained DNN for both uni-directional and bi-directional predictions, as well as separate networks for uni-directional and bi-directional predictions, and compared the efficacy of the two approaches. The proposed DNNs were compared with the conventional motion-compensated prediction in the latest video coding standard, High Efficiency Video Coding (HEVC), in terms of BD-bitrate. The experiments show that the proposed joint DNN (for both uni-directional and bi-directional predictions) reduces the luminance bitrate by about 4.4%, 2.4%, and 2.3% in the low delay P, low delay, and random access configurations, respectively. In addition, using the separately trained DNNs brings further bit savings of about 0.3%-0.5%.

Pages: 1843-1855

Page count: 13
