Deep Frame Prediction for Video Coding

被引:46
|
作者
Choi, Hyomin [1 ]
Bajic, Ivan V. [1 ]
机构
[1] Simon Fraser Univ, Sch Engn Sci, Burnaby, BC V5A 1S6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Video compression; frame prediction; texture prediction; deep neural network (DNN); deep learning; DESIGN;
D O I
10.1109/TCSVT.2019.2924657
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose a novel frame prediction method using a deep neural network (DNN), with the goal of improving the video coding efficiency. The proposed DNN makes use of decoded frames, at both the encoder and decoder to predict the textures of the current coding block. Unlike conventional inter-prediction, the proposed method does not require any motion information to be transferred between the encoder and the decoder. Still, both the uni-directional and bi-directional predictions are possible using the proposed DNN, which is enabled by the use of the temporal index channel, in addition to the color channels. In this paper, we developed a jointly trained DNN for both uni-directional and bi-directional predictions, as well as separate networks for uni-directional and bi-directional predictions, and compared the efficacy of both the approaches. The proposed DNNs were compared with the conventional motion-compensated prediction in the latest video coding standard, High Efficiency Video Coding (HEVC), in terms of the BD-bitrate. The experiments show that the proposed joint DNN (for both uni-directional and bi-directional predictions) reduces the luminance bitrate by about 4.4%, 2.4%, and 23% in the low delay P, low delay, and random access configurations, respectively. In addition, using the separately trained DNNs brings further bit savings of about 03%-0.5%.
引用
收藏
页码:1843 / 1855
页数:13
相关论文
共 50 条
  • [21] Can learned frame prediction compete with block motion compensation for video coding?
    Serkan Sulun
    A. Murat Tekalp
    Signal, Image and Video Processing, 2021, 15 : 401 - 410
  • [22] FOR/SOR VIDEO CODING WITH SUPER MACROBLOCK AND INTER-FRAME STRIPE PREDICTION
    Kang, Je-Won
    Kim, Seung-Hwan
    Kuo, C. -C. Jay
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 821 - 824
  • [23] Video Frame Prediction by Deep Multi-Branch Mask Network
    Li, Sen
    Fang, Jianwu
    Xu, Hongke
    Xue, Jianru
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (04) : 1283 - 1295
  • [24] On the encoding of the anchor frame in video coding
    Kondi, LP
    Katsaggelos, AK
    INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 1997 DIGEST OF TECHNICAL PAPERS, 1997, : 18 - 19
  • [25] On the encoding of the anchor frame in video coding
    Kondi, LP
    Katsaggelos, AK
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1997, 43 (03) : 279 - 285
  • [26] ENHANCED CTU-LEVEL INTER PREDICTION WITH DEEP FRAME RATE UP-CONVERSION FOR HIGH EFFICIENCY VIDEO CODING
    Zhao, Lei
    Wang, Shiqi
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    Gao, Wen
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 206 - 210
  • [27] Reduced frame quantization in video coding
    Toivonen, Tuukka
    Heikkila, Janne
    VISUAL CONTENT PROCESSING AND REPRESENTATION, 2006, 3893 : 61 - 67
  • [28] A New Approach to Video Coding Leveraging Hybrid Coding and Video Frame Interpolation
    Brascher, Andre Beims
    da Silveira, Gabriela Furtado
    Cancellier, Luiz Henrique
    Seidel, Ismael
    Grellert, Mateus
    Guntzel, Jose Luis
    2023 36TH SBC/SBMICRO/IEEE/ACM SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN, SBCCI, 2023, : 161 - 166
  • [29] CONTENT-ADAPTIVE REFERENCE FRAME COMPRESSION BASED ON INTRA-FRAME PREDICTION FOR MULTIVIEW VIDEO CODING
    Sampaio, Felipe
    Zatt, Bruno
    Shafique, Muhammad
    Agostini, Luciano
    Henkel, Joerg
    Bampi, Sergio
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 1831 - 1835
  • [30] Enhanced Motion-Compensated Video Coding With Deep Virtual Reference Frame Generation
    Zhao, Lei
    Wang, Shiqi
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    Gao, Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4832 - 4844