Deep Reference Frame Generation Method for VVC Inter Prediction Enhancement

被引:0
|
作者
Jia, Jianghao [1 ]
Zhang, Yuantong [1 ]
Zhu, Han [1 ]
Chen, Zhenzhong [1 ]
Liu, Zizheng [2 ]
Xu, Xiaozhong [3 ]
Liu, Shan [3 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430072, Peoples R China
[2] Tencent Shenzhen, Shenzhen 518000, Peoples R China
[3] Tencent Amer, Palo Alto, CA 94306 USA
关键词
Interpolation; Optical flow; Extrapolation; Bidirectional control; Kernel; Encoding; Streaming media; Neural-network-based video coding; versatile video coding (VVC); inter prediction; deep learning; NETWORK;
D O I
10.1109/TCSVT.2023.3299410
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In video coding, inter prediction aims to reduce temporal redundancy by using previously encoded frames as references. The quality of reference frames is crucial to the performance of inter prediction. This paper presents a deep reference frame generation method to optimize the inter prediction in Versatile Video Coding (VVC). Specifically, reconstructed frames are sent to a well-designed frame generation network to synthesize a picture similar to the current encoding frame. The synthesized picture serves as an additional reference frame inserted into the reference picture list (RPL) to provide a more reliable reference for subsequent motion estimation (ME) and motion compensation (MC). The frame generation network employs optical flow to predict motion precisely. Moreover, an optical flow reorganization strategy is proposed to enable bi-directional and uni-directional predictions with only a single network architecture. To reasonably apply our method to VVC, we further introduce a normative modification of the temporal motion vector prediction (TMVP). Integrated into the VVC reference software VTM-15.0, the deep reference frame generation method achieves coding efficiency improvements of 5.22%, 3.61%, and 3.83% for the Y component under random access (RA), low delay B (LDB), and low delay P (LDP) configurations, respectively. The proposed method has been discussed in Joint Video Exploration Team (JVET) meeting and is currently part of Exploration Experiments (EE) for further study.
引用
收藏
页码:3111 / 3124
页数:14
相关论文
共 50 条
  • [1] Deep Reference Frame Interpolation based Inter Prediction Enhancement for Versatile Video Coding
    Jia, Jianghao
    Liu, Zizheng
    Xu, Xiaozhong
    Liu, Shan
    Chen, Zhenzhong
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [2] Faster Inter Prediction by NR-Frame in VVC
    Chan, Ka-Hou
    Im, Sio-Kei
    PROCEEDINGS OF 2023 THE 7TH INTERNATIONAL CONFERENCE ON GRAPHICS AND SIGNAL PROCESSING, ICGSP, 2023, : 24 - 28
  • [3] Faster Inter Prediction by NR-Frame in VVC
    Chan, Ka-Hou
    Im, Sio-Kei
    ACM International Conference Proceeding Series, 2023, : 24 - 28
  • [4] Deep Inter Prediction via Reference Frame Interpolation for Blurry Video Coding
    Zhu, Zezhi
    Zhao, Lili
    Lin, Xuhu
    Guo, Xuezhou
    Chen, Jianwen
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [5] Deep Affine Motion Compensation Network for Inter Prediction in VVC
    Jin, Dengchao
    Lei, Jianjun
    Peng, Bo
    Li, Wanqing
    Ling, Nam
    Huang, Qingming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3923 - 3933
  • [6] Reference Frame Generation Algorithm using Dynamical Learning PredNet for VVC
    Katayama, Takafumi
    Song, Tian
    Shimamoto, Takashi
    Jiang, Xiantao
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [7] A Hardware-Friendly and Configurable Heuristic Targeting VVC Inter-Frame Prediction
    Loose, Marta
    Viana, Ramiro
    Sagrilo, Fernando
    Sanchez, Gustavo
    Correa, Guilherme
    Agostini, Luciano
    2022 29TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (IEEE ICECS 2022), 2022,
  • [8] Deep Reference Generation With Multi-Domain Hierarchical Constraints for Inter Prediction
    Liu, Jiaying
    Xia, Sifeng
    Yang, Wenhan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (10) : 2497 - 2510
  • [9] Inter prediction multiple reference frames impact on H266-VVC encoder
    Jassem, Rana
    Damak, Taheni
    Ben Ayed, Mohamed Ali
    Masmoudi, Nouri
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 50329 - 50348
  • [10] Inter prediction multiple reference frames impact on H266-VVC encoder
    Rana Jassem
    Taheni Damak
    Mohamed Ali Ben Ayed
    Nouri Masmoudi
    Multimedia Tools and Applications, 2024, 83 : 50329 - 50348