Decoder-Side Cross Resolution Synthesis for Video Compression Enhancement

Cited by: 4
Authors
Lu, Ming [1]
Chen, Tong [1]
Dai, Zhenyu [2]
Wang, Dong [2]
Ding, Dandan [3]
Ma, Zhan [1]
Affiliations
[1] Nanjing Univ, Nanjing 210023, Peoples R China
[2] OPPO Inc, Nanjing, Peoples R China
[3] Hangzhou Normal Univ, Hangzhou 310030, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Video coding; cross resolution synthesis; super resolution; deep learning; SUPERRESOLUTION;
DOI
10.1109/TMM.2022.3142414
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper proposes a decoder-side Cross Resolution Synthesis (CRS) module to pursue better compression efficiency beyond the latest Versatile Video Coding (VVC) standard. Intra frames are encoded at the original high resolution (HR), inter frames are compressed at a lower resolution (LR), and the decoded LR inter frames are then super-resolved with the help of preceding HR intra and neighboring LR inter frames. For an LR inter frame, a motion alignment and aggregation network (MAN) produces a temporally aggregated motion representation to preserve temporal smoothness; a texture compensation network (TCN) generates a texture representation from the decoded HR intra frame to better augment spatial details; finally, a similarity-driven fusion engine synthesizes the motion and texture representations to upscale the LR inter frames and remove compression and resolution re-sampling noise. Enhancing VVC with the proposed CRS yields average Bjontegaard Delta Rate (BD-Rate) gains of 8.76% and 11.93% against the latest VVC anchor in the Random Access (RA) and Low-delay P (LDP) settings, respectively. In addition, experimental comparisons with state-of-the-art super-resolution (SR) based VVC enhancement methods, together with ablation studies, further demonstrate the superior efficiency and generalization of the proposed algorithm. All materials will be made public at https://njuvision.github.io/CRS for reproducible research.
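The cross-resolution idea in the abstract can be illustrated with a toy sketch. This is not the authors' method: the learned MAN and TCN networks and the fusion engine are replaced here by hand-written stand-ins, motion alignment is omitted entirely, and the function names (`downsample`, `upsample`, `synthesize`) and the `sigma` parameter are hypothetical. The sketch only shows the general shape of the pipeline: up-sample a decoded LR inter frame, then add high-frequency detail from the HR intra frame wherever the two frames look similar at low resolution.

```python
import math

def downsample(frame, s=2):
    """Average-pool a 2-D list by factor s (stand-in for down-sampling before encoding).
    Assumes the frame height and width are multiples of s."""
    h, w = len(frame), len(frame[0])
    return [[sum(frame[y + dy][x + dx] for dy in range(s) for dx in range(s)) / (s * s)
             for x in range(0, w, s)]
            for y in range(0, h, s)]

def upsample(frame, s=2):
    """Nearest-neighbour up-sampling by factor s."""
    return [[frame[y // s][x // s] for x in range(len(frame[0]) * s)]
            for y in range(len(frame) * s)]

def synthesize(lr_inter, hr_intra, s=2, sigma=8.0):
    """Toy similarity-weighted detail transfer (an assumption, not the paper's networks).

    base    : the LR inter frame brought to HR size
    ref_low : the HR intra frame with its high frequencies removed
    detail  : what the HR intra frame has beyond ref_low
    weight  : close to 1 where base and ref_low agree, close to 0 where they differ
    """
    base = upsample(lr_inter, s)
    ref_low = upsample(downsample(hr_intra, s), s)
    out = []
    for y in range(len(base)):
        row = []
        for x in range(len(base[0])):
            diff = base[y][x] - ref_low[y][x]
            weight = math.exp(-(diff * diff) / (2.0 * sigma * sigma))
            detail = hr_intra[y][x] - ref_low[y][x]
            row.append(base[y][x] + weight * detail)
        out.append(row)
    return out

# Static scene: the inter frame is just the intra frame at lower resolution.
hr_intra = [[10, 20, 30, 40],
            [20, 30, 40, 50],
            [30, 40, 50, 60],
            [40, 50, 60, 70]]
lr_inter = downsample(hr_intra)
restored = synthesize(lr_inter, hr_intra)
```

In this static-scene case the similarity weight is 1 everywhere, so the HR detail is transferred in full and `restored` matches `hr_intra`. Where the inter frame differs strongly from the reference, the weight falls toward 0 and the output stays close to the plain up-sampled frame, which is the behaviour a similarity-driven fusion is meant to have.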
Pages: 2097-2110
Page count: 14