Hierarchical B-frame Video Coding Using Two-Layer CANF without Motion Coding

被引:4
|
作者
Alexandre, David [1 ]
Hang, Hsueh-Ming [2 ]
Peng, Wen-Hsiao [3 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Elect Engn & Comp Sci Int Grad Program, Hsinchu, Taiwan
[2] Natl Yang Ming Chiao Tung Univ, Inst Elect, Hsinchu, Taiwan
[3] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
ENHANCEMENT;
D O I
10.1109/CVPR52729.2023.00988
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typical video compression systems consist of two main modules: motion coding and residual coding. This general architecture is adopted by classical coding schemes (such as international standards H.265 and H.266) and deep learning-based coding schemes. We propose a novel B-frame coding architecture based on two-layer Conditional Augmented Normalization Flows (CANF). It has the striking feature of not transmitting any motion information. Our proposed idea of video compression without motion coding offers a new direction for learned video coding. Our base layer is a low-resolution image compressor that replaces the full-resolution motion compressor. The low-resolution coded image is merged with the warped high-resolution images to generate a high-quality image as a conditioning signal for the enhancement-layer image coding in full resolution. One advantage of this architecture is significantly reduced computational complexity due to eliminating the motion information compressor. In addition, we adopt a skip-mode coding technique to reduce the transmitted latent samples. The rate-distortion performance of our scheme is slightly lower than that of the state-of-the-art learned B-frame coding scheme, B-CANF, but outperforms other learned B-frame coding schemes. However, compared to B-CANF, our scheme saves 45% of multiply-accumulate operations (MACs) for encoding and 27% of MACs for decoding. The code is available at https://nycu-clab.github.io.
引用
收藏
页码:10249 / 10258
页数:10
相关论文
共 50 条
  • [41] Two-layer image coding compatible with JPEG XS
    Kobayashi, Hiroyuki
    Kiya, Hitoshi
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 2021, 11766
  • [42] Independent key frame coding using correlated pixels in distributed video coding
    Adikari, A. B. B.
    Fernando, W. A. C.
    Weerakkody, W. A. R. J.
    ELECTRONICS LETTERS, 2007, 43 (07) : 387 - 388
  • [43] Cross-layer frame discarding for cellular video coding
    Zhang, Chongyang
    Yu, Songyu
    Yang, Hua
    Xiong, Hongkai
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 777 - +
  • [44] Multi-Frame Motion Compensation using Extrapolated Frame by Optical Flow for Lossless Video Coding
    Kameda, Yusuke
    Kishi, Hiroyuki
    Ishikawa, Tomokazu
    Matsuda, Ichiro
    Itoh, Susumu
    2016 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2016, : 300 - 304
  • [45] Low complexity video encoding using B-frame direct modes
    Liu, YX
    Prades-Nebot, J
    Salama, P
    Delp, EJ
    IMAGE AND VIDEO COMMUNICATIONS AND PROCESSING 2005, PTS 1 AND 2, 2005, 5685 : 1065 - 1076
  • [46] Bidirectional Hierarchical Anchoring of Motion Fields for Scalable Video Coding
    Rufenacht, Dominic
    Mathew, Reji
    Taubman, David
    2014 IEEE 16TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2014,
  • [47] Hierarchical motion estimation based on visual patterns for video coding
    Zhong, S
    Chin, FC
    Cheung, YS
    Kwan, D
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 2323 - 2326
  • [48] HIERARCHICAL ANCHORING OF MOTION FIELDS FOR FULLY SCALABLE VIDEO CODING
    Ruefenacht, Dominic
    Mathew, Reji
    Taubman, David
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3180 - 3184
  • [49] Key-frame reference selection for error resilient video coding using low-delay hierarchical coding structure
    Xu, Jiajun
    Wang, Bing
    Peng, Qiang
    Li, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 18 (1) : 215 - 222
  • [50] Key-frame reference selection for error resilient video coding using low-delay hierarchical coding structure
    Jiajun Xu
    Bing Wang
    Qiang Peng
    Wei Li
    Signal, Image and Video Processing, 2024, 18 : 215 - 222