Hierarchical B-frame Video Coding Using Two-Layer CANF without Motion Coding

被引:4
|
作者
Alexandre, David [1 ]
Hang, Hsueh-Ming [2 ]
Peng, Wen-Hsiao [3 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Elect Engn & Comp Sci Int Grad Program, Hsinchu, Taiwan
[2] Natl Yang Ming Chiao Tung Univ, Inst Elect, Hsinchu, Taiwan
[3] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
ENHANCEMENT;
D O I
10.1109/CVPR52729.2023.00988
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typical video compression systems consist of two main modules: motion coding and residual coding. This general architecture is adopted by classical coding schemes (such as international standards H.265 and H.266) and deep learning-based coding schemes. We propose a novel B-frame coding architecture based on two-layer Conditional Augmented Normalization Flows (CANF). It has the striking feature of not transmitting any motion information. Our proposed idea of video compression without motion coding offers a new direction for learned video coding. Our base layer is a low-resolution image compressor that replaces the full-resolution motion compressor. The low-resolution coded image is merged with the warped high-resolution images to generate a high-quality image as a conditioning signal for the enhancement-layer image coding in full resolution. One advantage of this architecture is significantly reduced computational complexity due to eliminating the motion information compressor. In addition, we adopt a skip-mode coding technique to reduce the transmitted latent samples. The rate-distortion performance of our scheme is slightly lower than that of the state-of-the-art learned B-frame coding scheme, B-CANF, but outperforms other learned B-frame coding schemes. However, compared to B-CANF, our scheme saves 45% of multiply-accumulate operations (MACs) for encoding and 27% of MACs for decoding. The code is available at https://nycu-clab.github.io.
引用
收藏
页码:10249 / 10258
页数:10
相关论文
共 50 条
  • [1] B-CANF: Adaptive B-Frame Coding With Conditional Augmented Normalizing Flows
    Chen, Mu-Jung
    Chen, Yi-Hsin
    Peng, Wen-Hsiao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2908 - 2921
  • [2] Two-layer motion estimation algorithm for video coding
    Paramkusam, A. V.
    Reddy, V. S. K.
    ELECTRONICS LETTERS, 2014, 50 (04) : 276 - 277
  • [3] Conditional Variational Autoencoders for Hierarchical B-frame Coding
    Gao, Zong-Lin
    Chen, Cheng-Wei
    Yao, Yi-Chen
    Ho, Cheng-Yuan
    Peng, Wen-Hsiao
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [4] NOVEL TWO-LAYER MOTION ESTIMATION FOR VIDEO CODING
    A.V.Paramkusam
    V.S.K.Reddy
    Journal of Electronics(China), 2014, 31 (04) : 354 - 365
  • [5] Two-layer hierarchical coding for MPEG-2 video
    Garzelli, A
    ELECTRONICS LETTERS, 2000, 36 (20) : 1696 - 1697
  • [6] A hierarchical error concealment algorithm for entire b-frame loss in stereoscopic video coding with hierarchical B pictures
    Zhou, Yang
    Jiang, Gang-Yi
    Yu, Mei
    Hu, Fang-Ning
    Wang, Hai-Quan
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2014, 36 (02): : 377 - 383
  • [7] Scalable MPEG video coding with improved B-frame prediction
    Domanski, M
    Luczak, A
    Mackowiak, S
    ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL II: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 273 - 276
  • [8] Fast Inter Predication for B-Frame in HBP Based Stereo Video Coding
    Sun, Fengfei
    Yu, Mei
    Jiang, Zhidi
    Fu, Randi
    Fu, Songyin
    Jiang, Gangyi
    2010 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS 1-3, 2010, : 752 - 755
  • [9] Two-layer video coding using pyramid structure for ATM networks
    Lee, CB
    Hong, SH
    Park, RH
    DIGITAL COMPRESSION TECHNOLOGIES AND SYSTEMS FOR VIDEO COMMUNICATIONS, 1996, 2952 : 94 - 102
  • [10] H.264-based hierarchical two-layer lossless video coding method
    Chien, Wei-Da
    Liao, Ke-Ying
    Yang, Jar-Ferr
    IET SIGNAL PROCESSING, 2014, 8 (01) : 21 - 29