Hierarchical B-frame Video Coding Using Two-Layer CANF without Motion Coding

被引:4
|
作者
Alexandre, David [1 ]
Hang, Hsueh-Ming [2 ]
Peng, Wen-Hsiao [3 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Elect Engn & Comp Sci Int Grad Program, Hsinchu, Taiwan
[2] Natl Yang Ming Chiao Tung Univ, Inst Elect, Hsinchu, Taiwan
[3] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
ENHANCEMENT;
D O I
10.1109/CVPR52729.2023.00988
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typical video compression systems consist of two main modules: motion coding and residual coding. This general architecture is adopted by classical coding schemes (such as international standards H.265 and H.266) and deep learning-based coding schemes. We propose a novel B-frame coding architecture based on two-layer Conditional Augmented Normalization Flows (CANF). It has the striking feature of not transmitting any motion information. Our proposed idea of video compression without motion coding offers a new direction for learned video coding. Our base layer is a low-resolution image compressor that replaces the full-resolution motion compressor. The low-resolution coded image is merged with the warped high-resolution images to generate a high-quality image as a conditioning signal for the enhancement-layer image coding in full resolution. One advantage of this architecture is significantly reduced computational complexity due to eliminating the motion information compressor. In addition, we adopt a skip-mode coding technique to reduce the transmitted latent samples. The rate-distortion performance of our scheme is slightly lower than that of the state-of-the-art learned B-frame coding scheme, B-CANF, but outperforms other learned B-frame coding schemes. However, compared to B-CANF, our scheme saves 45% of multiply-accumulate operations (MACs) for encoding and 27% of MACs for decoding. The code is available at https://nycu-clab.github.io.
引用
收藏
页码:10249 / 10258
页数:10
相关论文
共 50 条
  • [21] An efficient multi-layer reference frame motion estimation for video coding
    Adapa Venkata Paramkusam
    Vustikayala Siva Kumar Reddy
    Journal of Real-Time Image Processing, 2016, 11 : 645 - 661
  • [22] An efficient multi-layer reference frame motion estimation for video coding
    Paramkusam, Adapa Venkata
    Reddy, Vustikayala Siva Kumar
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2016, 11 (04) : 645 - 661
  • [23] The Bidirectional-Based SRMC for Hierarchical B Frame in Scalable Video Coding
    Lee, Yu-Xuan
    Liu, Hsing-Chuang
    Tsai, Tsung-Han
    2008 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2, 2008, : 908 - 911
  • [24] Two-layer video coding and priority statistical multiplexing over ATM networks
    Gao, CW
    Meditch, JS
    1996 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS - CONVERGING TECHNOLOGIES FOR TOMORROW'S APPLICATIONS, VOLS. 1-3, 1996, : 127 - 136
  • [25] Multilayer reference frame motion estimation for video coding
    A. V. Paramkusam
    V. S. K. Reddy
    Signal, Image and Video Processing, 2015, 9 : 1851 - 1860
  • [26] Multilayer reference frame motion estimation for video coding
    Paramkusam, A. V.
    Reddy, V. S. K.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 (08) : 1851 - 1860
  • [27] Combined frame memory motion compensation for video coding
    Chang, Nelson Yen-Chung
    Chang, Tian-Sheuan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2006, 16 (10) : 1280 - 1285
  • [28] VIDEO CODING BY SEGMENTING MOTION VECTORS AND FRAME DIFFERENCES
    CHAE, SB
    KIM, JS
    PARK, RH
    OPTICAL ENGINEERING, 1993, 32 (04) : 870 - 876
  • [29] Multiple description video coding using hierarchical B pictures
    Liu, Minglei
    Zhu, Ce
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1367 - 1370
  • [30] Adaptive frame/field motion compensated video coding
    Puri, Atul
    Aravind, R.
    Haskell, Barry
    Signal Processing: Image Communication, 1993, 5 (1-2) : 39 - 58