B-CANF: Adaptive B-Frame Coding With Conditional Augmented Normalizing Flows

被引:3
|
作者
Chen, Mu-Jung [1 ]
Chen, Yi-Hsin [1 ]
Peng, Wen-Hsiao [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu 30010, Taiwan
关键词
Image coding; Encoding; Codecs; Transforms; Adaptive coding; Decoding; Video compression; Neural video coding; conditional coding; B-frame coding; VIDEO;
D O I
10.1109/TCSVT.2023.3301016
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Over the past few years, learning-based video compression has become an active research area. However, most works focus on P-frame coding. Learned B-frame coding is under-explored and more challenging. This work introduces a novel B-frame coding framework, termed B-CANF, that exploits conditional augmented normalizing flows for B-frame coding. B-CANF additionally features two novel elements: frame-type adaptive coding and B*-frames. Our frame-type adaptive coding learns better bit allocation for hierarchical B-frame coding by dynamically adapting the feature distributions according to the B-frame type. Our B*-frames allow greater flexibility in specifying the group-of-pictures (GOP) structure by reusing the B-frame codec to mimic P-frame coding, without the need for an additional, separate P-frame codec. On commonly used datasets, B-CANF achieves the state-of-the-art compression performance as compared to the other learned B-frame codecs and shows comparable BD-rate results to HM-16.23 under the random access configuration in terms of PSNR. When evaluated on different GOP structures, our B*-frames achieve similar performance to the additional use of a separate P-frame codec.
引用
收藏
页码:2908 / 2921
页数:14
相关论文
共 50 条
  • [21] ANFPCGC++: Point Cloud Geometry Coding Using Augmented Normalizing Flows and Transformer-Based Entropy Model
    Chiang, Jui-Chiu
    Chiu, Ji-Jin
    Yim, Monyneath
    IEEE Access, 2024, 12 : 163410 - 163423
  • [22] Adaptive search range scaling for B pictures coding
    Yang, Zhigang
    Gao, Wen
    Liu, Yan
    Zhao, Debin
    Advances in Multimedia Information Processing - PCM 2006, Proceedings, 2006, 4261 : 704 - 713
  • [23] The Bidirectional-Based SRMC for Hierarchical B Frame in Scalable Video Coding
    Lee, Yu-Xuan
    Liu, Hsing-Chuang
    Tsai, Tsung-Han
    2008 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2, 2008, : 908 - 911
  • [24] Adaptive frame structure in B3G-TDD uplink
    Lihua, Li
    Mingyu, Zhou
    Xiaofeng, Tao
    Ping, Zhang
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2007, 7 (08): : 985 - 993
  • [25] B-picture coding with motion compensated frame rate up-conversion
    Sasai, H
    Kondo, S
    IMAGE AND VIDEO COMMUNICATIONS AND PROCESSING 2005, PTS 1 AND 2, 2005, 5685 : 792 - 800
  • [26] A Long-Term Reference Frame for Hierarchical B-Picture-Based Video Coding
    Paul, Manoranjan
    Lin, Weisi
    Lau, Chiew-Tong
    Lee, Bu Sung
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (10) : 1729 - 1742
  • [27] Two-Layer Learning-based P-Frame Coding with Super-Resolution and Content-Adaptive Conditional ANF
    Alexandre, David
    Hang, Hsueh-Ming
    Peng, Wen-Hsiao
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [28] Novel Direct Mode Decision for H.264/AVC Inter B frame Video Coding
    Fang, Yan-Neng
    Lin, Yinyi
    Hsieh, Hui-Jane
    2013 INTERNATIONAL CONFERENCE ON COMPUTING, MANAGEMENT AND TELECOMMUNICATIONS (COMMANTEL), 2013, : 198 - 202
  • [29] Efficient Multi-Reference Frame Selection Algorithm for Hierarchical B Pictures in Multiview Video Coding
    Zhang, Yun
    Kwong, Sam
    Jiang, Gangyi
    Wang, Hanli
    IEEE TRANSACTIONS ON BROADCASTING, 2011, 57 (01) : 15 - 23
  • [30] Metallaborane reaction chemistry.: Part 7.: B-frame supported bimetallics:: Ligand-to-β-metal organometallic interaction in dimetallaboranes and an interesting ligand displacement cascade
    Kim, Y
    McKinnes, YM
    Cooke, PA
    Greatrex, R
    Kennedy, JD
    Thornton-Pett, M
    COLLECTION OF CZECHOSLOVAK CHEMICAL COMMUNICATIONS, 1999, 64 (06) : 938 - 946