B-CANF: Adaptive B-Frame Coding With Conditional Augmented Normalizing Flows

被引:3
|
作者
Chen, Mu-Jung [1 ]
Chen, Yi-Hsin [1 ]
Peng, Wen-Hsiao [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu 30010, Taiwan
关键词
Image coding; Encoding; Codecs; Transforms; Adaptive coding; Decoding; Video compression; Neural video coding; conditional coding; B-frame coding; VIDEO;
D O I
10.1109/TCSVT.2023.3301016
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Over the past few years, learning-based video compression has become an active research area. However, most works focus on P-frame coding. Learned B-frame coding is under-explored and more challenging. This work introduces a novel B-frame coding framework, termed B-CANF, that exploits conditional augmented normalizing flows for B-frame coding. B-CANF additionally features two novel elements: frame-type adaptive coding and B*-frames. Our frame-type adaptive coding learns better bit allocation for hierarchical B-frame coding by dynamically adapting the feature distributions according to the B-frame type. Our B*-frames allow greater flexibility in specifying the group-of-pictures (GOP) structure by reusing the B-frame codec to mimic P-frame coding, without the need for an additional, separate P-frame codec. On commonly used datasets, B-CANF achieves the state-of-the-art compression performance as compared to the other learned B-frame codecs and shows comparable BD-rate results to HM-16.23 under the random access configuration in terms of PSNR. When evaluated on different GOP structures, our B*-frames achieve similar performance to the additional use of a separate P-frame codec.
引用
收藏
页码:2908 / 2921
页数:14
相关论文
共 50 条
  • [31] HOMO-BIMETALLIC AND HETERO-BIMETALLIC B-FRAME COMPOUNDS - SOME NOVEL 12-VERTEX RU/RU, OS/OS, RU/OS, AND OS/RU METALLABORANES
    ELRINGTON, M
    GREENWOOD, NN
    KENNEDY, JD
    THORNTONPETT, M
    JOURNAL OF THE CHEMICAL SOCIETY-CHEMICAL COMMUNICATIONS, 1984, (21) : 1398 - 1399
  • [32] A UNIQUE ACETYLENE-INDUCED REDUCTIVE PPH3-]PH3 STRIPPING OF PHENYL GROUPS FROM TRIPHENYLPHOSPHINE LIGANDS BOUND TO A METALLABORANE B-FRAME MATRIX
    BOULD, J
    BRINT, P
    FONTAINE, XLR
    KENNEDY, JD
    THORNTONPETT, M
    JOURNAL OF THE CHEMICAL SOCIETY-CHEMICAL COMMUNICATIONS, 1989, (22) : 1763 - 1765
  • [33] INEXTENSIBLE FLOWS OF b-TANGENT DEVELOPABLE SURFACES OF BIHARMONIC NEW TYPE b-SLANT HELICES ACCORDING TO BISHOP FRAME IN THE SOL SPACE Sol(3)
    Korpinar, Talat
    Turhan, Essin
    JOURNAL OF SCIENCE AND ARTS, 2012, (02): : 149 - 156
  • [34] Adaptive GOP Structure Determination in Hierarchical B Picture Coding for the Extension of H.264/AVC
    Chen, Hung-Wei
    Yeh, Chia-Hung
    Chi, Ming-Chieh
    Hsu, Ching-Ting
    Chen, Mei-Juan
    2008 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEM, 2008, : 784 - 788
  • [35] CCSDS 131.2-B-1 Transmitter Design on FPGA with Adaptive Coding and Modulation Schemes for Satellite Communications
    Lamoral Coines, Adrian
    Gil Jimenez, Victor P.
    ELECTRONICS, 2021, 10 (20)
  • [36] Adaptive λ estimation in lagrangian rate-distortion optimization for video coding -: art. no. 60772B
    Chen, LL
    Garbacea, I
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2006, PTS 1 AND 2, 2006, 6077
  • [38] Block-wise adaptive motion accuracy based B-Picture coding with low-complexity motion compensation
    Ji, Xiangyang
    Zhao, Debin
    Gao, Wen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (08) : 1085 - 1090
  • [39] Adaptive Rate-Compatible Non-Binary LDPC Coding Scheme for the B5G Mobile System
    Zhao, Dan-feng
    Tian, Hai
    Xue, Rui
    SENSORS, 2019, 19 (05)
  • [40] ADAPTIVE FAST DIRECT MODE DECISION ALGORITHM USING MODE AND LAGRANGIAN COST PREDICTION FOR B FRAME IN H.264/AVC
    Jin, Xiaocong
    Sun, Jun
    Zhou, Jun
    Huang, Yiqing
    Su, Jia
    Ikenaga, Takeshi
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,