End-to-End Learned Image Compression with Augmented Normalizing Flows

被引:3
|
作者
Ho, Yung-Han [1 ]
Chan, Chih-Chun [1 ]
Peng, Wen-Hsiao [1 ,3 ]
Hang, Hsueh-Ming [2 ,3 ]
机构
[1] Natl Chiao Tung Univ, Comp Sci Dept, Hsinchu, Taiwan
[2] Natl Chiao Tung Univ, Elect Engn Dept, Hsinchu, Taiwan
[3] Natl Chiao Tung Univ, Pervas AI Res PAIR Labs, Hsinchu, Taiwan
关键词
D O I
10.1109/CVPRW53098.2021.00220
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new attempt at using augmented normalizing flows (ANF) for lossy image compression. ANF is a specific type of normalizing flow models that augment the input with an independent noise, allowing a smoother transformation from the augmented input space to the latent space. Inspired by the fact that ANF can offer greater expressivity by stacking multiple variational autoencoders (VAE), we generalize the popular VAE-based compression framework by the autoencoding transforms of ANF. When evaluated on Kodak dataset, our ANF-based model provides 3.4% higher BD-rate saving as compared with a VAE-based baseline that implements hyper-prior with mean prediction. Interestingly, it benefits even more from the incorporation of a post-processing network, showing 11.8% rate saving as compared to 6.0% with the baseline plus post-processing.
引用
收藏
页码:1931 / 1935
页数:5
相关论文
共 50 条
  • [21] End-to-end Image Compression with Swin-Transformer
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    Li, Yue
    Li, Junru
    Wang, Yue
    Wang, Shiqi
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [22] Learning End-to-End Lossy Image Compression: A Benchmark
    Hu, Yueyu
    Yang, Wenhan
    Ma, Zhan
    Liu, Jiaying
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4194 - 4211
  • [23] End-to-End Learned Random Walker for Seeded Image Segmentation
    Cerrone, Lorenzo
    Zeilmann, Alexander
    Hamprecht, Fred A.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12551 - 12560
  • [24] End-to-end image compression method based on perception metric
    Shuai Liu
    Yingcong Huang
    Huoxiang Yang
    Yongsheng Liang
    Wei Liu
    Signal, Image and Video Processing, 2022, 16 : 1803 - 1810
  • [25] End-to-end system consideration of the Galileo image compression system
    Cheung, K
    Tong, K
    Belongie, M
    IGARSS '96 - 1996 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM: REMOTE SENSING FOR A SUSTAINABLE FUTURE, VOLS I - IV, 1996, : 1035 - 1038
  • [26] End-to-End Learning-Based Image Compression: A Review
    Chen Jimin
    Lin Zehao
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
  • [27] End-to-end optimized image compression with competition of prior distributions
    Brummer, Benoit
    De Vleeschouwer, Christophe
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1890 - 1894
  • [28] An end-to-end spike-based image compression architecture
    Doutsi, Effrosyni
    Antonini, Marc
    Tsakalides, Panagiotis
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 818 - 820
  • [29] End-to-end image compression method based on perception metric
    Liu, Shuai
    Huang, Yingcong
    Yang, Huoxiang
    Liang, Yongsheng
    Liu, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1803 - 1810
  • [30] A Reference Resource Based End-to-End Image Compression Scheme
    Yin, Wenbin
    Fan, Xiaopeng
    Shi, Yunhui
    Zuo, Wangmeng
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 534 - 544