END-TO-END LEARNED IMAGE COMPRESSION WITH CONDITIONAL LATENT SPACE MODELING FOR ENTROPY CODING

被引:0
|
作者
Yesilyurt, Aziz Berkay [1 ]
Kamisli, Fatih [1 ]
机构
[1] Middle East Tech Univ, Dept Elect & Elect Engn, Ankara, Turkey
关键词
image compression; transform coding; deep learning; conditional modeling;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The use of neural networks in image compression enables transforms and probability models for entropy coding which can process images based on much more complex models than the simple Gauss-Markov models in traditional compression methods. All at the expense of higher computational complexity. In the neural-network based image compression literature, various methods to model the dependencies in the transform domain/latent space are proposed. This work uses an alternative method to exploit the dependencies of the latent representation. The joint density of the latent representation is modeled as a product of conditional densities, which are learned using neural networks. However, each latent variable is not conditioned on all previous latent variables as in the chain rule of factoring joint distributions, but only on a few previous variables, in particular the left, upper and upper-left spatial neighbor variables based on a Markov property assumption for a simpler model and algorthm. The compression performance is comparable with the state-of-the-art compression models, while the conditional densities require a much simpler network and training time due to their simplicity and less number of parameters then its counterparts.
引用
收藏
页码:501 / 505
页数:5
相关论文
共 50 条
  • [1] CONTENT-ADAPTIVE PARALLEL ENTROPY CODING FOR END-TO-END IMAGE COMPRESSION
    Li, Shujia
    Wang, Dezhao
    Fan, Zejia
    Liu, Jiaying
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3195 - 3199
  • [2] IMAGE CODING FOR MACHINES: AN END-TO-END LEARNED APPROACH
    Le, Nam
    Zhang, Honglei
    Cricri, Francesco
    Ghaznavi-Youvalari, Ramin
    Rahtu, Esa
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1590 - 1594
  • [3] Unveiling the Future of Human and Machine Coding: A Survey of End-to-End Learned Image Compression
    Huang, Chen-Hsiu
    Wu, Ja-Ling
    [J]. ENTROPY, 2024, 26 (05)
  • [4] Fully Integerized End-to-End Learned Image Compression
    Fang, Yimian
    Fei, Wen
    Li, Shaohui
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    [J]. 2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 337 - 337
  • [5] End-to-End Learned Image Compression with Augmented Normalizing Flows
    Ho, Yung-Han
    Chan, Chih-Chun
    Peng, Wen-Hsiao
    Hang, Hsueh-Ming
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1931 - 1935
  • [6] END-TO-END LEARNED IMAGE COMPRESSION WITH FIXED POINT WEIGHT QUANTIZATION
    Sun, Heming
    Cheng, Zhengxue
    Takeuchi, Masaru
    Katto, Jiro
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3359 - 3363
  • [7] TRELLIS-CODED QUANTIZATION FOR END-TO-END LEARNED IMAGE COMPRESSION
    Suhring, Karsten
    Schafer, Michael
    Pfaff, Jonathan
    Schwarz, Heiko
    Marpe, Detlev
    Wiegand, Thomas
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3306 - 3310
  • [8] An end-to-end dynamic point cloud geometry compression in latent space
    Jiang, Zhaoyi
    Wang, Guoliang
    Tam, Gary K. L.
    Song, Chao
    Li, Frederick W. B.
    Yang, Bailin
    [J]. DISPLAYS, 2023, 80
  • [9] Reducing The Amortization Gap of Entropy Bottleneck In End-to-End Image Compression
    Balcilar, Muhammet
    Damodaran, Bharath
    Hellier, Pierre
    [J]. 2022 PICTURE CODING SYMPOSIUM (PCS), 2022, : 115 - 119
  • [10] Spatial-Channel Context-Based Entropy Modeling for End-to-end Optimized Image Compression
    Li, Chongxin
    Luo, Jixiang
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 222 - 225