END-TO-END LEARNED IMAGE COMPRESSION WITH CONDITIONAL LATENT SPACE MODELING FOR ENTROPY CODING

被引:0
|
作者
Yesilyurt, Aziz Berkay [1 ]
Kamisli, Fatih [1 ]
机构
[1] Middle East Tech Univ, Dept Elect & Elect Engn, Ankara, Turkey
关键词
image compression; transform coding; deep learning; conditional modeling;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The use of neural networks in image compression enables transforms and probability models for entropy coding which can process images based on much more complex models than the simple Gauss-Markov models in traditional compression methods. All at the expense of higher computational complexity. In the neural-network based image compression literature, various methods to model the dependencies in the transform domain/latent space are proposed. This work uses an alternative method to exploit the dependencies of the latent representation. The joint density of the latent representation is modeled as a product of conditional densities, which are learned using neural networks. However, each latent variable is not conditioned on all previous latent variables as in the chain rule of factoring joint distributions, but only on a few previous variables, in particular the left, upper and upper-left spatial neighbor variables based on a Markov property assumption for a simpler model and algorthm. The compression performance is comparable with the state-of-the-art compression models, while the conditional densities require a much simpler network and training time due to their simplicity and less number of parameters then its counterparts.
引用
收藏
页码:501 / 505
页数:5
相关论文
共 50 条
  • [41] End-to-End Learned Scalable Multilayer Feature Compression for Machine Vision Tasks
    Chen, Qiaoxi
    Gao, Changsheng
    Liu, Dong
    [J]. 2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 550 - 550
  • [42] Compression of End-to-End Models
    Pang, Ruoming
    Sainath, Tara N.
    Prabhavalkar, Rohit
    Gupta, Suyog
    Wu, Yonghui
    Zhang, Shuyuan
    Chiu, Chung-cheng
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 27 - 31
  • [43] Conditional End-to-End Audio Transforms
    Haque, Albert
    Guo, Michelle
    Verma, Prateek
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2295 - 2299
  • [44] Saliency Map-Guided End-to-End Image Coding for Machines
    Peng, Bo
    Lin, Tianxiang
    Jin, Dengchao
    Pan, Zhaoqing
    Lei, Jianjun
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1755 - 1759
  • [45] An End-to-End Deep Generative Network for Low Bitrate Image Coding
    Pei, Yifei
    Liu, Ying
    Ling, Nam
    Ren, Yongxiong
    Liu, Lingzhi
    [J]. 2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [46] End-to-End Latent Fingerprint Search
    Cao, Kai
    Dinh-Luan Nguyen
    Tymoszek, Cori
    Jain, Anil K.
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 880 - 894
  • [47] Learning-based End-to-End Video Compression Using Predictive Coding
    de Oliveira, Matheus C.
    Martins, Luiz G. R.
    Jung, Henrique Costa
    Guerin Jr, Nilson Donizete
    da Silva, Renam Castro
    Peixoto, Eduardo
    Macchiavello, Bruno
    Hung, Edson M.
    Testoni, Vanessa
    Freitas, Pedro Garcia
    [J]. 2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021), 2021, : 160 - 167
  • [48] End-to-end optimized image compression with the frequency-oriented transform
    Yuefeng Zhang
    Kai Lin
    [J]. Machine Vision and Applications, 2024, 35
  • [49] NN-based Embedment of Watermark in End-to-end Image Compression
    Lee, EunSeong
    Lee, Jongseok
    Seo, Young-Ho
    Sim, Donggyu
    [J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [50] New Results in End-to-end Image and Video Compression by Deep Learning
    Ozsoy, Gokberk
    Yilmaz, Melih
    Kirmemis, Ogun
    Tekalp, A. Murat
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,