END-TO-END LEARNED IMAGE COMPRESSION WITH CONDITIONAL LATENT SPACE MODELING FOR ENTROPY CODING

被引：0

作者：

Yesilyurt, Aziz Berkay ^{[1
]}

Kamisli, Fatih ^{[1
]}

机构：

[1] Middle East Tech Univ, Dept Elect & Elect Engn, Ankara, Turkey

来源：

28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020) | 2021年

关键词：

image compression; transform coding; deep learning; conditional modeling;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The use of neural networks in image compression enables transforms and probability models for entropy coding which can process images based on much more complex models than the simple Gauss-Markov models in traditional compression methods. All at the expense of higher computational complexity. In the neural-network based image compression literature, various methods to model the dependencies in the transform domain/latent space are proposed. This work uses an alternative method to exploit the dependencies of the latent representation. The joint density of the latent representation is modeled as a product of conditional densities, which are learned using neural networks. However, each latent variable is not conditioned on all previous latent variables as in the chain rule of factoring joint distributions, but only on a few previous variables, in particular the left, upper and upper-left spatial neighbor variables based on a Markov property assumption for a simpler model and algorthm. The compression performance is comparable with the state-of-the-art compression models, while the conditional densities require a much simpler network and training time due to their simplicity and less number of parameters then its counterparts.

引用

页码：501 / 505

页数：5

共 50 条

[41] End-to-End Learned Scalable Multilayer Feature Compression for Machine Vision Tasks
Chen, Qiaoxi
Gao, Changsheng
Liu, Dong
[J]. 2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 550 - 550
[42] Compression of End-to-End Models
Pang, Ruoming
Sainath, Tara N.
Prabhavalkar, Rohit
Gupta, Suyog
Wu, Yonghui
Zhang, Shuyuan
Chiu, Chung-cheng
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 27 - 31
[43] Conditional End-to-End Audio Transforms
Haque, Albert
Guo, Michelle
Verma, Prateek
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2295 - 2299
[44] Saliency Map-Guided End-to-End Image Coding for Machines
Peng, Bo
Lin, Tianxiang
Jin, Dengchao
Pan, Zhaoqing
Lei, Jianjun
[J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1755 - 1759
[45] An End-to-End Deep Generative Network for Low Bitrate Image Coding
Pei, Yifei
Liu, Ying
Ling, Nam
Ren, Yongxiong
Liu, Lingzhi
[J]. 2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
[46] End-to-End Latent Fingerprint Search
Cao, Kai
Dinh-Luan Nguyen
Tymoszek, Cori
Jain, Anil K.
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 880 - 894
[47] Learning-based End-to-End Video Compression Using Predictive Coding
de Oliveira, Matheus C.
Martins, Luiz G. R.
Jung, Henrique Costa
Guerin Jr, Nilson Donizete
da Silva, Renam Castro
Peixoto, Eduardo
Macchiavello, Bruno
Hung, Edson M.
Testoni, Vanessa
Freitas, Pedro Garcia
[J]. 2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021), 2021, : 160 - 167
[48] End-to-end optimized image compression with the frequency-oriented transform
Yuefeng Zhang
Kai Lin
[J]. Machine Vision and Applications, 2024, 35
[49] NN-based Embedment of Watermark in End-to-end Image Compression
Lee, EunSeong
Lee, Jongseok
Seo, Young-Ho
Sim, Donggyu
[J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
[50] New Results in End-to-end Image and Video Compression by Deep Learning
Ozsoy, Gokberk
Yilmaz, Melih
Kirmemis, Ogun
Tekalp, A. Murat
[J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,

← 1 2 3 4 5 →