RATE-DISTORTION-OPTIMIZATION FOR DEEP IMAGE COMPRESSION

被引：3

作者：

Schaefer, Michael ^{[1
]}

Pientka, Sophie ^{[1
]}

Pfaff, Jonathan ^{[1
]}

Schwarz, Heiko ^{[1
]}

Marpe, Detlev ^{[1
]}

Wiegand, Thomas ^{[1
]}

机构：

[1] Fraunhofer Inst Telecommun, Video Commun & Applicat Dept, Heinrich Hertz Inst, Einsteinufer 37, D-10587 Berlin, Germany

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021年

关键词：

High Efficiency Video Coding (HEVC); Versatile Video Coding (VVC); Deep Learning; Auto-Encoder; Rate-Distortion-Optimization;

D O I：

10.1109/ICIP42928.2021.9506513

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Given the capabilities of massive GPU hardware, there has been a surge of using artificial neural networks (ANN) for still image compression. These compression systems usually consist of convolutional layers and can be considered as non-linear transform coding. Notably, these ANNs are based on an end-to-end approach where the encoder determines a compressed version of the image as features. In contrast to this, existing image and video codecs employ a block-based architecture with signal-dependent encoder optimizations. A basic requirement for designing such optimizations is estimating the impact of the quantization error on the resulting bitrate and distortion. As for non-linear, multi-layered neural networks, this is a difficult problem. This paper presents a performant auto-encoder architecture for still image compression, which represents the compressed features at multiple scales. Then, we demonstrate how an algorithm, which tests multiple feature candidates, can reduce the Lagrangian cost and optimize compression efficiency. The algorithm avoids multiple network executions by pre-estimating the impact of the quantization on the distortion by a higher-order polynomial.

引用

页码：3737 / 3741

页数：5

共 50 条

[41] Variable Rate Deep Image Compression With a Conditional Autoencoder
Choi, Yoojin
El-Khamy, Mostafa
Lee, Jungwon
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3146 - 3154
[42] An embedded still image coder with rate-distortion optimization
Li, J
Lei, SW
VISUAL COMMUNICATIONS AND IMAGE PROCESSING '98, PTS 1 AND 2, 1997, 3309 : 36 - 47
[43] Embedded still image coder with rate-distortion optimization
Li, Jin
Lei, Shawmin
IEEE Transactions on Image Processing, 8 (07): : 913 - 924
[44] An embedded still image coder with rate-distortion optimization
Li, J
Lei, SM
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1999, 8 (07) : 913 - 924
[45] Variable Rate Image Compression with Content Adaptive Optimization
Guo, Tiansheng
Wang, Jing
Cui, Ze
Feng, Yihui
Ge, Yunying
Bai, Bo
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 533 - 537
[46] CONCEALABILITY-RATE-DISTORTION TRADEOFF IN IMAGE COMPRESSION ANTI-FORENSICS
Chu, Xiaoyu
Stamm, Matthew C.
Chen, Yan
Liu, K. J. Ray
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3063 - 3067
[47] From rate-distortion theory to commercial image and video compression technology
Ortega, A
Ramchandran, K
IEEE SIGNAL PROCESSING MAGAZINE, 1998, 15 (06) : 20 - 22
[48] Image compression using zerotrees of wavelet packets in rate-distortion sense
Sembiring, J
Nakabayashi, M
Soemintapoera, K
Akizuki, K
APCC 2003: 9TH ASIA-PACIFIC CONFERENCE ON COMMUNICATION, VOLS 1-3, PROCEEDINGS, 2003, : 822 - 824
[49] RDONet: Rate-Distortion Optimized Learned Image Compression with Variable Depth
Brand, Fabian
Fischer, Kristian
Kopte, Alexander
Windsheimer, Marc
Kaup, Andre
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1758 - 1762
[50] Rate-Distortion Approach to Bit Allocation in Lossy Image Set Compression
Lerner, Camara
Cheng, Howard
21ST INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP 2014), 2014, : 227 - 230

← 1 2 3 4 5 →