RATE-DISTORTION-OPTIMIZATION FOR DEEP IMAGE COMPRESSION

被引:3
|
作者
Schaefer, Michael [1 ]
Pientka, Sophie [1 ]
Pfaff, Jonathan [1 ]
Schwarz, Heiko [1 ]
Marpe, Detlev [1 ]
Wiegand, Thomas [1 ]
机构
[1] Fraunhofer Inst Telecommun, Video Commun & Applicat Dept, Heinrich Hertz Inst, Einsteinufer 37, D-10587 Berlin, Germany
关键词
High Efficiency Video Coding (HEVC); Versatile Video Coding (VVC); Deep Learning; Auto-Encoder; Rate-Distortion-Optimization;
D O I
10.1109/ICIP42928.2021.9506513
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the capabilities of massive GPU hardware, there has been a surge of using artificial neural networks (ANN) for still image compression. These compression systems usually consist of convolutional layers and can be considered as non-linear transform coding. Notably, these ANNs are based on an end-to-end approach where the encoder determines a compressed version of the image as features. In contrast to this, existing image and video codecs employ a block-based architecture with signal-dependent encoder optimizations. A basic requirement for designing such optimizations is estimating the impact of the quantization error on the resulting bitrate and distortion. As for non-linear, multi-layered neural networks, this is a difficult problem. This paper presents a performant auto-encoder architecture for still image compression, which represents the compressed features at multiple scales. Then, we demonstrate how an algorithm, which tests multiple feature candidates, can reduce the Lagrangian cost and optimize compression efficiency. The algorithm avoids multiple network executions by pre-estimating the impact of the quantization on the distortion by a higher-order polynomial.
引用
收藏
页码:3737 / 3741
页数:5
相关论文
共 50 条
  • [41] Variable Rate Deep Image Compression With a Conditional Autoencoder
    Choi, Yoojin
    El-Khamy, Mostafa
    Lee, Jungwon
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3146 - 3154
  • [42] An embedded still image coder with rate-distortion optimization
    Li, J
    Lei, SW
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '98, PTS 1 AND 2, 1997, 3309 : 36 - 47
  • [43] Embedded still image coder with rate-distortion optimization
    Li, Jin
    Lei, Shawmin
    IEEE Transactions on Image Processing, 8 (07): : 913 - 924
  • [44] An embedded still image coder with rate-distortion optimization
    Li, J
    Lei, SM
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 1999, 8 (07) : 913 - 924
  • [45] Variable Rate Image Compression with Content Adaptive Optimization
    Guo, Tiansheng
    Wang, Jing
    Cui, Ze
    Feng, Yihui
    Ge, Yunying
    Bai, Bo
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 533 - 537
  • [46] CONCEALABILITY-RATE-DISTORTION TRADEOFF IN IMAGE COMPRESSION ANTI-FORENSICS
    Chu, Xiaoyu
    Stamm, Matthew C.
    Chen, Yan
    Liu, K. J. Ray
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3063 - 3067
  • [47] From rate-distortion theory to commercial image and video compression technology
    Ortega, A
    Ramchandran, K
    IEEE SIGNAL PROCESSING MAGAZINE, 1998, 15 (06) : 20 - 22
  • [48] Image compression using zerotrees of wavelet packets in rate-distortion sense
    Sembiring, J
    Nakabayashi, M
    Soemintapoera, K
    Akizuki, K
    APCC 2003: 9TH ASIA-PACIFIC CONFERENCE ON COMMUNICATION, VOLS 1-3, PROCEEDINGS, 2003, : 822 - 824
  • [49] RDONet: Rate-Distortion Optimized Learned Image Compression with Variable Depth
    Brand, Fabian
    Fischer, Kristian
    Kopte, Alexander
    Windsheimer, Marc
    Kaup, Andre
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1758 - 1762
  • [50] Rate-Distortion Approach to Bit Allocation in Lossy Image Set Compression
    Lerner, Camara
    Cheng, Howard
    21ST INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP 2014), 2014, : 227 - 230