TRELLIS-CODED QUANTIZATION FOR END-TO-END LEARNED IMAGE COMPRESSION

被引:1
|
作者
Suhring, Karsten [1 ]
Schafer, Michael [1 ]
Pfaff, Jonathan [1 ]
Schwarz, Heiko [1 ,2 ]
Marpe, Detlev [1 ]
Wiegand, Thomas [1 ,3 ]
机构
[1] Fraunhofer Inst Telecommun, Heinrich Hertz Inst, Berlin, Germany
[2] Free Univ Berlin, Inst Comp Sci, Berlin, Germany
[3] Tech Univ Berlin, Dept Telecommun Syst, Berlin, Germany
关键词
Deep Learning; Auto-Encoder; Rate-Distortion-Optimization; Trellis-Coded Quantization;
D O I
10.1109/ICIP46576.2022.9897685
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of variational auto-encoders (VAE) for image compression has steadily grown in recent years, thus becoming competitive with advanced visual data compression technologies. These neural networks transform the source image into a latent space with a channel-wise representation. In most works, the latents are scalar quantized before being entropy coded. On the other hand, vector quantizers generally achieve denser packings of high-dimensional data regardless of the source distribution. Hence, low-complexity variants of these quantizers are implemented in the compression standards JPEG 2000 and Versatile Video Coding. In this paper we demonstrate coding gains by using trellis-coded quantization (TCQ) over scalar quantization. For the optimization of the networks with regard to TCQ, we employ a specific noisy representation of the features during the training stage. For variable-rate VAEs, we obtained 7.7% average BD-rate savings on the Kodak images by using TCQ over scalar quantization. When different networks per target bitrate are optimized, we report a relative coding gain of 2.4% due to TCQ.
引用
收藏
页码:3306 / 3310
页数:5
相关论文
共 50 条
  • [1] END-TO-END LEARNED IMAGE COMPRESSION WITH FIXED POINT WEIGHT QUANTIZATION
    Sun, Heming
    Cheng, Zhengxue
    Takeuchi, Masaru
    Katto, Jiro
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3359 - 3363
  • [2] Fully Integerized End-to-End Learned Image Compression
    Fang, Yimian
    Fei, Wen
    Li, Shaohui
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    [J]. 2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 337 - 337
  • [3] TRELLIS-CODED VECTOR QUANTIZATION
    FISCHER, TR
    MARCELLIN, MW
    WANG, M
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1991, 37 (06) : 1551 - 1556
  • [4] End-to-End Learned Image Compression with Augmented Normalizing Flows
    Ho, Yung-Han
    Chan, Chih-Chun
    Peng, Wen-Hsiao
    Hang, Hsueh-Ming
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1931 - 1935
  • [5] Multiple description trellis-coded quantization
    Jafarkhani, H
    Tarokh, V
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1999, 47 (06) : 799 - 803
  • [6] Symmetric trellis-coded vector quantization
    Belzer, B
    Villasenor, JD
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1997, 45 (11) : 1354 - 1357
  • [7] Punctured trellis-coded quantization and modulation
    Tian, Xiaodong
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2006, 54 (11) : 1908 - 1912
  • [8] Image coding with robust channel-optimized trellis-coded quantization
    Lam, TT
    Abousleman, GP
    Karam, LJ
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2000, 18 (06) : 940 - 951
  • [9] TRELLIS-CODED QUANTIZATION DESIGNED FOR NOISY CHANNELS
    WANG, M
    FISCHER, TR
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1994, 40 (06) : 1792 - 1802
  • [10] Hybrid Video Coding with Trellis-Coded Quantization
    Schwarz, Heiko
    Nguyen, Tung
    Marpe, Detlev
    Wiegand, Thomas
    [J]. 2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 182 - 191