Neural Data-Dependent Transform for Learned Image Compression

被引:16
|
作者
Wang, Dezhao [1 ]
Yang, Wenhan [1 ]
Hu, Yueyu [1 ]
Liu, Jiaying [1 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
OPTIMIZATION;
D O I
10.1109/CVPR52688.2022.01686
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learned image compression has achieved great success due to its excellent modeling capacity, but seldom further considers the Rate-Distortion Optimization (RDO) of each input image. To explore this potential in the learned codec,we make the first attempt to build a neural data-dependent transform and introduce a continuous online mode decision mechanism to jointly optimize the coding efficiency for each individual image. Specifically, apart from the image content stream, we employ an additional model stream to generate the transform parameters at the decoder side. The presence of a model stream enables our model to learn more abstract neural-syntax, which helps cluster the latent representations of images more compactly. Beyond the transform stage, we also adopt neural-syntax based post-processing for the scenarios that require higher quality reconstructions regardless of extra decoding overhead. Moreover, the involvement of the model stream further makes it possible to optimize both the representation and the decoder in an online way, i.e. RDO at the testing time. It is equivalent to a continuous online mode decision, like coding modes in the traditional codecs, to improve the coding efficiency based on the individual input image. The experimental results show the effectiveness of the proposed neural-syntax design and the continuous online mode decision mechanism,demonstrating the superiority of our method in coding efficiency.Our project is available at: https://dezhaowang.github.io/Neural-Syntax-Website/.
引用
收藏
页码:17358 / 17367
页数:10
相关论文
共 50 条
  • [1] Image compression using data-dependent triangulations
    Lehner, Burkhard
    Umlauf, Georg
    Hamann, Bernd
    [J]. ADVANCES IN VISUAL COMPUTING, PT I, 2007, 4841 : 351 - +
  • [2] Image coding using data-dependent triangulation
    Rila, L
    Constantinides, AG
    [J]. DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2: SPECIAL SESSIONS, 1997, : 531 - 534
  • [3] Practical Data-Dependent Metric Compression with Provable Guarantees
    Indyk, Piotr
    Razenshteyn, Ilya
    Wagner, Tal
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [4] Image reconstruction using data-dependent triangulation
    Yu, XH
    Morse, BS
    Sederberg, TW
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2001, 21 (03) : 62 - 68
  • [5] Data dependent Haar-like transform for signal and image compression
    Avudainayagam, A
    Vani, C
    [J]. WAVELETS: APPLICATIONS IN SIGNAL AND IMAGE PROCESSING IX, 2001, 4478 : 282 - 289
  • [6] Interpretable Learned Image Compression: A Frequency Transform Decomposition Perspective
    Zhang, Yuefeng
    Lin, Kai
    Jia, Chuanmin
    Ma, Siwei
    [J]. DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 495 - 495
  • [7] Data-dependent weighted average filtering for image sequence restoration
    Meguro, M
    Taguchi, A
    Hamada, N
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2001, 84 (04): : 1 - 10
  • [8] Image interpolation by pixel-level data-dependent triangulation
    Su, D
    Willis, P
    [J]. COMPUTER GRAPHICS FORUM, 2004, 23 (02) : 189 - 201
  • [9] IMAGE-DEPENDENT LOCAL ENTROPY MODELS FOR LEARNED IMAGE COMPRESSION
    Minnen, David
    Toderici, George
    Singh, Saurabh
    Hwang, Sung Jin
    Covell, Michele
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 430 - 434
  • [10] Image Enhancement under Data-Dependent Multiplicative Gamma Noise
    Pacheeripadikkal, Jidesh
    Anattu, Bini
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2014, 2014