OPTIMIZED DECOUPLED STRUCTURE WITH NON-LOCAL ATTENTION FOR DEEP IMAGE COMPRESSION

被引:0
|
作者
Zhang, Xuanye [1 ]
Zhang, Zhaobin [2 ]
Wu, Yaojun [3 ]
Esenlik, Semih [2 ]
Sun, Xiaoyan [1 ]
Zhang, Kai [2 ]
Zhang, Li [2 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Bytedance Inc, San Diego, CA 95110 USA
[3] Bytedance Inc, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Decoupled; end-to-end; neural networks; non-local attention; IEEE; 1857.11; image compression;
D O I
10.1109/ICIP51287.2024.10648246
中图分类号
学科分类号
摘要
Recently, a decoupled framework for learning-based image compression has been proposed and adopted into the JPEG AI image coding standard developed by ISO/IEC WG1. The decoupled structure disentangles the sample reconstruction process and the entropy decoding process, making the decoding extremely fast. The corresponding techniques constitute the essential parts of the JPEG AI verification model software. However, its analysis transform and synthesis transform are relatively simple, which are built with stacked convolution layers, thereby may lack the capability to interpret data correlations. In this work, we enhance the transform networks by introducing the non-local attention mechanism, which has proven efficient in image compression tasks. The proposed framework thus shares the merits of the fast decoding from the decoupled architecture and the strong transform capabilities from the non-local attention, making it a stronger candidate for practical end-to-end image codec deployment. Experimental results on the Kodak test set and JPEG AI CfP test set show that our method achieves better BDRate performance compared to the original Decoupled-anchor and significantly faster decoding speed compared to NIC. The proposed solution has been adopted by the IEEE 1857.11 Working Subgroup (1857.11 WSG) in developing neural network-based image coding standards in the 10th Meeting.
引用
收藏
页码:3681 / 3687
页数:7
相关论文
共 50 条
  • [21] Unsupervised Deep Exemplar Colorization via Pyramid Dual Non-Local Attention
    Wang, Hanzhang
    Zhai, Deming
    Liu, Xianming
    Jiang, Junjun
    Gao, Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4114 - 4127
  • [22] ENHANCED NON-LOCAL CASCADING NETWORK WITH ATTENTION MECHANISM FOR HYPERSPECTRAL IMAGE DENOISING
    Ma, Hanwen
    Liu, Ganchao
    Yuan, Yuan
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2448 - 2452
  • [23] Non-local self-attention network for image super-resolution
    Zeng, Kun
    Lin, Hanjiang
    Yan, Zhiqiang
    Fang, Jinsheng
    Lai, Taotao
    APPLIED INTELLIGENCE, 2024, 54 (07) : 5336 - 5352
  • [24] Wavelet-like Transform with Subbands Fusion in Decoupled Structure for Deep Image Compression
    Ma, Ke
    Wu, Yaojun
    Zhang, Zhaobin
    Esenlik, Semih
    Sun, Xiaoyan
    Zhang, Kai
    Zhang, Li
    2024 PICTURE CODING SYMPOSIUM, PCS 2024, 2024,
  • [25] MAGNITUDE MR IMAGE DENOISING VIA CURE-OPTIMIZED NON-LOCAL MEANS
    Hong, Jung Ook
    Luisier, Florian
    Wolfe, Patrick J.
    2012 9TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2012, : 610 - 613
  • [26] Non-local text image reconstruction
    Luong, Hiep Quang
    Philips, Wilfried
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 546 - 550
  • [27] NON-LOCAL DUAL IMAGE DENOISING
    Pierazzo, N.
    Lebrun, M.
    Rais, M. E.
    Morel, J. M.
    Facciolo, G.
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 813 - 817
  • [28] Image denoising by non-local averaging
    Buades, A
    Coll, B
    Morel, JM
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 25 - 28
  • [29] A non-local algorithm for image denoising
    Buades, A
    Coll, B
    Morel, JM
    2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 60 - 65
  • [30] Optimized Latent Features for Deep Image Compression
    Kurnianggoro, Laksono
    Jo, Kang-Hyun
    2019 12TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION (HSI), 2019, : 211 - 214