OPTIMIZED DECOUPLED STRUCTURE WITH NON-LOCAL ATTENTION FOR DEEP IMAGE COMPRESSION

Authors
Zhang, Xuanye [1]
Zhang, Zhaobin [2]
Wu, Yaojun [3]
Esenlik, Semih [2]
Sun, Xiaoyan [1]
Zhang, Kai [2]
Zhang, Li [2]
Affiliations
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Bytedance Inc, San Diego, CA 95110 USA
[3] Bytedance Inc, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Decoupled; end-to-end; neural networks; non-local attention; IEEE 1857.11; image compression
DOI
10.1109/ICIP51287.2024.10648246
Abstract
Recently, a decoupled framework for learning-based image compression was proposed and adopted into the JPEG AI image coding standard developed by ISO/IEC WG1. The decoupled structure separates sample reconstruction from entropy decoding, which makes decoding extremely fast. The corresponding techniques constitute essential parts of the JPEG AI verification model software. However, its analysis and synthesis transforms are relatively simple: they are built from stacked convolution layers and may therefore lack the capacity to capture data correlations. In this work, we enhance the transform networks by introducing the non-local attention mechanism, which has proven effective in image compression tasks. The proposed framework thus combines the fast decoding of the decoupled architecture with the strong transform capability of non-local attention, making it a stronger candidate for practical end-to-end image codec deployment. Experimental results on the Kodak test set and the JPEG AI CfP test set show that our method achieves better BD-rate performance than the original Decoupled-anchor and significantly faster decoding than NIC. The proposed solution was adopted by the IEEE 1857.11 Working Subgroup (1857.11 WSG) at its 10th meeting for the development of neural network-based image coding standards.
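For readers unfamiliar with the non-local attention mechanism named in the abstract, the following is a minimal NumPy sketch of a generic non-local block (embedded-Gaussian form with a residual connection). It is an illustration only, not the transform module used in the paper: the function name `non_local_block`, the weight names `w_theta`, `w_phi`, `w_g`, `w_out`, and the tensor sizes are all assumptions, and the 1x1 convolutions are written as plain matrix products over the channel axis.

```python
import numpy as np

def softmax(a, axis=-1):
    # Numerically stable softmax along the given axis.
    a = a - a.max(axis=axis, keepdims=True)
    e = np.exp(a)
    return e / e.sum(axis=axis, keepdims=True)

def non_local_block(x, w_theta, w_phi, w_g, w_out):
    """Generic non-local block (hypothetical sketch, not the paper's module).

    x: (C, H, W) feature map.
    w_theta, w_phi, w_g: (C_inner, C) weights acting as 1x1 convolutions.
    w_out: (C, C_inner) weight projecting back to C channels.
    Returns the output feature map (C, H, W) and the attention matrix (N, N).
    """
    c, h, w = x.shape
    flat = x.reshape(c, h * w)               # (C, N) with N = H * W positions
    theta = w_theta @ flat                   # (C_inner, N) query embedding
    phi = w_phi @ flat                       # (C_inner, N) key embedding
    g = w_g @ flat                           # (C_inner, N) value embedding
    attn = softmax(theta.T @ phi, axis=-1)   # (N, N); row i weights all positions j
    y = g @ attn.T                           # y[:, i] = sum_j attn[i, j] * g[:, j]
    out = w_out @ y + flat                   # project back and add the residual
    return out.reshape(c, h, w), attn

# Usage with arbitrary, assumed sizes.
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 4, 4))
w_theta, w_phi, w_g = (rng.standard_normal((4, 8)) for _ in range(3))
w_out = rng.standard_normal((8, 4))
z, attn = non_local_block(x, w_theta, w_phi, w_g, w_out)
```

Each output position aggregates features from every other position, which is what lets such a block model long-range correlations that a stack of small convolutions reaches only through depth. The attention matrix grows as N squared in the number of spatial positions, so practical codecs typically apply it at reduced resolution.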
Pages: 3681-3687
Page count: 7