Enhancing Learned Image Compression via Cross Window-Based Attention

被引:0
|
作者
Mudgal, Priyanka [1 ]
Liu, Feng [1 ]
机构
[1] Portland State Univ, Portland, OR 97124 USA
关键词
learned image compression; end-to-end image compression;
D O I
10.1007/978-3-031-77389-1_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, learned image compression methods have demonstrated superior rate-distortion performance compared to traditional image compression methods. Recent methods utilize convolutional neural networks (CNN), variational autoencoders (VAE), invertible neural networks (INN), and transformers. Despite their significant contributions, a main drawback of these models is their poor performance in capturing local redundancy. Therefore, to leverage global features along with local redundancy, we propose a CNN-based solution integrated with a feature encoding module. The feature encoding module encodes important features before feeding them to the CNN and then utilizes cross-scale window-based attention, which further captures local redundancy. Crossscale window-based attention is inspired by the attention mechanism in transformers and effectively enlarges the receptive field. Both the feature encoding module and the cross-scale window-based attention module in our architecture are flexible and can be incorporated into any other network architecture. We evaluate our method on the Kodak and CLIC datasets and demonstrate that our approach is effective and on par with state-of-the-art methods.
引用
收藏
页码:410 / 423
页数:14
相关论文
共 50 条
  • [21] Moving window-based double Haar wavelet transform for image processing
    Wang, Xin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (09) : 2771 - 2779
  • [22] Synthetic Face Discrimination via Learned Image Compression
    Iliopoulou, Sofia
    Tsinganos, Panagiotis
    Ampeliotis, Dimitris
    Skodras, Athanassios
    ALGORITHMS, 2024, 17 (09)
  • [23] IMAGE COMPRESSION VIA MULTIPLE LEARNED GEOMETRIC DICTIONARIES
    Huang, Danlan
    Tao, Xiaoming
    Xu, Mai
    Gao, Shenghua
    Lu, Jianhua
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 1373 - 1377
  • [24] Adaptive window-based multispectral image demosaicking method using pseudo-panchromatic image
    Liao, Ronghao
    Wu, Guangyuan
    Wu, Yuheng
    LASER PHYSICS LETTERS, 2025, 22 (04)
  • [25] Learned Focused Plenoptic Image Compression With Microimage Preprocessing and Global Attention
    Tong, Kedeng
    Jin, Xin
    Yang, Yuqing
    Wang, Chen
    Kang, Jinshi
    Jiang, Fan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 890 - 903
  • [26] Adaptive Moving Window-Based Non-Uniformity Correction of CMOS Image
    Wang Shiwei
    Zhang Guixiang
    Xu Wei
    Wu Yongjie
    Tao Shuping
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (14)
  • [27] A new adaptive window-based guided filtering and interpolation for polarization image demosaicing
    Xie, Fei
    Liu, Shumin
    Chen, Jiajia
    IET IMAGE PROCESSING, 2023, 17 (07) : 2238 - 2255
  • [28] Optimizing data intensive window-based image processing on reconfigurable hardware boards
    Yu, HQ
    Leeser, M
    2005 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS - DESIGN AND IMPLEMENTATION (SIPS), 2005, : 491 - 496
  • [29] Configurable hardware architecture for real-time window-based image processing
    Torres-Huitzil, C
    Arias-Estrada, M
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2003, 2778 : 1008 - 1011
  • [30] Deep CNN based Image Compression with Redundancy Minimization via Attention Guidance
    Mishra, Dipti
    Singh, Satish Kumar
    Singh, Rajat Kumar
    NEUROCOMPUTING, 2022, 507 : 397 - 411