Enhancing Learned Image Compression via Cross Window-Based Attention

被引:0
|
作者
Mudgal, Priyanka [1 ]
Liu, Feng [1 ]
机构
[1] Portland State Univ, Portland, OR 97124 USA
关键词
learned image compression; end-to-end image compression;
D O I
10.1007/978-3-031-77389-1_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, learned image compression methods have demonstrated superior rate-distortion performance compared to traditional image compression methods. Recent methods utilize convolutional neural networks (CNN), variational autoencoders (VAE), invertible neural networks (INN), and transformers. Despite their significant contributions, a main drawback of these models is their poor performance in capturing local redundancy. Therefore, to leverage global features along with local redundancy, we propose a CNN-based solution integrated with a feature encoding module. The feature encoding module encodes important features before feeding them to the CNN and then utilizes cross-scale window-based attention, which further captures local redundancy. Crossscale window-based attention is inspired by the attention mechanism in transformers and effectively enlarges the receptive field. Both the feature encoding module and the cross-scale window-based attention module in our architecture are flexible and can be incorporated into any other network architecture. We evaluate our method on the Kodak and CLIC datasets and demonstrate that our approach is effective and on par with state-of-the-art methods.
引用
收藏
页码:410 / 423
页数:14
相关论文
共 50 条
  • [41] Learned Block-Based Hybrid Image Compression
    Wu, Yaojun
    Li, Xin
    Zhang, Zhizheng
    Jin, Xin
    Chen, Zhibo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3978 - 3990
  • [42] PERCEPTUAL LEARNED IMAGE COMPRESSION VIA END-TO-END JND-BASED OPTIMIZATION
    Pakdaman, Farhad
    Nami, Sanaz
    Gabbouj, Moncef
    2024 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2024, : 1146 - 1151
  • [43] Learned Image Compression With Efficient Cross-Platform Entropy Coding
    Yang, Runyu
    Liu, Dong
    Wu, Feng
    Gao, Wen
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2025, 15 (01) : 72 - 82
  • [44] Enhancing Skin Cancer Diagnosis Using Swin Transformer with Hybrid Shifted Window-Based Multi-head Self-attention and SwiGLU-Based MLP
    Pacal, Ishak
    Alaftekin, Melek
    Zengul, Ferhat Devrim
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (06): : 3174 - 3192
  • [45] An Efficient Window-Based Vision Transformer Accelerator via Mixed-Granularity Sparsity
    Dong, Qiwei
    Zhang, Siyu
    Wang, Zhongfeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2025,
  • [46] SegTrackDetect: A window-based framework for tiny object detection via semantic segmentation and tracking
    Kos, Aleksandra
    Majek, Karol
    Belter, Dominik
    SOFTWAREX, 2025, 30
  • [47] Entropy Modeling via Gaussian Process Regression for Learned Image Compression
    Cao, Maida
    Dai, Wenrui
    Li, Shaohui
    Li, Chenglin
    Zou, Junni
    Chen, Ying
    Xiong, Hongkai
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 163 - 172
  • [48] Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-pooling Strategy and Window-based Attention
    Li, Junjie
    Zhang, Zhiyu
    Chen, Minchuan
    Ma, Jun
    Wang, Shaojun
    Xiao, Jing
    INTERSPEECH 2021, 2021, : 4104 - 4108
  • [49] Memory Allocation for Window-Based Image Processing on Multiple Memory Modules with Simple Addressing Functions
    Waidyasooriya, Hasitha Muthumala
    Hariyama, Masanori
    Kameyama, Michitaka
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2011, E94A (01) : 342 - 351
  • [50] ASHFormer: Axial and Sliding Window-Based Attention With High-Resolution Transformer for Automatic Stratigraphic Correlation
    Liu, Naihao
    Li, Zhuo
    Liu, Rongchang
    Zhang, Haidong
    Gao, Jinghuai
    Wei, Tao
    Si, Jianlou
    Wu, Hao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61