Enhancing Learned Image Compression via Cross Window-Based Attention

被引:0
|
作者
Mudgal, Priyanka [1 ]
Liu, Feng [1 ]
机构
[1] Portland State Univ, Portland, OR 97124 USA
关键词
learned image compression; end-to-end image compression;
D O I
10.1007/978-3-031-77389-1_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, learned image compression methods have demonstrated superior rate-distortion performance compared to traditional image compression methods. Recent methods utilize convolutional neural networks (CNN), variational autoencoders (VAE), invertible neural networks (INN), and transformers. Despite their significant contributions, a main drawback of these models is their poor performance in capturing local redundancy. Therefore, to leverage global features along with local redundancy, we propose a CNN-based solution integrated with a feature encoding module. The feature encoding module encodes important features before feeding them to the CNN and then utilizes cross-scale window-based attention, which further captures local redundancy. Crossscale window-based attention is inspired by the attention mechanism in transformers and effectively enlarges the receptive field. Both the feature encoding module and the cross-scale window-based attention module in our architecture are flexible and can be incorporated into any other network architecture. We evaluate our method on the Kodak and CLIC datasets and demonstrate that our approach is effective and on par with state-of-the-art methods.
引用
收藏
页码:410 / 423
页数:14
相关论文
共 50 条
  • [1] The Devil Is in the Details: Window-based Attention for Image Compression
    Zou, Renjie
    Song, Chunfeng
    Zhang, Zhaoxiang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17471 - 17480
  • [2] Learned Image Compression With Adaptive Channel and Window-Based Spatial Entropy Models
    Wang, Jian
    Ling, Qiang
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (04) : 6430 - 6441
  • [3] Learned Image Compression Using Cross-Component Attention Mechanism
    Duan, Wenhong
    Chang, Zheng
    Jia, Chuanmin
    Wang, Shanshe
    Ma, Siwei
    Song, Li
    Gao, Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5478 - 5493
  • [4] Window-based image registration using variable window sizes
    Krutz, Andreas
    Frater, Michael
    Sikora, Thomas
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 2621 - +
  • [5] Hyperspectral Image Classification via Spatial Window-Based Multiview Intact Feature Learning
    Zhao, Yue
    Cheung, Yiu-ming
    You, Xinge
    Peng, Qinmu
    Peng, Jiangtao
    Yuan, Peipei
    Shi, Yufeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (03): : 2294 - 2306
  • [6] A Window-Based Self-Attention approach for sentence encoding
    Huang, Ting
    Deng, Zhi-Hong
    Shen, Gehui
    Chen, Xi
    NEUROCOMPUTING, 2020, 375 : 25 - 31
  • [7] Defect Coverage-Driven Window-Based Test Compression
    Kavousianos, Xrysovalantis
    Chakrabarty, Krishnendu
    Kalligeros, Emmanouil
    Tenentes, Vasileios
    2010 19TH IEEE ASIAN TEST SYMPOSIUM (ATS 2010), 2010, : 141 - 146
  • [8] Efficient Contextformer: Spatio-Channel Window Attention for Fast Context Modeling in Learned Image Compression
    Koyuncu, A. Burakhan
    Jia, Panqi
    Boev, Atanas
    Alshina, Elena
    Steinbach, Eckehard
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7498 - 7511
  • [9] Image dehazing using window-based integrated means filter
    Dilbag Singh
    Vijay Kumar
    Manjit Kaur
    Multimedia Tools and Applications, 2020, 79 : 34771 - 34793
  • [10] Gaze Estimation Based on Convolutional Structure and Sliding Window-Based Attention Mechanism
    Li, Yujie
    Chen, Jiahui
    Ma, Jiaxin
    Wang, Xiwen
    Zhang, Wei
    SENSORS, 2023, 23 (13)