Deep CNN based Image Compression with Redundancy Minimization via Attention Guidance

被引:7
|
作者
Mishra, Dipti [1 ,3 ]
Singh, Satish Kumar [2 ,4 ]
Singh, Rajat Kumar [2 ,5 ]
机构
[1] Mahindra Univ, Ecole Cent Sch Engn, Hyderabad, India
[2] Indian Inst Informat Technol Allahabad, Prayagraj, India
[3] Mahindra Univ, Indian Inst Informat Technol, Dept Elect & Commun Engn, Allahabad, India
[4] Indian Inst Informat Technol, Dept Informat Technol, Allahabad, India
[5] Indian Inst Informat Technol, Dept Elect & Commun Engn, Allahabad, India
关键词
Contextual loss; Compression; -decompression; Attention network; Redundancy; Multi -size kernel CNN; Perceptual loss; Style loss; TRANSFORM;
D O I
10.1016/j.neucom.2022.08.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Almost all compression algorithms try to minimize the one or other type of visual redundancy present in the image. Compression becomes challenging while considering the preservation of contextual information and other information. Without considering the contextual information, some of the unwanted features are also learned by the learning-based methods, which leads to the wastage of computational resources. Motivated by this fact, we propose an attention mechanism guided multi-size kernel convolution network-based image compression-decompression algorithm, which focuses on important (local and global) features that are needed for better reconstruction. Among various feature maps obtained after convolution at any stage, channel attention focuses on "what" is meaningful, and spatial attention focuses on "where" the important features are present in the entire feature map. Secondly, we propose to use a perceptual loss function for the task of image compression, which is a combination of contextual, style, and '-2 loss functions. The proposed network and training it with perceptual loss function helped achieve significant improvements when tested with various datasets like CLIC 2019, Tecnick, Kodak, FDDB, ECSSD, and HKU-IS datasets. When assessed on CLIC 2019 challenging dataset, the MS-SSIM and PSNR of the proposed algorithm outperformed JPEG, JPEG2000, and BPG by approximately up to 49.6%, 34.61%, 20.69%, and 10.79%, 1.32%, 3.36% respectively, at low-bit rates (around 0.1 bpp). We further investigated the effectiveness of the proposed algorithm on the cartoon images and found them to be superior to other algorithms. Lastly, as the cartoon images are significantly less available for experimentation using deep learning algorithms, we propose a cartoon image dataset, namely CARTAGE.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:397 / 411
页数:15
相关论文
共 50 条
  • [1] IMAGE SETS COMPRESSION VIA PATCH REDUNDANCY
    Corsini, Massimiliano
    Banterle, Francesco
    Ponchio, Federico
    Cignoni, Paolo
    2019 8TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP 2019), 2019, : 10 - 15
  • [2] Attention-based lightweight deep hybrid CNN framework for image restoration
    Karthikeyan, V.
    Visu, Y. Palin
    IMAGING SCIENCE JOURNAL, 2024,
  • [3] Image compression based on spatial redundancy removal and image inpainting
    Vahid BASTANI
    Mohammad Sadegh HELFROUSH
    Keyvan KASIRI
    Journal of Zhejiang University-Science C(Computer & Electronics), 2010, 11 (02) : 92 - 100
  • [4] Image compression based on spatial redundancy removal and image inpainting
    Bastani, Vahid
    Helfroush, Mohammad Sadegh
    Kasiri, Keyvan
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2010, 11 (02): : 92 - 100
  • [5] Image compression based on spatial redundancy removal and image inpainting
    Vahid Bastani
    Mohammad Sadegh Helfroush
    Keyvan Kasiri
    Journal of Zhejiang University SCIENCE C, 2010, 11 : 92 - 100
  • [6] Image compression based on spatial redundancy removal and image inpainting
    Vahid BASTANI
    Mohammad Sadegh HELFROUSH
    Keyvan KASIRI
    Frontiers of Information Technology & Electronic Engineering, 2010, (02) : 92 - 100
  • [7] An Information Model of Image Segmentation Algorithm Based on Redundancy Minimization
    Murashov, Dmitry
    2020 VI INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND NANOTECHNOLOGY (IEEE ITNT-2020), 2020,
  • [8] Enhancing Learned Image Compression via Cross Window-Based Attention
    Mudgal, Priyanka
    Liu, Feng
    ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT II, 2025, 15047 : 410 - 423
  • [9] Multimodal Medical Image Fusion using Rolling Guidance Filter with CNN and Nuclear Norm Minimization
    Liu, Shuaiqi
    Yin, Lu
    Miao, Siyu
    Ma, Jian
    Cong, Shuai
    Hu, Shaohai
    CURRENT MEDICAL IMAGING, 2020, 16 (10) : 1243 - 1258
  • [10] An Ultralightweight Hybrid CNN Based on Redundancy Removal for Hyperspectral Image Classification
    Ma, Xiaohu
    Wang, Wuli
    Li, Wei
    Wang, Jianbu
    Ren, Guangbo
    Ren, Peng
    Liu, Baodi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 12