Deep CNN based Image Compression with Redundancy Minimization via Attention Guidance

被引:7
|
作者
Mishra, Dipti [1 ,3 ]
Singh, Satish Kumar [2 ,4 ]
Singh, Rajat Kumar [2 ,5 ]
机构
[1] Mahindra Univ, Ecole Cent Sch Engn, Hyderabad, India
[2] Indian Inst Informat Technol Allahabad, Prayagraj, India
[3] Mahindra Univ, Indian Inst Informat Technol, Dept Elect & Commun Engn, Allahabad, India
[4] Indian Inst Informat Technol, Dept Informat Technol, Allahabad, India
[5] Indian Inst Informat Technol, Dept Elect & Commun Engn, Allahabad, India
关键词
Contextual loss; Compression; -decompression; Attention network; Redundancy; Multi -size kernel CNN; Perceptual loss; Style loss; TRANSFORM;
D O I
10.1016/j.neucom.2022.08.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Almost all compression algorithms try to minimize the one or other type of visual redundancy present in the image. Compression becomes challenging while considering the preservation of contextual information and other information. Without considering the contextual information, some of the unwanted features are also learned by the learning-based methods, which leads to the wastage of computational resources. Motivated by this fact, we propose an attention mechanism guided multi-size kernel convolution network-based image compression-decompression algorithm, which focuses on important (local and global) features that are needed for better reconstruction. Among various feature maps obtained after convolution at any stage, channel attention focuses on "what" is meaningful, and spatial attention focuses on "where" the important features are present in the entire feature map. Secondly, we propose to use a perceptual loss function for the task of image compression, which is a combination of contextual, style, and '-2 loss functions. The proposed network and training it with perceptual loss function helped achieve significant improvements when tested with various datasets like CLIC 2019, Tecnick, Kodak, FDDB, ECSSD, and HKU-IS datasets. When assessed on CLIC 2019 challenging dataset, the MS-SSIM and PSNR of the proposed algorithm outperformed JPEG, JPEG2000, and BPG by approximately up to 49.6%, 34.61%, 20.69%, and 10.79%, 1.32%, 3.36% respectively, at low-bit rates (around 0.1 bpp). We further investigated the effectiveness of the proposed algorithm on the cartoon images and found them to be superior to other algorithms. Lastly, as the cartoon images are significantly less available for experimentation using deep learning algorithms, we propose a cartoon image dataset, namely CARTAGE.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:397 / 411
页数:15
相关论文
共 50 条
  • [41] CNN deep learning-based image to vector depiction
    Waheed, Safa Riyadh
    Rahim, Mohd Shafry Mohd
    Suaib, Norhaida Mohd
    Salim, A. A.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 20283 - 20302
  • [42] Deep CNN-Based Blind Image Quality Predictor
    Kim, Jongyoo
    Anh-Duc Nguyen
    Lee, Sanghoon
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (01) : 11 - 24
  • [43] Image Classification Using an Ensemble-Based Deep CNN
    Neena, Aloysius
    Geetha, M.
    RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 3, 2018, 709 : 445 - 456
  • [44] Image Dehazing Using Residual-Based Deep CNN
    Li, Jinjiang
    Li, Guihui
    Fan, Hui
    IEEE ACCESS, 2018, 6 : 26831 - 26842
  • [45] SAR image change detection based on deep denoising and CNN
    Cao, Xianghai
    Ji, Yamei
    Wang, Lin
    Ji, Beibei
    Jiao, Licheng
    Han, Jungong
    IET IMAGE PROCESSING, 2019, 13 (09) : 1509 - 1515
  • [46] Deep CNN Prior Based Image Reconstruction for Multispectral Imaging
    Manisali, Irfan
    Cam, Refik Mert
    Bezek, Can Deniz
    Oktem, Figen S.
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [47] Real-Time Volumetric Image Guidance Via Deep Learning
    Liang, X.
    Xing, L.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [48] Joint Graph Attention and Asymmetric Convolutional Neural Network for Deep Image Compression
    Tang, Zhisen
    Wang, Hanli
    Yi, Xiaokai
    Zhang, Yun
    Kwong, Sam
    Kuo, C. -C. Jay
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (01) : 421 - 433
  • [49] OPTIMIZED DECOUPLED STRUCTURE WITH NON-LOCAL ATTENTION FOR DEEP IMAGE COMPRESSION
    Zhang, Xuanye
    Zhang, Zhaobin
    Wu, Yaojun
    Esenlik, Semih
    Sun, Xiaoyan
    Zhang, Kai
    Zhang, Li
    2024 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2024, : 3681 - 3687
  • [50] EFFICIENT DEEP LEARNING-BASED LOSSY IMAGE COMPRESSION VIA ASYMMETRIC AUTOENCODER AND PRUNING
    Kim, Jun-Hyuk
    Choi, Jun-Ho
    Chang, Jaehyuk
    Lee, Jong-Seok
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2063 - 2067