Deep CNN based Image Compression with Redundancy Minimization via Attention Guidance

被引:7
|
作者
Mishra, Dipti [1 ,3 ]
Singh, Satish Kumar [2 ,4 ]
Singh, Rajat Kumar [2 ,5 ]
机构
[1] Mahindra Univ, Ecole Cent Sch Engn, Hyderabad, India
[2] Indian Inst Informat Technol Allahabad, Prayagraj, India
[3] Mahindra Univ, Indian Inst Informat Technol, Dept Elect & Commun Engn, Allahabad, India
[4] Indian Inst Informat Technol, Dept Informat Technol, Allahabad, India
[5] Indian Inst Informat Technol, Dept Elect & Commun Engn, Allahabad, India
关键词
Contextual loss; Compression; -decompression; Attention network; Redundancy; Multi -size kernel CNN; Perceptual loss; Style loss; TRANSFORM;
D O I
10.1016/j.neucom.2022.08.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Almost all compression algorithms try to minimize the one or other type of visual redundancy present in the image. Compression becomes challenging while considering the preservation of contextual information and other information. Without considering the contextual information, some of the unwanted features are also learned by the learning-based methods, which leads to the wastage of computational resources. Motivated by this fact, we propose an attention mechanism guided multi-size kernel convolution network-based image compression-decompression algorithm, which focuses on important (local and global) features that are needed for better reconstruction. Among various feature maps obtained after convolution at any stage, channel attention focuses on "what" is meaningful, and spatial attention focuses on "where" the important features are present in the entire feature map. Secondly, we propose to use a perceptual loss function for the task of image compression, which is a combination of contextual, style, and '-2 loss functions. The proposed network and training it with perceptual loss function helped achieve significant improvements when tested with various datasets like CLIC 2019, Tecnick, Kodak, FDDB, ECSSD, and HKU-IS datasets. When assessed on CLIC 2019 challenging dataset, the MS-SSIM and PSNR of the proposed algorithm outperformed JPEG, JPEG2000, and BPG by approximately up to 49.6%, 34.61%, 20.69%, and 10.79%, 1.32%, 3.36% respectively, at low-bit rates (around 0.1 bpp). We further investigated the effectiveness of the proposed algorithm on the cartoon images and found them to be superior to other algorithms. Lastly, as the cartoon images are significantly less available for experimentation using deep learning algorithms, we propose a cartoon image dataset, namely CARTAGE.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:397 / 411
页数:15
相关论文
共 50 条
  • [31] Inpainting Electrical Logging Images Based on Deep CNN with Attention Mechanisms
    Du, Chunyu
    Xing, Qiang
    Zhang, Jinyan
    Wang, Jun
    Liu, Baodi
    Wang, Yanjiang
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 607 - 610
  • [32] Deep Scalable Image Compression via Hierarchical Feature Decorrelation
    Guo, Zongyu
    Zhang, Zhizheng
    Chen, Zhibo
    2019 PICTURE CODING SYMPOSIUM (PCS), 2019,
  • [33] LAYERED CONCEPTUAL IMAGE COMPRESSION VIA DEEP SEMANTIC SYNTHESIS
    Chang, Jianhui
    Mao, Qi
    Zhao, Zhenghui
    Wang, Shanshe
    Wang, Shiqi
    Zhu, Hong
    Ma, Siwei
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 694 - 698
  • [34] Deep semantic image compression via cooperative network pruning
    Luo, Sihui
    Fang, Gongfan
    Song, Mingli
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [35] Unsupervised Image Enhancement Method Based on Attention Map Network Guidance and Attention Mechanism
    Wu, Mengfei
    Lan, Taiji
    Xue, Xucheng
    Xu, Xinwei
    ELECTRONICS, 2023, 12 (08)
  • [36] CNN-Based DCT-Like Transform for Image Compression
    Liu, Dong
    Ma, Haichuan
    Xiong, Zhiwei
    Wu, Feng
    MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 61 - 72
  • [37] Feedback Attention-Based Dense CNN for Hyperspectral Image Classification
    Yu, Chunyan
    Han, Rui
    Song, Meiping
    Liu, Caiyu
    Chang, Chein-, I
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [38] Coupled Squeeze-and-Excitation Blocks Based CNN for Image Compression
    Du, Jing
    Xu, Yang
    Wei, Zhihui
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 201 - 212
  • [39] Optimization of Remote Desktop with CNN-based Image Compression Model
    Wang, Hejun
    Dai, Hongjun
    Qiu, Meikang
    Liu, Meiqin
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 692 - 703
  • [40] Light Field Image Super-Resolution via Mutual Attention Guidance
    Wang, Zijian
    Lu, Yao
    IEEE ACCESS, 2021, 9 : 129022 - 129031