Infrared and Visible Image Fusion Based on Autoencoder Composed of CNN-Transformer

Cited by: 2
Authors
Wang, Hongmei [1]
Li, Lin [2]
Li, Chenkai [1]
Lu, Xuanyu [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China
[2] Beijing Res Inst Telemetry, Beijing 100076, Peoples R China
Keywords
Image fusion; convolutional neural network; transformer; infrared image; visible image; MULTISCALE TRANSFORM; NETWORK; NEST;
DOI
10.1109/ACCESS.2023.3298437
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Image fusion models based on autoencoder networks have attracted increasing attention because they do not require manually designed fusion rules. However, most autoencoder-based fusion networks use two-stream CNNs with identical structure as the encoder. Such encoders cannot extract global features, because convolution has a local receptive field, and they lack the ability to extract features unique to infrared and visible images. This paper constructs a novel autoencoder-based image fusion network that consists of an encoder module, a fusion module, and a decoder module. In the encoder module, a CNN and a Transformer are combined to capture the local and global features of the source images simultaneously. In addition, novel contrast enhancement and gradient enhancement feature extraction blocks are designed for infrared and visible images, respectively, to preserve the information specific to each source image. The feature maps obtained from the encoder module are concatenated by the fusion module and fed to the decoder module to obtain the fused image. Experimental results on three datasets show that the proposed network better preserves both the clear targets of infrared images and the detailed information of visible images, and that it outperforms some state-of-the-art methods in both subjective and objective evaluation. In addition, the fused images produced by the proposed network achieve the highest mean average precision in target detection, which demonstrates that image fusion is beneficial for downstream tasks.
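The data flow described in the abstract (modality-specific encoders, fusion by concatenation, then decoding) can be illustrated with a minimal NumPy sketch. This is not the authors' network: every function name here is hypothetical, and each learned component is replaced by a fixed stand-in chosen only for illustration. A 3x3 mean filter stands in for the CNN branch, single-head self-attention over raw pixel intensities for the Transformer branch, a local-mean residual for the contrast enhancement block, a finite-difference gradient magnitude for the gradient enhancement block, and a channel average for the decoder.

```python
import numpy as np


def local_features(img):
    """Stand-in for the CNN branch: a fixed 3x3 mean filter (local receptive field)."""
    img = img.astype(float)
    padded = np.pad(img, 1, mode="edge")
    h, w = img.shape
    out = np.zeros((h, w))
    for dy in range(3):
        for dx in range(3):
            out += padded[dy:dy + h, dx:dx + w]
    return out / 9.0


def global_features(img):
    """Stand-in for the Transformer branch: self-attention across all pixels."""
    h, w = img.shape
    tokens = img.reshape(-1, 1).astype(float)      # one token per pixel, shape (N, 1)
    scores = tokens @ tokens.T                     # pairwise similarities, shape (N, N)
    scores -= scores.max(axis=1, keepdims=True)    # stabilise the softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    return (weights @ tokens).reshape(h, w)        # each pixel attends to every pixel


def contrast_feature(img):
    """Assumed stand-in for contrast enhancement: deviation from the local mean."""
    return img.astype(float) - local_features(img)


def gradient_feature(img):
    """Assumed stand-in for gradient enhancement: finite-difference gradient magnitude."""
    gy, gx = np.gradient(img.astype(float))
    return np.hypot(gx, gy)


def encode_infrared(img):
    """Infrared encoder: local + global + contrast channels, shape (3, H, W)."""
    return np.stack([local_features(img), global_features(img), contrast_feature(img)])


def encode_visible(img):
    """Visible encoder: local + global + gradient channels, shape (3, H, W)."""
    return np.stack([local_features(img), global_features(img), gradient_feature(img)])


def fuse(feat_ir, feat_vis):
    """Fusion module: concatenate feature maps along the channel axis."""
    return np.concatenate([feat_ir, feat_vis], axis=0)


def decode(fused):
    """Stand-in decoder: average the channels back into a single image."""
    return fused.mean(axis=0)


# Usage with random stand-in images (real inputs would be registered IR/visible pairs).
rng = np.random.default_rng(0)
ir = rng.random((8, 8))
vis = rng.random((8, 8))
fused_image = decode(fuse(encode_infrared(ir), encode_visible(vis)))
print(fused_image.shape)
```

The point of the sketch is structural: because fusion is a plain channel concatenation, no hand-written fusion rule appears anywhere, and in a trained network the decoder would learn how to weight the concatenated channels.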
Pages: 78956-78969
Number of pages: 14