Infrared and Visible Image Fusion Based on Autoencoder Composed of CNN-Transformer

被引:4
|
作者
Wang, Hongmei [1 ]
Li, Lin [2 ]
Li, Chenkai [1 ]
Lu, Xuanyu [1 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China
[2] Beijing Res Inst Telemetry, Beijing 100076, Peoples R China
关键词
Image fusion; convolutional neural network; transformer; infrared image; visible image; MULTISCALE TRANSFORM; NETWORK; NEST;
D O I
10.1109/ACCESS.2023.3298437
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image fusion model based on autoencoder network gets more attention because it does not need to design fusion rules manually. However, most autoencoder-based fusion networks use two-stream CNNs with the same structure as the encoder, which are unable to extract global features due to the local receptive field of convolutional operations and lack the ability to extract unique features from infrared and visible images. A novel autoencoder-based image fusion network which consist of encoder module, fusion module and decoder module is constructed in this paper. For the encoder module, the CNN and Transformer are combined to capture the local and global feature of the source images simultaneously. In addition, novel contrast and gradient enhancement feature extraction blocks are designed respectively for infrared and visible images to maintain the information specific to each source images. The feature images obtained from encoder module are concatenated by the fusion module and input to the decoder module to obtain the fused image. Experimental results on three datasets show that the proposed network can better preserve both the clear target and detailed information of infrared and visible images respectively, and outperforms some state-of-the-art methods in both subjective and objective evaluation. At the same time, the fused image obtained by our proposed network can acquire the highest mean average precision in the target detection which proves that image fusion is beneficial for downstream tasks.
引用
收藏
页码:78956 / 78969
页数:14
相关论文
共 50 条
  • [1] MATCNN: Infrared and Visible Image Fusion Method Based on Multiscale CNN With Attention Transformer
    Liu, Jingjing
    Zhang, Li
    Zeng, Xiaoyang
    Liu, Wanquan
    Zhang, Jianhua
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [2] Hybrid CNN-Transformer Feature Fusion for Single Image Deraining
    Chen, Xiang
    Pan, Jinshan
    Lu, Jiyang
    Fan, Zhentao
    Li, Hao
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 378 - 386
  • [3] HDCCT: Hybrid Densely Connected CNN and Transformer for Infrared and Visible Image Fusion
    Li, Xue
    He, Hui
    Shi, Jin
    ELECTRONICS, 2024, 13 (17)
  • [4] HDCTfusion: Hybrid Dual-Branch Network Based on CNN and Transformer for Infrared and Visible Image Fusion
    Wang, Wenqing
    Li, Lingzhou
    Yang, Yifei
    Liu, Han
    Guo, Runyuan
    SENSORS, 2024, 24 (23)
  • [5] CTFusion: CNN-transformer-based self-supervised learning for infrared and visible image fusion
    Du, Keying
    Fang, Liuyang
    Chen, Jie
    Chen, Dongdong
    Lai, Hua
    Mathematical Biosciences and Engineering, 2024, 21 (07) : 6710 - 6730
  • [6] A CNN-transformer fusion network for COVID-19 CXR image classification
    Cao, Kai
    Deng, Tao
    Zhang, Chuanlin
    Lu, Limeng
    Li, Lin
    PLOS ONE, 2022, 17 (10):
  • [7] Image Deblurring Based on an Improved CNN-Transformer Combination Network
    Chen, Xiaolin
    Wan, Yuanyuan
    Wang, Donghe
    Wang, Yuqing
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [8] LOW LIGHT RGB AND IR IMAGE FUSION WITH SELECTIVE CNN-TRANSFORMER NETWORK
    Jin, Haiyan
    Yang, Yue
    Su, Haonan
    Xiao, Zhaolin
    Wang, Bin
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1255 - 1259
  • [9] A synergistic CNN-transformer network with pooling attention fusion for hyperspectral image classification
    Chen, Peng
    He, Wenxuan
    Qian, Feng
    Shi, Guangyao
    Yan, Jingwen
    DIGITAL SIGNAL PROCESSING, 2025, 160
  • [10] Harmful Cyanobacterial Blooms forecasting based on improved CNN-Transformer and Temporal Fusion Transformer
    Ahn, Jung Min
    Kim, Jungwook
    Kim, Hongtae
    Kim, Kyunghyun
    ENVIRONMENTAL TECHNOLOGY & INNOVATION, 2023, 32