Infrared and Visible Image Fusion Based on Autoencoder Composed of CNN-Transformer

被引:4
|
作者
Wang, Hongmei [1 ]
Li, Lin [2 ]
Li, Chenkai [1 ]
Lu, Xuanyu [1 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China
[2] Beijing Res Inst Telemetry, Beijing 100076, Peoples R China
关键词
Image fusion; convolutional neural network; transformer; infrared image; visible image; MULTISCALE TRANSFORM; NETWORK; NEST;
D O I
10.1109/ACCESS.2023.3298437
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image fusion model based on autoencoder network gets more attention because it does not need to design fusion rules manually. However, most autoencoder-based fusion networks use two-stream CNNs with the same structure as the encoder, which are unable to extract global features due to the local receptive field of convolutional operations and lack the ability to extract unique features from infrared and visible images. A novel autoencoder-based image fusion network which consist of encoder module, fusion module and decoder module is constructed in this paper. For the encoder module, the CNN and Transformer are combined to capture the local and global feature of the source images simultaneously. In addition, novel contrast and gradient enhancement feature extraction blocks are designed respectively for infrared and visible images to maintain the information specific to each source images. The feature images obtained from encoder module are concatenated by the fusion module and input to the decoder module to obtain the fused image. Experimental results on three datasets show that the proposed network can better preserve both the clear target and detailed information of infrared and visible images respectively, and outperforms some state-of-the-art methods in both subjective and objective evaluation. At the same time, the fused image obtained by our proposed network can acquire the highest mean average precision in the target detection which proves that image fusion is beneficial for downstream tasks.
引用
收藏
页码:78956 / 78969
页数:14
相关论文
共 50 条
  • [21] Image harmonization with Simple Hybrid CNN-Transformer Network
    Li, Guanlin
    Zhao, Bin
    Li, Xuelong
    NEURAL NETWORKS, 2024, 180
  • [22] Multi-Object Tracking Algorithm Based on CNN-Transformer Feature Fusion
    Zhang, Yingjun
    Bai, Xiaohui
    Xie, Binhong
    Computer Engineering and Applications, 2024, 60 (02) : 180 - 190
  • [23] Object Detection Algorithm Based on CNN-Transformer Dual Modal Feature Fusion
    Yang Chen
    Hou Zhiqiang
    Li Xinyue
    Ma Sugang
    Yang Xiaobao
    ACTA PHOTONICA SINICA, 2024, 53 (03)
  • [24] HCformer: Hybrid CNN-Transformer for LDCT Image Denoising
    Yuan, Jinli
    Zhou, Feng
    Guo, Zhitao
    Li, Xiaozeng
    Yu, Hengyong
    JOURNAL OF DIGITAL IMAGING, 2023, 36 (05) : 2290 - 2305
  • [25] EEG classification algorithm of motor imagery based on CNN-Transformer fusion network
    Liu, Haofeng
    Liu, Yuefeng
    Wang, Yue
    Liu, Bo
    Bao, Xiang
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1302 - 1309
  • [26] HCformer: Hybrid CNN-Transformer for LDCT Image Denoising
    Jinli Yuan
    Feng Zhou
    Zhitao Guo
    Xiaozeng Li
    Hengyong Yu
    Journal of Digital Imaging, 2023, 36 (5) : 2290 - 2305
  • [27] TCCFusion: An infrared and visible image fusion method based on transformer and cross correlation
    Tang, Wei
    He, Fazhi
    Liu, Yu
    PATTERN RECOGNITION, 2023, 137
  • [28] Image inpainting based on CNN-Transformer framework via structure and texture restoration
    Li, Zhan
    Han, Nan
    Wang, Yuning
    Zhang, Yanan
    Yan, Jing
    Du, Yingfei
    Geng, Guohua
    APPLIED SOFT COMPUTING, 2025, 170
  • [29] A CNN-transformer hybrid network with selective fusion and dual attention for image super-resolution
    Zhang, Chun
    Wang, Jin
    Shi, Yunhui
    Yin, Baocai
    Ling, Nam
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [30] PFCFuse: A Poolformer and CNN Fusion Network for Infrared-Visible Image Fusion
    Hu, Xinyu
    Liu, Yang
    Yang, Feng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73