HDCCT: Hybrid Densely Connected CNN and Transformer for Infrared and Visible Image Fusion

被引:0
|
作者
Li, Xue [1 ]
He, Hui [2 ]
Shi, Jin [3 ]
机构
[1] Shandong Jiaotong Univ, Sch Rail Transportat, Jinan 250357, Peoples R China
[2] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
[3] CRSC Res & Design Inst Grp Co Ltd, Beijing 100070, Peoples R China
关键词
convolutional neural network (CNN); transformer; image fusion; encoder-decoder architecture; global and local information; GENERATIVE ADVERSARIAL NETWORK; PERFORMANCE; NEST; GAN;
D O I
10.3390/electronics13173470
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-modal image fusion is a methodology that combines image features from multiple types of sensors, effectively improving the quality and content of fused images. However, most existing deep learning fusion methods need to integrate global or local features, restricting the representation of feature information. To address this issue, a hybrid densely connected CNN and transformer (HDCCT) fusion framework is proposed. In the proposed HDCCT framework, the network of the CNN-based blocks obtain the local structure of the input data, and the transformer-based blocks obtain the global structure of the original data, significantly improving the feature representation. In the fused image, the proposed encoder-decoder architecture is designed for both the CNN and transformer blocks to reduce feature loss while preserving the characterization of all-level features. In addition, the cross-coupled framework facilitates the flow of feature structures, retains the uniqueness of information, and makes the transform model long-range dependencies based on the local features already extracted by the CNN. Meanwhile, to retain the information in the source images, the hybrid structural similarity (SSIM) and mean square error (MSE) loss functions are introduced. The qualitative and quantitative comparisons of grayscale images with infrared and visible image fusion indicate that the suggested method outperforms related works.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] HDCTfusion: Hybrid Dual-Branch Network Based on CNN and Transformer for Infrared and Visible Image Fusion
    Wang, Wenqing
    Li, Lingzhou
    Yang, Yifei
    Liu, Han
    Guo, Runyuan
    [J]. Sensors, 2024, 24 (23)
  • [2] Multi-modal medical image fusion based on densely-connected high-resolution CNN and hybrid transformer
    Quan Zhou
    Shaozhuang Ye
    Mingwei Wen
    Zhiwen Huang
    Mingyue Ding
    Xuming Zhang
    [J]. Neural Computing and Applications, 2022, 34 : 21741 - 21761
  • [3] Multi-modal medical image fusion based on densely-connected high-resolution CNN and hybrid transformer
    Zhou, Quan
    Ye, Shaozhuang
    Wen, Mingwei
    Huang, Zhiwen
    Ding, Mingyue
    Zhang, Xuming
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24): : 21741 - 21761
  • [4] Infrared and Visible Image Fusion Based on Autoencoder Composed of CNN-Transformer
    Wang, Hongmei
    Li, Lin
    Li, Chenkai
    Lu, Xuanyu
    [J]. IEEE ACCESS, 2023, 11 : 78956 - 78969
  • [5] UNFusion: A Unified Multi-Scale Densely Connected Network for Infrared and Visible Image Fusion
    Wang, Zhishe
    Wang, Junyao
    Wu, Yuanyuan
    Xu, Jiawei
    Zhang, Xiaoqin
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3360 - 3374
  • [6] Infrared and visible image fusion algorithm based on a cross-layer densely connected convolutional network
    Yu, Ruixing
    Chen, Weiyu
    Zhu, Bing
    [J]. APPLIED OPTICS, 2022, 61 (11) : 3107 - 3114
  • [7] Unsupervised densely attention network for infrared and visible image fusion
    Li, Yang
    Wang, Jixiao
    Miao, Zhuang
    Wang, Jiabao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 34685 - 34696
  • [8] Unsupervised densely attention network for infrared and visible image fusion
    Yang Li
    Jixiao Wang
    Zhuang Miao
    Jiabao Wang
    [J]. Multimedia Tools and Applications, 2020, 79 : 34685 - 34696
  • [9] Semantic perceptive infrared and visible image fusion Transformer
    Yang, Xin
    Huo, Hongtao
    Li, Chang
    Liu, Xiaowen
    Wang, Wenxi
    Wang, Cheng
    [J]. PATTERN RECOGNITION, 2024, 149
  • [10] ITFuse: An interactive transformer for infrared and visible image fusion
    Tang, Wei
    He, Fazhi
    Liu, Yu
    [J]. PATTERN RECOGNITION, 2024, 156