HDCCT: Hybrid Densely Connected CNN and Transformer for Infrared and Visible Image Fusion

被引：0

作者：

Li, Xue ^{[1
]}

He, Hui ^{[2
]}

Shi, Jin ^{[3
]}

机构：

[1] Shandong Jiaotong Univ, Sch Rail Transportat, Jinan 250357, Peoples R China

[2] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China

[3] CRSC Res & Design Inst Grp Co Ltd, Beijing 100070, Peoples R China

来源：

ELECTRONICS | 2024年 / 13卷 / 17期

关键词：

convolutional neural network (CNN); transformer; image fusion; encoder-decoder architecture; global and local information; GENERATIVE ADVERSARIAL NETWORK; PERFORMANCE; NEST; GAN;

D O I：

10.3390/electronics13173470

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-modal image fusion is a methodology that combines image features from multiple types of sensors, effectively improving the quality and content of fused images. However, most existing deep learning fusion methods need to integrate global or local features, restricting the representation of feature information. To address this issue, a hybrid densely connected CNN and transformer (HDCCT) fusion framework is proposed. In the proposed HDCCT framework, the network of the CNN-based blocks obtain the local structure of the input data, and the transformer-based blocks obtain the global structure of the original data, significantly improving the feature representation. In the fused image, the proposed encoder-decoder architecture is designed for both the CNN and transformer blocks to reduce feature loss while preserving the characterization of all-level features. In addition, the cross-coupled framework facilitates the flow of feature structures, retains the uniqueness of information, and makes the transform model long-range dependencies based on the local features already extracted by the CNN. Meanwhile, to retain the information in the source images, the hybrid structural similarity (SSIM) and mean square error (MSE) loss functions are introduced. The qualitative and quantitative comparisons of grayscale images with infrared and visible image fusion indicate that the suggested method outperforms related works.

引用

页数：17

共 50 条

[1] HDCTfusion: Hybrid Dual-Branch Network Based on CNN and Transformer for Infrared and Visible Image Fusion
Wang, Wenqing
Li, Lingzhou
Yang, Yifei
Liu, Han
Guo, Runyuan
[J]. Sensors, 2024, 24 (23)
[2] Multi-modal medical image fusion based on densely-connected high-resolution CNN and hybrid transformer
Quan Zhou
Shaozhuang Ye
Mingwei Wen
Zhiwen Huang
Mingyue Ding
Xuming Zhang
[J]. Neural Computing and Applications, 2022, 34 : 21741 - 21761
[3] Multi-modal medical image fusion based on densely-connected high-resolution CNN and hybrid transformer
Zhou, Quan
Ye, Shaozhuang
Wen, Mingwei
Huang, Zhiwen
Ding, Mingyue
Zhang, Xuming
[J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24): : 21741 - 21761
[4] Infrared and Visible Image Fusion Based on Autoencoder Composed of CNN-Transformer
Wang, Hongmei
Li, Lin
Li, Chenkai
Lu, Xuanyu
[J]. IEEE ACCESS, 2023, 11 : 78956 - 78969
[5] UNFusion: A Unified Multi-Scale Densely Connected Network for Infrared and Visible Image Fusion
Wang, Zhishe
Wang, Junyao
Wu, Yuanyuan
Xu, Jiawei
Zhang, Xiaoqin
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3360 - 3374
[6] Infrared and visible image fusion algorithm based on a cross-layer densely connected convolutional network
Yu, Ruixing
Chen, Weiyu
Zhu, Bing
[J]. APPLIED OPTICS, 2022, 61 (11) : 3107 - 3114
[7] Unsupervised densely attention network for infrared and visible image fusion
Li, Yang
Wang, Jixiao
Miao, Zhuang
Wang, Jiabao
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 34685 - 34696
[8] Unsupervised densely attention network for infrared and visible image fusion
Yang Li
Jixiao Wang
Zhuang Miao
Jiabao Wang
[J]. Multimedia Tools and Applications, 2020, 79 : 34685 - 34696
[9] Semantic perceptive infrared and visible image fusion Transformer
Yang, Xin
Huo, Hongtao
Li, Chang
Liu, Xiaowen
Wang, Wenxi
Wang, Cheng
[J]. PATTERN RECOGNITION, 2024, 149
[10] ITFuse: An interactive transformer for infrared and visible image fusion
Tang, Wei
He, Fazhi
Liu, Yu
[J]. PATTERN RECOGNITION, 2024, 156

← 1 2 3 4 5 →