A dual-branch infrared and visible image fusion network using progressive image-wise feature transfer

Cited by: 0
Authors
Xu, Shaoping [1]
Zhou, Changfei [1]
Xiao, Jian [1]
Tao, Wuyong [1]
Dai, Tianyu [1]
Affiliations
[1] Nanchang Univ, Sch Math & Comp Sci, Nanchang 330031, Jiangxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Infrared and visible image fusion; Dual-branch fusion network; Progressive image-wise feature transfer; Transformer module; CLIP loss; NEST
DOI
10.1016/j.jvcir.2024.104190
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
To achieve a fused image that contains rich texture details and prominent targets, we present a progressive dual-branch infrared and visible image fusion network called PDFusion, which incorporates the Transformer module. Initially, the proposed network is divided into two branches to extract infrared and visible features independently. Subsequently, the image-wise transfer block (ITB) is introduced to fuse the infrared and visible features at different layers, facilitating the exchange of information between features. The fused features are then fed back into both pathways to contribute to the subsequent feature extraction process. Moreover, in addition to conventional pixel-level and structured loss functions, the contrastive language-image pretraining (CLIP) loss is introduced to guide the network training. Experimental results on publicly available datasets demonstrate the promising performance of PDFusion in the task of infrared and visible image fusion. The exceptional fusion performance of the proposed fusion network can be attributed to the following reasons: (1) The ITB block, particularly with the integration of the Transformer, enhances the capability of representation learning. The Transformer module captures long-range dependencies among image features, enabling a global receptive field that integrates contextual information from the entire image. This leads to a more comprehensive fusion of features. (2) The feature loss based on the CLIP image encoder minimizes the discrepancy between the generated and target images. Consequently, it promotes the generation of semantically coherent and visually appealing fused images. The source code of our method can be found at https://github.com/Changfei-Zhou/PDFusion.
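The data flow described in the abstract (two independent branches, fusion at each layer, fused features fed back into both pathways, and a feature-space loss) can be illustrated with a small toy sketch. This is not the authors' implementation: the function names `progressive_dual_branch`, `feature_loss`, `double`, and `average` are hypothetical, features are plain lists of floats rather than image tensors, the averaging `fuse` function merely stands in for the Transformer-based ITB, feedback by element-wise addition is an assumption, and the mean squared distance used for the feature loss is an assumption because the abstract does not state the metric.

```python
from typing import Callable, List, Sequence

Vector = List[float]
Layer = Callable[[Vector], Vector]
FuseFn = Callable[[Vector, Vector], Vector]


def progressive_dual_branch(
    ir: Vector,
    vis: Vector,
    ir_layers: Sequence[Layer],
    vis_layers: Sequence[Layer],
    fuse: FuseFn,
) -> Vector:
    """Run two branches layer by layer, feeding fused features back into both."""
    if len(ir_layers) != len(vis_layers):
        raise ValueError("both branches need the same number of layers")
    fused = [0.0] * len(ir)
    for ir_layer, vis_layer in zip(ir_layers, vis_layers):
        # Each branch extracts features from its own modality.
        ir = ir_layer(ir)
        vis = vis_layer(vis)
        # Stand-in for the ITB: exchange information between the branches.
        fused = fuse(ir, vis)
        # Assumed feedback rule: add the fused features to both pathways
        # so they contribute to the next extraction stage.
        ir = [a + f for a, f in zip(ir, fused)]
        vis = [b + f for b, f in zip(vis, fused)]
    return fused


def feature_loss(
    embed: Callable[[Vector], Vector], generated: Vector, target: Vector
) -> float:
    """Mean squared distance between embeddings (assumed metric).

    `embed` is a placeholder for a frozen image encoder such as CLIP's.
    """
    g, t = embed(generated), embed(target)
    return sum((a - b) ** 2 for a, b in zip(g, t)) / len(g)


def double(x: Vector) -> Vector:
    """Toy feature extractor: scale every element by two."""
    return [2.0 * v for v in x]


def average(a: Vector, b: Vector) -> Vector:
    """Toy fusion rule: element-wise mean of the two branches."""
    return [(p + q) / 2.0 for p, q in zip(a, b)]


if __name__ == "__main__":
    out = progressive_dual_branch(
        [1.0, 2.0], [3.0, 4.0], [double, double], [double, double], average
    )
    print(out)
    print(feature_loss(lambda x: x, [1.0, 2.0], [1.0, 4.0]))
```

In a real network the layers would be convolutional or Transformer blocks and `embed` would be a pretrained image encoder; the sketch only shows the order of operations: extract, fuse, feed back, repeat.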
Pages: 11