GLFuse: A Global and Local Four-Branch Feature Extraction Network for Infrared and Visible Image Fusion

被引:0
|
作者
Zhao, Genping [1 ]
Hu, Zhuyong [1 ]
Feng, Silu [2 ]
Wang, Zhuowei [1 ]
Wu, Heng [3 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Guangdong Univ Technol, Sch Integrated Circuits, Guangzhou 510006, Peoples R China
[3] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
image fusion; infrared and visible image fusion; global and local feature extraction; attention mechanism; deep learning; PERFORMANCE; NEST;
D O I
10.3390/rs16173246
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Infrared and visible image fusion integrates complementary information from different modalities into a single image, providing sufficient imaging information for scene interpretation and downstream target recognition tasks. However, existing fusion methods often focus only on highlighting salient targets or preserving scene details, failing to effectively combine entire features from different modalities during the fusion process, resulting in underutilized features and poor overall fusion effects. To address these challenges, a global and local four-branch feature extraction image fusion network (GLFuse) is proposed. On one hand, the Super Token Transformer (STT) block, which is capable of rapidly sampling and predicting super tokens, is utilized to capture global features in the scene. On the other hand, a Detail Extraction Block (DEB) is developed to extract local features in the scene. Additionally, two feature fusion modules, namely the Attention-based Feature Selection Fusion Module (ASFM) and the Dual Attention Fusion Module (DAFM), are designed to facilitate selective fusion of features from different modalities. Of more importance, the various perceptual information of feature maps learned from different modality images at the different layers of a network is investigated to design a perceptual loss function to better restore scene detail information and highlight salient targets by treating the perceptual information separately. Extensive experiments confirm that GLFuse exhibits excellent performance in both subjective and objective evaluations. It deserves note that GLFuse effectively improves downstream target detection performance on a unified benchmark.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] A global-local feature adaptive fusion network for image scene classification
    Guangrui Lv
    Lili Dong
    Wenwen Zhang
    Wenhai Xu
    Multimedia Tools and Applications, 2024, 83 : 6521 - 6554
  • [32] Graph Convolutional Network With Local and Global Feature Fusion for Hyperspectral Image Classification
    Wang, Yufan
    Yu, Xiaodong
    Dong, Hongbin
    Zang, Shuying
    IEEE Transactions on Geoscience and Remote Sensing, 2024, 62
  • [33] MJ-GAN: Generative Adversarial Network with Multi-Grained Feature Extraction and Joint Attention Fusion for Infrared and Visible Image Fusion
    Yang, Danqing
    Wang, Xiaorui
    Zhu, Naibo
    Li, Shuang
    Hou, Na
    SENSORS, 2023, 23 (14)
  • [34] IBFusion: An Infrared and Visible Image Fusion Method Based on Infrared Target Mask and Bimodal Feature Extraction Strategy
    Bai Y.
    Gao M.
    Li S.
    Wang P.
    Guan N.
    Yin H.
    Yan Y.
    IEEE Transactions on Multimedia, 2024, 26 : 1 - 13
  • [35] MSFNet: MultiStage Fusion Network for infrared and visible image fusion
    Wang, Chenwu
    Wu, Junsheng
    Zhu, Zhixiang
    Chen, Hao
    NEUROCOMPUTING, 2022, 507 : 26 - 39
  • [36] Facial feature extraction in an infrared image by proxy with a visible face image
    Wang, Jian-Gang
    Sung, Eric
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2007, 56 (05) : 2057 - 2066
  • [37] MCnet: Multiscale visible image and infrared image fusion network
    Sun, Le
    Li, Yuhang
    Zheng, Min
    Zhong, Zhaoyi
    Zhang, Yanchun
    SIGNAL PROCESSING, 2023, 208
  • [38] Review of Feature-Level Infrared and Visible Image Fusion
    Zhang, Honggang
    Yang, Haitao
    Zheng, Fengjie
    Wang, Jinyu
    Zhou, Xixuan
    Wang, Haoyu
    Xu, Yifan
    Computer Engineering and Applications, 2024, 60 (18) : 17 - 31
  • [39] An End-to-End Local-Global-Fusion Feature Extraction Network for Remote Sensing Image Scene Classification
    Lv, Yafei
    Zhang, Xiaohan
    Xiong, Wei
    Cui, Yaqi
    Cai, Mi
    REMOTE SENSING, 2019, 11 (24)
  • [40] MLFFusion: Multi-level feature fusion network with region illumination retention for infrared and visible image fusion
    Wang, Chuanyun
    Sun, Dongdong
    Gao, Qian
    Wang, Linlin
    Yan, Zhuo
    Wang, Jingjing
    Wang, Ershen
    Wang, Tian
    INFRARED PHYSICS & TECHNOLOGY, 2023, 134