GLFuse: A Global and Local Four-Branch Feature Extraction Network for Infrared and Visible Image Fusion

被引:0
|
作者
Zhao, Genping [1 ]
Hu, Zhuyong [1 ]
Feng, Silu [2 ]
Wang, Zhuowei [1 ]
Wu, Heng [3 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Guangdong Univ Technol, Sch Integrated Circuits, Guangzhou 510006, Peoples R China
[3] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
image fusion; infrared and visible image fusion; global and local feature extraction; attention mechanism; deep learning; PERFORMANCE; NEST;
D O I
10.3390/rs16173246
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Infrared and visible image fusion integrates complementary information from different modalities into a single image, providing sufficient imaging information for scene interpretation and downstream target recognition tasks. However, existing fusion methods often focus only on highlighting salient targets or preserving scene details, failing to effectively combine entire features from different modalities during the fusion process, resulting in underutilized features and poor overall fusion effects. To address these challenges, a global and local four-branch feature extraction image fusion network (GLFuse) is proposed. On one hand, the Super Token Transformer (STT) block, which is capable of rapidly sampling and predicting super tokens, is utilized to capture global features in the scene. On the other hand, a Detail Extraction Block (DEB) is developed to extract local features in the scene. Additionally, two feature fusion modules, namely the Attention-based Feature Selection Fusion Module (ASFM) and the Dual Attention Fusion Module (DAFM), are designed to facilitate selective fusion of features from different modalities. Of more importance, the various perceptual information of feature maps learned from different modality images at the different layers of a network is investigated to design a perceptual loss function to better restore scene detail information and highlight salient targets by treating the perceptual information separately. Extensive experiments confirm that GLFuse exhibits excellent performance in both subjective and objective evaluations. It deserves note that GLFuse effectively improves downstream target detection performance on a unified benchmark.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] Infrared and Visible Image Fusion Based on Sparse Feature
    Ding Wen-shan
    Bi Du-yan
    He Lin-yuan
    Fan Zun-lin
    Wu Dong-peng
    ACTA PHOTONICA SINICA, 2018, 47 (09)
  • [22] Fusion network for local and global features extraction for hyperspectral image classification
    Gao, Hongmin
    Wu, Hongyi
    Chen, Zhonghao
    Zhang, Yiyan
    Xu, Shufang
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (10) : 3843 - 3867
  • [23] An infrared and visible image fusion network based on multi-scale feature cascades and non-local attention
    Xu, Jing
    Liu, Zhenjin
    Fang, Ming
    IET IMAGE PROCESSING, 2024, 18 (08) : 2114 - 2125
  • [24] Dual-Attention-Based Feature Aggregation Network for Infrared and Visible Image Fusion
    Tang, Zhimin
    Xiao, Guobao
    Guo, Junwen
    Wang, Shiping
    Ma, Jiayi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [25] Global and local feature fusion image dehazing
    Jiang X.
    Nie H.
    Zhu M.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (18): : 2687 - 2699
  • [26] THFuse: An infrared and visible image fusion network using transformer and hybrid feature extractor
    Chen, Jun
    Ding, Jianfeng
    Yu, Yang
    Gong, Wenping
    NEUROCOMPUTING, 2023, 527 : 71 - 82
  • [27] Double-Branch Local Context Feature Extraction Network for Hyperspectral Image Classification
    Cui, Ying
    Li, Wenshan
    Chen, Liwei
    Gao, Shan
    Wang, Liguo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [28] Local Saliency Extraction for Fusion of Visible and Infrared Images
    Hua, Weiping
    Zhao, Jufeng
    Cui, Guangmang
    Gong, Xiaoli
    Zhu, Liyao
    COMPUTER VISION, PT II, 2017, 772 : 210 - 221
  • [29] Infrared and visible image fusion in a rolling guided filtering framework based on deep feature extraction
    Cheng, Wei
    Lin, Bing
    Cheng, Liming
    Cui, Yong
    WIRELESS NETWORKS, 2024, 30 (9) : 7561 - 7568
  • [30] A global-local feature adaptive fusion network for image scene classification
    Lv, Guangrui
    Dong, Lili
    Zhang, Wenwen
    Xu, Wenhai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 6521 - 6554