MPCFusion: Multi-scale parallel cross fusion for infrared and visible images via convolution and vision Transformer

被引:7
|
作者
Tang, Haojie [1 ]
Qian, Yao [1 ]
Xing, Mengliang [1 ]
Cao, Yisheng [1 ]
Liu, Gang [1 ]
机构
[1] Shanghai Univ Elect Power, Sch Automat Engn, Shanghai 200090, Peoples R China
基金
中国国家自然科学基金;
关键词
Image fusion; Vision Transformer; Convolution; Multi-scale feature; Infrared; NETWORK;
D O I
10.1016/j.optlaseng.2024.108094
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
The image fusion community is thriving with the wave of deep learning, and the most popular fusion methods are usually built upon well -designed network structures. However, most of the current methods do not fully exploit deeper features while ignore the importance of long-range dependencies. In this paper, a convolution and vision Transformer -based multi -scale parallel cross fusion network for infrared and visible images is proposed (MPCFusion). To exploit deeper texture details, a feature extraction module based on convolution and vision Transformer is designed. With a view to correlating the shallow features between different modalities, a parallel cross -attention module is proposed, in which a parallel -channel model efficiently preserves the proprietary modal features, followed by a cross -spatial model that ensures the information interactions between the different modalities. Moreover, a cross -domain attention module based on convolution and vision Transformer is proposed to capturing long-range dependencies between in-depth features and effectively solves the problem of global context loss. Finally, a nest -connection based decoder is used for implementing feature reconstruction. In particular, we design a new texture -guided structural similarity loss function to drive the network to preserve more complete texture details. Extensive experimental results illustrate that MPCFusion shows excellent fusion performance and generalization capabilities. The source code will be released at https:// github .com /YQ -097 /MPCFusion.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Multi-scale transformer network for super-resolution of visible and thermal air images
    Fkih, Hedi
    Kallel, Abdelaziz
    Chtourou, Zied
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 23
  • [42] Multi-scale Dilated Convolution Transformer for Single Image Deraining
    Wu, Xianhao
    JiyangLu
    Wu, Jindi
    Li, Yufeng
    2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2023,
  • [43] MIAFusion: Infrared and Visible Image Fusion via Multi-scale Spatial and Channel-Aware Interaction Attention
    Lin, Teng
    Lu, Ming
    Jiang, Min
    Kong, Jun
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII, 2025, 15038 : 238 - 251
  • [44] Infrared and visible image fusion via saliency analysis and local edge-preserving multi-scale decomposition
    Zhang, Xiaoye
    Ma, Yong
    Fan, Fan
    Zhang, Ying
    Huang, Jun
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2017, 34 (08) : 1400 - 1410
  • [45] Fusion of infrared intensity and polarization images using embedded multi-scale transform
    Lin, Su-zhen
    Wang, Dong-juan
    Zhu, Xiao-hong
    Zhang, Shang-min
    OPTIK, 2015, 126 (24): : 5127 - 5133
  • [46] Multi-Scale Vision Transformer for Defect Object Detection
    Lou, Liangshan
    Lu, Ke
    Xue, Jian
    Procedia Computer Science, 2023, 222 : 397 - 406
  • [47] Fusion of near-infrared and visible images based on saliency-map-guided multi-scale transformation decomposition
    Jun, Chen
    Lei, Cai
    Wei, Liu
    Yang, Yu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (22) : 34631 - 34651
  • [48] Fusion of near-infrared and visible images based on saliency-map-guided multi-scale transformation decomposition
    Chen Jun
    Cai Lei
    Liu Wei
    Yu Yang
    Multimedia Tools and Applications, 2023, 82 : 34631 - 34651
  • [49] Brain magnetic resonance image registration based on parallel lightweight convolution and multi-scale fusion
    Shen Y.
    Yan Y.
    Song J.
    Liu G.
    Xu J.
    Wei Z.
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2024, 41 (02): : 213 - 219
  • [50] A fusion framework with multi-scale convolution and triple-branch cascaded transformer for underwater image enhancement
    Xiang, Dan
    Zhou, Zebin
    Yang, Wenlei
    Wang, Huihua
    Gao, Pan
    Xiao, Mingming
    Zhang, Jinwen
    Zhu, Xing
    OPTICS AND LASERS IN ENGINEERING, 2025, 184