MPCFusion: Multi-scale parallel cross fusion for infrared and visible images via convolution and vision Transformer

Cited by: 7
Authors
Tang, Haojie [1]
Qian, Yao [1]
Xing, Mengliang [1]
Cao, Yisheng [1]
Liu, Gang [1]
Affiliation
[1] Shanghai Univ Elect Power, Sch Automat Engn, Shanghai 200090, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Image fusion; Vision Transformer; Convolution; Multi-scale feature; Infrared; NETWORK;
DOI
10.1016/j.optlaseng.2024.108094
Chinese Library Classification number
O43 [Optics];
Discipline classification code
070207; 0803;
Abstract
The image fusion community is thriving with the wave of deep learning, and the most popular fusion methods are usually built upon well-designed network structures. However, most current methods do not fully exploit deeper features and ignore the importance of long-range dependencies. In this paper, a convolution and vision Transformer-based multi-scale parallel cross fusion network for infrared and visible images (MPCFusion) is proposed. To exploit deeper texture details, a feature extraction module based on convolution and vision Transformer is designed. To correlate the shallow features of the different modalities, a parallel cross-attention module is proposed, in which a parallel-channel model efficiently preserves the modality-specific features, followed by a cross-spatial model that ensures information interaction between the modalities. Moreover, a cross-domain attention module based on convolution and vision Transformer is proposed to capture long-range dependencies between in-depth features and effectively solve the problem of global context loss. Finally, a nest-connection-based decoder is used for feature reconstruction. In particular, we design a new texture-guided structural similarity loss function to drive the network to preserve more complete texture details. Extensive experimental results illustrate that MPCFusion shows excellent fusion performance and generalization capabilities. The source code will be released at https://github.com/YQ-097/MPCFusion.
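The information interaction between modalities mentioned in the abstract can be illustrated with generic scaled dot-product cross-attention, where queries come from one modality and keys and values from the other. The following is only a minimal sketch in plain Python, not the authors' module: it omits the learned projections, the parallel-channel branch, and the convolutional parts, and the names `cross_attention`, `ir_tokens`, and `vis_tokens` as well as the toy feature values are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Generic scaled dot-product cross-attention (no learned weights).

    queries: tokens of one modality, each a list of d floats.
    keys, values: tokens of the other modality (same count as each other).
    Returns one output vector per query, a weighted mix of `values`.
    """
    d = len(queries[0])
    scale = 1.0 / math.sqrt(d)
    out_dim = len(values[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key of the other modality.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        weights = softmax(scores)
        # Weighted sum of the other modality's value vectors.
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, values)) for j in range(out_dim)]
        )
    return outputs


# Hypothetical toy features: 2 infrared tokens and 3 visible tokens, d = 2.
ir_tokens = [[1.0, 0.0], [0.0, 1.0]]
vis_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Each modality queries the other, so information flows in both directions.
ir_attends_vis = cross_attention(ir_tokens, vis_tokens, vis_tokens)
vis_attends_ir = cross_attention(vis_tokens, ir_tokens, ir_tokens)
```

Because every output is a softmax-weighted average of the other modality's tokens, each position can draw on all positions of the other image, which is the kind of long-range, cross-modal dependency that plain convolutions with small kernels do not capture directly.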
Pages: 13