MRASFusion: A multi-scale residual attention infrared and visible image fusion network based on semantic segmentation guidance

被引:1
|
作者
An, Rongsheng [1 ]
Liu, Gang [1 ]
Qian, Yao [1 ]
Xing, Mengliang [1 ]
Tang, Haojie [1 ]
机构
[1] Shanghai Univ Elect Power, Sch Automat Engn, Shanghai 200090, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-scale residual attention; Semantic segmentation; Swin transformer; Image fusion; PERFORMANCE;
D O I
10.1016/j.infrared.2024.105343
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
To address the challenges of inadequate preservation of prominent targets, poor retention of texture details, and unsatisfactory reconstruction of image backgrounds in image fusion. In this paper, a multi-scale residual attention fusion network based on semantic segmentation guidance is proposed, termed as MRASFusion. First of all, Swin Transformer segmentation mask with high precision, and strong scalability is adopted to avoid the inefficiency and error of manual segmentation mask. The mask generated by semantic segmentation is used to construct a loss function to guide the image fusion process. Secondly, in order to maintain the integrity of contextual information and texture details, a new feature extraction module is proposed to fully extract the meaningful features. Finally, the fused image is obtained by reconstructing the extracted features. To verify the effectiveness of the method, MRASFusion is qualitatively and quantitatively compared with nine state-of-the-art fusion methods on TNO and RoadScene datasets. Experimental results indicate that our method has demonstrated satisfactory performance in image fusion tasks, exhibiting superior capabilities in preserving target information and retaining texture details. Furthermore, our fusion results have brought some performance improvements for advanced vision tasks, i.e., improved accuracy for the object detection, which provides a better foundation for solving real-world problems.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Infrared and visible image fusion based on multi-scale dense attention connection network
    Chen, Yong
    Zhang, Jiaojiao
    Wang, Zhen
    [J]. Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (18): : 2253 - 2266
  • [2] Deep Neural Network for Infrared and Visible Image Fusion Based on Multi-scale Decomposition and Interactive Residual Coordinate Attention
    Zong, Sha
    Xie, Zhihua
    Li, Qiang
    Liu, Guodong
    [J]. ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 254 - 262
  • [3] MGRCFusion: An infrared and visible image fusion network based on multi-scale group residual convolution
    Zhu, Pan
    Yin, Yufei
    Zhou, Xinglin
    [J]. OPTICS AND LASER TECHNOLOGY, 2025, 180
  • [4] Multi-scale unsupervised network for infrared and visible image fusion based on joint attention mechanism
    Xu, Dongdong
    Zhang, Ning
    Zhang, Yuxi
    Li, Zheng
    Zhao, Zhikang
    Wang, Yongcheng
    [J]. Infrared Physics and Technology, 2022, 125
  • [5] Multi-scale unsupervised network for infrared and visible image fusion based on joint attention mechanism
    Xu, Dongdong
    Zhang, Ning
    Zhang, Yuxi
    Li, Zheng
    Zhao, Zhikang
    Wang, Yongcheng
    [J]. INFRARED PHYSICS & TECHNOLOGY, 2022, 125
  • [6] Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion
    Fuquan Li
    Yonghui Zhou
    YanLi Chen
    Jie Li
    ZhiCheng Dong
    Mian Tan
    [J]. Complex & Intelligent Systems, 2024, 10 : 705 - 719
  • [7] Multi-scale attention-based lightweight network with dilated convolutions for infrared and visible image fusion
    Li, Fuquan
    Zhou, Yonghui
    Chen, YanLi
    Li, Jie
    Dong, ZhiCheng
    Tan, Mian
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 705 - 719
  • [8] Semantic segmentation network for remote sensing image based on multi-scale mutual attention
    Liu, Chun-Juan
    Qiao, Ze
    Yan, Hao-Wen
    Wu, Xiao-Suo
    Wang, Jia-Wei
    Xin, Yu-Qiang
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (07): : 1335 - 1344
  • [9] An infrared and visible image fusion network based on multi-scale feature cascades and non-local attention
    Xu, Jing
    Liu, Zhenjin
    Fang, Ming
    [J]. IET IMAGE PROCESSING, 2024, 18 (08) : 2114 - 2125
  • [10] Semantic Segmentation Method Based on Residual and Multi-Scale Feature Fusion
    Xiu, Chunbo
    Su, Huan
    Su, Xuemiao
    [J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 2078 - 2083