YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer

被引:104
|
作者
Tang, Wei [1 ]
He, Fazhi [1 ]
Liu, Yu [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Hefei Univ Technol, Dept Biomed Engn, Hefei 230009, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Dynamic transformer; image fusion; infrared image; Y-shape network; NETWORK; PERFORMANCE;
D O I
10.1109/TMM.2022.3192661
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Infrared and visible image fusion is aims to generate a composite image that can simultaneously describe the salient target in the infrared image and texture details in the visible image of the same scene. Since deep learning (DL) exhibits great feature extraction ability in computer vision tasks, it has also been widely employed in handling infrared and visible image fusion issue. However, the existing DL-based methods generally extract complementary information from source images through convolutional operations, which results in limited preservation of global features. To this end, we propose a novel infrared and visible image fusion method, i.e., the Y-shape dynamic Transformer (YDTR). Specifically, a dynamic Transformer module (DTRM) is designed to acquire not only the local features but also the significant context information. Furthermore, the proposed network is devised in a Y-shape to comprehensively maintain the thermal radiation information from the infrared image and scene details from the visible image. Considering the specific information provided by the source images, we design a loss function that consists of two terms to improve fusion quality: a structural similarity (SSIM) term and a spatial frequency (SF) term. Extensive experiments on mainstream datasets illustrate that the proposed method outperforms both classical and state-of-the-art approaches in both qualitative and quantitative assessments. We further extend the YDTR to address other infrared and RGB-visible images and multi-focus images without fine-tuning, and the satisfactory fusion results demonstrate that the proposed method has good generalization capability.
引用
收藏
页码:5413 / 5428
页数:16
相关论文
共 50 条
  • [1] TFIV: Multigrained Token Fusion for Infrared and Visible Image via Transformer
    Li, Jing
    Yang, Bin
    Bai, Lu
    Dou, Hao
    Li, Chang
    Ma, Lingfei
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [2] DATFuse: Infrared and Visible Image Fusion via Dual Attention Transformer
    Tang, Wei
    He, Fazhi
    Liu, Yu
    Duan, Yansong
    Si, Tongzhen
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (07) : 3159 - 3172
  • [3] AITFuse: Infrared and visible image fusion via adaptive interactive transformer learning
    Wang, Zhishe
    Yang, Fan
    Sun, Jing
    Xu, Jiawei
    Yang, Fengbao
    Yan, Xiaomei
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [4] Semantic perceptive infrared and visible image fusion Transformer
    Yang, Xin
    Huo, Hongtao
    Li, Chang
    Liu, Xiaowen
    Wang, Wenxi
    Wang, Cheng
    [J]. PATTERN RECOGNITION, 2024, 149
  • [5] ITFuse: An interactive transformer for infrared and visible image fusion
    Tang, Wei
    He, Fazhi
    Liu, Yu
    [J]. PATTERN RECOGNITION, 2024, 156
  • [6] Infrared and Visible Image Fusion with Convolutional Neural Network and Transformer
    Yang, Yang
    Ren, Zhennan
    Li, Beichen
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (16)
  • [7] MFT: Multi-scale Fusion Transformer for Infrared and Visible Image Fusion
    Zhang, Chen-Ming
    Yuan, Chengbo
    Luo, Yong
    Zhou, Xin
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 485 - 496
  • [8] Infrared and visible image fusion via gradientlet filter
    Ma, Jiayi
    Zhou, Yi
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 197
  • [9] RITFusion: Reinforced Interactive Transformer Network for Infrared and Visible Image Fusion
    Li, Xiaoling
    Li, Yanfeng
    Chen, Houjin
    Peng, Yahui
    Chen, Luyifu
    Wang, Minjun
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [10] MGT: Modality-Guided Transformer for Infrared and Visible Image Fusion
    Zhang, Taoying
    Li, Hesong
    Liu, Qiankun
    Wang, Xiaoyong
    Fu, Ying
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 321 - 332