AFT: Adaptive Fusion Transformer for Visible and Infrared Images

被引:19
|
作者
Chang, Zhihao [1 ]
Feng, Zhixi [2 ]
Yang, Shuyuan [2 ]
Gao, Quanwei [3 ]
机构
[1] Xidian Univ, Int Res Ctr Intelligent Percept & Comp, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710000, Peoples R China
[2] Sch Artificial Intelligence, Xian 710071, Peoples R China
[3] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformers; Feature extraction; Decoding; Convolution; Visualization; Image fusion; Fuses; Multi-modality images; transformer; adaptive fusion; multi-head self-attention; multi-head self-fusion; QUALITY ASSESSMENT; FRAMEWORK;
D O I
10.1109/TIP.2023.3263113
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, an Adaptive Fusion Transformer (AFT) is proposed for unsupervised pixel-level fusion of visible and infrared images. Different from the existing convolutional networks, transformer is adopted to model the relationship of multi-modality images and explore cross-modal interactions in AFT. The encoder of AFT uses a Multi-Head Self-attention (MSA) module and Feed Forward (FF) network for feature extraction. Then, a Multi-head Self-Fusion (MSF) module is designed for the adaptive perceptual fusion of the features. By sequentially stacking the MSF, MSA, and FF, a fusion decoder is constructed to gradually locate complementary features for recovering informative images. In addition, a structure-preserving loss is defined to enhance the visual quality of fused images. Extensive experiments are conducted on several datasets to compare our proposed AFT method with 21 popular approaches. The results show that AFT has state-of-the-art performance in both quantitative metrics and visual perception.
引用
收藏
页码:2077 / 2092
页数:16
相关论文
共 50 条
  • [1] Adaptive fusion of infrared and visible images in dynamic scene
    Yang, Guang
    Yin, Yafeng
    Man, Hong
    Desai, Sachi
    [J]. UNMANNED/UNATTENDED SENSORS AND SENSOR NETWORKS VIII, 2011, 8184
  • [2] SwinFuse: A Residual Swin Transformer Fusion Network for Infrared and Visible Images
    Wang, Zhishe
    Chen, Yanlin
    Shao, Wenyu
    Li, Hui
    Zhang, Lei
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [3] AITFuse: Infrared and visible image fusion via adaptive interactive transformer learning
    Wang, Zhishe
    Yang, Fan
    Sun, Jing
    Xu, Jiawei
    Yang, Fengbao
    Yan, Xiaomei
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [4] An Adaptive Two-Scale Image Fusion of visible and Infrared Images
    Han, Xiyu
    Lv, Tao
    Song, Xiangyu
    Nie, Ting
    Liang, Huaidan
    He, Bin
    Kuijper, Arjan
    [J]. IEEE ACCESS, 2019, 7 : 56341 - 56352
  • [5] An illumination-dependent adaptive fusion method for infrared and visible images
    Li, Zhijian
    Yang, Fengbao
    Gao, Yubin
    Ji, Linna
    [J]. INFRARED PHYSICS & TECHNOLOGY, 2023, 131
  • [6] Bayesian fusion for infrared and visible images
    Zhao, Zixiang
    Xu, Shuang
    Zhang, Chunxia
    Liu, Junmin
    Zhang, Jiangshe
    [J]. SIGNAL PROCESSING, 2020, 177
  • [7] Poisson Fusion of Infrared and Visible Images
    He Bing-yang
    Zhang Zhi-quan
    Yang Jian-chang
    Li Chen
    Zhang Yu
    [J]. ACTA PHOTONICA SINICA, 2019, 48 (01)
  • [8] FAST NEAR INFRARED FUSION-BASED ADAPTIVE ENHANCEMENT OF VISIBLE IMAGES
    Elliethy, Ahmed
    Aly, Hussein A.
    [J]. 2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 156 - 160
  • [9] Multicontourlet-Based Adaptive Fusion of Infrared and Visible Remote Sensing Images
    Chang, Xia
    Jiao, Licheng
    Liu, Fang
    Xin, Fangfang
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2010, 7 (03) : 549 - 553
  • [10] An adaptive fusion approach for infrared and visible images based on NSCT and compressed sensing
    Zhang, Qiong
    Maldague, Xavier
    [J]. INFRARED PHYSICS & TECHNOLOGY, 2016, 74 : 11 - 20