AFDFusion: An adaptive frequency decoupling fusion network for multi-modality image

被引:0
|
作者
机构
[1] Wang, Chengchao
[2] Zhao, Zhengpeng
[3] Yang, Qiuxia
[4] Nie, Rencan
[5] 2,Cao, Jinde
[6] 1,Pu, Yuanyuan
基金
中国国家自然科学基金;
关键词
Adversarial machine learning - Image fusion - Image matching - Image texture - Infrared imaging - Self-supervised learning;
D O I
10.1016/j.eswa.2024.125694
中图分类号
学科分类号
摘要
The multi-modality image fusion goal is to create a single image that provides a comprehensive scene description and conforms to visual perception by integrating complementary information about the merits of the different modalities, e.g., salient intensities of infrared images and detail textures of visible images. Although some works explore decoupled representations of multi-modality images, they struggle with complex nonlinear relationships, fine modal decoupling, and noise handling. To cope with this issue, we propose an adaptive frequency decoupling module to perceive the associative invariant and inherent specific among cross-modality by dynamically adjusting the learnable low frequency weight of the kernel. Specifically, we utilize a contrastive learning loss for restricting the solution space of feature decoupling to learn representations of both the invariant and specific in the multi-modality images. The underlying idea is that: in decoupling, low frequency features, which are similar in the representation space, should be pulled closer to each other, signifying the associative invariant, while high frequencies are pushed farther away, also indicating the intrinsic specific. Additionally, a multi-stage training manner is introduced into our framework to achieve decoupling and fusion. Stage I, MixEncoder and MixDecoder with the same architecture but different parameters are trained to perform decoupling and reconstruction supervised by the contrastive self-supervised mechanism. Stage II, two feature fusion modules are added to integrate the invariant and specific features and output the fused image. Extensive experiments demonstrated the proposed method superiority over the state-of-the-art methods in both qualitative and quantitative evaluation on two multi-modal image fusion tasks. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [1] Multi-modality Fusion Network for Action Recognition
    Huang, Kai
    Qin, Zheng
    Xu, Kaiping
    Ye, Shuxiong
    Wang, Guolong
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 139 - 149
  • [2] ReCoNet: Recurrent Correction Network for Fast and Efficient Multi-modality Image Fusion
    Huang, Zhanbo
    Liu, Jinyuan
    Fan, Xin
    Liu, Risheng
    Zhong, Wei
    Luo, Zhongxuan
    [J]. COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 539 - 555
  • [3] Multi-modality image fusion for image-guided neurosurgery
    Haller, JW
    Ryken, T
    Madsen, M
    Edwards, A
    Bolinger, L
    Vannier, MW
    [J]. CARS '99: COMPUTER ASSISTED RADIOLOGY AND SURGERY, 1999, 1191 : 681 - 685
  • [4] Multi-Modality Image Fusion in Adaptive-Parameters SPCNN Based on Inherent Characteristics of Image
    Zhang, Lixia
    Zeng, Guangping
    Wei, Jinjin
    Xuan, Zhaocheng
    [J]. IEEE SENSORS JOURNAL, 2020, 20 (20) : 11820 - 11827
  • [5] STAFuse: A Feature Decomposition Network with Super Token Attention for Multi-modality Image Fusion
    Chen, Peng
    Chen, Aiguo
    Wang, Chuang
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14880 : 324 - 335
  • [6] Multi-Modality Medical Image Fusion Using Convolutional Neural Network and Contrast Pyramid
    Wang, Kunpeng
    Zheng, Mingyao
    Wei, Hongyan
    Qi, Guanqiu
    Li, Yuanyuan
    [J]. SENSORS, 2020, 20 (08)
  • [7] An Interpretable Fusion Siamese Network for Multi-Modality Remote Sensing Ship Image Retrieval
    Xiong, Wei
    Xiong, Zhenyu
    Cui, Yaqi
    Huang, Linzhou
    Yang, Ruining
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (06) : 2696 - 2712
  • [8] DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion
    Zhao, Zixiang
    Bai, Haowen
    Zhu, Yuanzhi
    Zhang, Jiangshe
    Xu, Shuang
    Zhang, Yulun
    Zhang, Kai
    Meng, Deyu
    Timofte, Radu
    Van Gool, Luc
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8048 - 8059
  • [9] Lymphatic flow mapping utilizing multi-modality image fusion
    Vicic, M
    Thorstad, W
    Low, D
    Deasy, J
    [J]. MEDICAL PHYSICS, 2004, 31 (06) : 1900 - 1900
  • [10] Fast saliency-aware multi-modality image fusion
    Han, Jungong
    Pauwels, Eric J.
    de Zeeuw, Paul
    [J]. NEUROCOMPUTING, 2013, 111 : 70 - 80