CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion

被引:252
|
作者
Zhao, Zixiang [1 ,2 ]
Bai, Haowen [1 ]
Zhang, Jiangshe [1 ]
Zhang, Yulun [2 ]
Xu, Shuang [3 ,4 ]
Lin, Zudi [5 ]
Timofte, Radu [2 ,6 ]
Van Gool, Luc [2 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Peoples R China
[2] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
[3] Northwestern Polytech Univ Shenzhen, Inst Res & Dev, Shenzhen, Peoples R China
[4] Northwestern Polytech Univ, Xian, Peoples R China
[5] Harvard Univ, Cambridge, England
[6] Univ Wurzburg, Wurzburg, Germany
基金
中国国家自然科学基金;
关键词
NETWORK;
D O I
10.1109/CVPR52729.2023.00572
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-modality (MM) image fusion aims to render fused images that maintain the merits of different modalities, e.g., functional highlight and detailed textures. To tackle the challenge in modeling cross-modality features and decomposing desirable modality-specific and modality-shared features, we propose a novel Correlation-Driven feature Decomposition Fusion (CDDFuse) network. Firstly, CDDFuse uses Restormer blocks to extract cross-modality shallow features. We then introduce a dual-branch Transformer-CNN feature extractor with Lite Transformer (LT) blocks leveraging long-range attention to handle low-frequency global features and Invertible Neural Networks (INN) blocks focusing on extracting high-frequency local information. A correlation-driven loss is further proposed to make the low-frequency features correlated while the high-frequency features un-correlated based on the embedded information. Then, the LT-based global fusion and INN-based local fusion layers output the fused image. Extensive experiments demonstrate that our CDDFuse achieves promising results in multiple fusion tasks, including infrared-visible image fusion and medical image fusion. We also show that CDDFuse can boost the performance in downstream infrared-visible semantic segmentation and object detection in a unified benchmark. The code is available at https://github.com/Zhaozixiang1228/MMIF-CDDFuse.
引用
收藏
页码:5906 / 5916
页数:11
相关论文
共 50 条
  • [21] Multi-stage Image Fusion Method Based on Differential Dual-Branch Encoder
    Hong, Yulu
    Wu, Xiaojun
    Xu, Tianyang
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (07): : 661 - 670
  • [22] DBFNet: A Dual-Branch Fusion Network for Underwater Image Enhancement
    Sun, Kaichuan
    Tian, Yubo
    REMOTE SENSING, 2023, 15 (05)
  • [23] A dual-branch infrared and visible image fusion network using progressive image-wise feature transfer
    Xu, Shaoping
    Zhou, Changfei
    Xiao, Jian
    Tao, Wuyong
    Dai, Tianyu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 102
  • [24] CT and MRI image fusion via dual-branch GAN
    Zhai, Wenzhe
    Song, Wenhao
    Chen, Jinyong
    Zhang, Guisheng
    Li, Qilei
    Gao, Mingliang
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2023, 42 (01) : 52 - 63
  • [25] Multi-modality image registration using local correlation
    Rösch, P
    Blaffert, T
    Weese, J
    CARS '99: COMPUTER ASSISTED RADIOLOGY AND SURGERY, 1999, 1191 : 228 - 232
  • [26] CORRELATION-BASED FEATURE ANALYSIS AND MULTI-MODALITY FUSION FRAMEWORK FOR MULTIMEDIA SEMANTIC RETRIEVAL
    Ha, Hsin-Yu
    Yang, Yimin
    Fleites, Fausto C.
    Chen, Shu-Ching
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [27] Co-Enhancement of Multi-Modality Image Fusion and Object Detection via Feature Adaptation
    Dong, Aimei
    Wang, Long
    Liu, Jian
    Xu, Jingyuan
    Zhao, Guixin
    Zhai, Yi
    Lv, Guohua
    Cheng, Jinyong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12624 - 12637
  • [28] DFFNet: A Rainfall Nowcasting Model Based on Dual-Branch Feature Fusion
    Liu, Shuxian
    Liu, Yulong
    Zheng, Jiong
    Liao, Yuanyuan
    Zheng, Guohong
    Zhang, Yongjun
    ELECTRONICS, 2024, 13 (14)
  • [29] Dual-Branch Feature Fusion Remote Sensing Building Detection Model
    Cheng, Jiawei
    Guo, Rongzuo
    Wu, Jiancheng
    Zhang, Hao
    Computer Engineering and Applications, 2024, 60 (22) : 145 - 153
  • [30] DFENet: A dual-branch feature enhanced network integrating transformers and convolutional feature learning for multimodal medical image fusion
    Li, Weisheng
    Zhang, Yin
    Wang, Guofen
    Huang, Yuping
    Li, Ruyue
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 80