MSL-CCRN: Multi-stage self-supervised learning based cross-modality contrastive representation network for infrared and visible image fusion

被引:0
|
作者
Yan, Zhilin [1 ]
Nie, Rencan [1 ]
Cao, Jinde [2 ,3 ]
Xie, Guangxu [1 ]
Ding, Zhengze [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650500, Peoples R China
[2] Southeast Univ, Sch Math, Nanjing 211189, Peoples R China
[3] Ahlia Univ, Manama 10878, Bahrain
基金
中国博士后科学基金;
关键词
Contrastive representation network; Image fusion; Multi-stage; Contrastive learning; Self-supervised; MULTISCALE TRANSFORM; FRAMEWORK; WAVELET; NEST;
D O I
10.1016/j.dsp.2024.104853
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Infrared and visible image fusion (IVIF) facing different information in two modal scenarios, the focus of research is to better extract different information. In this work, we propose a multi-stage self-supervised learning based cross-modality contrastive representation network for infrared and visible image fusion (MSL-CCRN). Firstly, considering that the scene differences between different modalities affect the fusion of cross-modal images, we propose a contrastive representation network (CRN). CRN enhances the interaction between the fused image and the source image, and significantly improves the similarity between the meaningful features in each modality and the fused image. Secondly, due to the lack of ground truth in IVIF, the quality of directly obtained fused image is seriously affected. We design a multi-stage fusion strategy to address the loss of important information in this process. Notably, our method is a self-supervised network. In fusion stage I, we reconstruct the initial fused image as the new view of fusion stage II. In fusion stage II, we use the fused image obtained in the previous stage to carry out three-view contrastive representation, thereby constraining the feature extraction of the source image. This makes the final fused image introduce more important information in the source image. Through a large number of qualitative, quantitative experiments and downstream object detection experiments, our propose method shows excellent performance compared with most advanced methods.
引用
收藏
页数:14
相关论文
共 43 条
  • [31] One-stage self-supervised momentum contrastive learning network for open-set cross-domain fault diagnosis
    Wang, Weicheng
    Li, Chao
    Li, Aimin
    Li, Fudong
    Chen, Jinglong
    Zhang, Tianci
    KNOWLEDGE-BASED SYSTEMS, 2023, 275
  • [32] TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework Using Self-Supervised Multi-Task Learning
    Qu, Linhao
    Liu, Shaolei
    Wang, Manning
    Song, Zhijian
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2126 - 2134
  • [33] Infrared and Visible Image Fusion Based on Multi-scale Network with Dual-channel Information Cross Fusion Block
    Yang, Yong
    Kong, Xiangkai
    Huang, Shuying
    Wan, Weiguo
    Liu, Jiaxiang
    Zhang, Wang
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [34] Cross-Attention Based Multi-Resolution Feature Fusion Model for Self-Supervised Cervical OCT Image Classification
    Wang, Qingbin
    Chen, Kaiyi
    Dou, Wanrong
    Ma, Yutao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (04) : 2541 - 2554
  • [35] A fine-grained image classification algorithm based on self-supervised learning and multi-feature fusion of blood cells
    Jia, Nan
    Guo, Jingxia
    Li, Yan
    Tang, Siyuan
    Xu, Li
    Liu, Liang
    Xing, Junfeng
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [36] Modality specific infrared and visible image fusion based on multi-scale rich feature representation under low-light environment
    Liu, Chenhua
    Chen, Hanrui
    Deng, Lei
    Guo, Chentong
    Lu, Xitian
    Yu, Heng
    Zhu, Lianqing
    Dong, Mingli
    INFRARED PHYSICS & TECHNOLOGY, 2024, 140
  • [37] MSDCNet: Multi-stage and deep residual complementary multi-focus image fusion network based on multi-scale feature learning
    Hu, Gang
    Jiang, Jinlin
    Sheng, Guanglei
    Wei, Guo
    APPLIED INTELLIGENCE, 2025, 55 (03)
  • [38] Social-SSL: Self-supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction
    Tsao, Li-Wu
    Wang, Yan-Kai
    Lin, Hao-Siang
    Shuai, Hong-Han
    Wong, Lai-Kuan
    Cheng, Wen-Huang
    COMPUTER VISION, ECCV 2022, PT XXII, 2022, 13682 : 234 - 250
  • [39] Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension
    Zhang, Yujia
    Li, Qianzhong
    Pan, Yi
    Zhao, Xiaoguang
    Tan, Min
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 3256 - 3270
  • [40] Two-stage Parallax Correction and Multi-stage Cross-view Fusion Network Based Stereo Image Super-Resolution
    Zheng, Yijian
    Li, Sumei
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,