Infrared Image Generation Based on Visual State Space and Contrastive Learning

被引:0
|
作者
Li, Bing [1 ]
Ma, Decao [1 ]
He, Fang [1 ]
Zhang, Zhili [1 ]
Zhang, Daqiao [1 ]
Li, Shaopeng [1 ,2 ]
机构
[1] Xian Res Inst High Technol, Xian 710025, Peoples R China
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
visual state space; contrastive learning; generative adversarial network; visible-to-infrared image translation;
D O I
10.3390/rs16203817
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The preparation of infrared reference images is of great significance for improving the accuracy and precision of infrared imaging guidance. However, collecting infrared data on-site is difficult and time-consuming. Fortunately, the infrared images can be obtained from the corresponding visible-light images to enrich the infrared data. To this end, this present work proposes an image translation algorithm that converts visible-light images to infrared images. This algorithm, named V2IGAN, is founded on the visual state space attention module and multi-scale feature contrastive learning loss. Firstly, we introduce a visual state space attention module designed to sharpen the generative network's focus on critical regions within visible-light images. This enhancement not only improves feature extraction but also bolsters the generator's capacity to accurately model features, ultimately enhancing the quality of generated images. Furthermore, the method incorporates a multi-scale feature contrastive learning loss function, which serves to bolster the robustness of the model and refine the detail of the generated images. Experimental results show that the V2IGAN method outperforms existing typical infrared image generation techniques in both subjective visual assessments and objective metric evaluations. This suggests that the V2IGAN method is adept at enhancing the feature representation in images, refining the details of the generated infrared images, and yielding reliable, high-quality results.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Contrastive Learning for Image Captioning
    Dai, Bo
    Lin, Dahua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [32] ENHANCING CONTRASTIVE LEARNING WITH TEMPORAL COGNIZANCE FOR AUDIO-VISUAL REPRESENTATION GENERATION
    Lavania, Chandrashekhar
    Sundaram, Shiva
    Srinivasan, Sundararajan
    Kirchhoff, Katrin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4728 - 4732
  • [33] Framework for Contrastive Learning Phases of Matter Based on Visual Representations
    Xiao-Qi Han
    Sheng-Song Xu
    Zhen Feng
    Rong-Qiang He
    Zhong-Yi Lu
    Chinese Physics Letters, 2023, 40 (02) : 74 - 78
  • [34] Framework for Contrastive Learning Phases of Matter Based on Visual Representations
    Han, Xiao-Qi
    Xu, Sheng-Song
    Feng, Zhen
    He, Rong-Qiang
    Lu, Zhong-Yi
    CHINESE PHYSICS LETTERS, 2023, 40 (02)
  • [35] Mutual Contrastive Learning for Visual Representation Learning
    Yang, Chuanguang
    An, Zhulin
    Cai, Linhang
    Xu, Yongjun
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3045 - 3053
  • [36] Multi-focus image fusion with visual state space model and dual adversarial learning
    Xie, Xinzhe
    Guo, Buyu
    Li, Peiliang
    He, Shuangyan
    Zhou, Sangjun
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123
  • [37] Cross-similarity guided contrastive learning for infrared-visible image-to-image translation
    Yu, Pan
    Zhao, Wei
    Huang, Yan
    Wang, Guoyou
    Proceedings of SPIE - The International Society for Optical Engineering, 2024, 13180
  • [38] Vectorized Feature Space Embedded Clustering Based on Contrastive Learning
    Zheng, Yang
    Wu, Yongming
    Xu, An
    Computer Engineering and Applications, 2024, 60 (04) : 211 - 219
  • [39] Continuous image anomaly detection based on contrastive lifelong learning
    Wentao Fan
    Weimin Shangguan
    Nizar Bouguila
    Applied Intelligence, 2023, 53 : 17693 - 17707
  • [40] Retinal OCTA Image Segmentation Based on Global Contrastive Learning
    Ma, Ziping
    Feng, Dongxiu
    Wang, Jingyu
    Ma, Hu
    SENSORS, 2022, 22 (24)