Infrared Image Generation Based on Visual State Space and Contrastive Learning

被引：0

作者：

Li, Bing ^{[1
]}

Ma, Decao ^{[1
]}

He, Fang ^{[1
]}

Zhang, Zhili ^{[1
]}

Zhang, Daqiao ^{[1
]}

Li, Shaopeng ^{[1
,2
]}

机构：

[1] Xian Res Inst High Technol, Xian 710025, Peoples R China

[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

来源：

REMOTE SENSING | 2024年 / 16卷 / 20期

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

visual state space; contrastive learning; generative adversarial network; visible-to-infrared image translation;

D O I：

10.3390/rs16203817

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

The preparation of infrared reference images is of great significance for improving the accuracy and precision of infrared imaging guidance. However, collecting infrared data on-site is difficult and time-consuming. Fortunately, the infrared images can be obtained from the corresponding visible-light images to enrich the infrared data. To this end, this present work proposes an image translation algorithm that converts visible-light images to infrared images. This algorithm, named V2IGAN, is founded on the visual state space attention module and multi-scale feature contrastive learning loss. Firstly, we introduce a visual state space attention module designed to sharpen the generative network's focus on critical regions within visible-light images. This enhancement not only improves feature extraction but also bolsters the generator's capacity to accurately model features, ultimately enhancing the quality of generated images. Furthermore, the method incorporates a multi-scale feature contrastive learning loss function, which serves to bolster the robustness of the model and refine the detail of the generated images. Experimental results show that the V2IGAN method outperforms existing typical infrared image generation techniques in both subjective visual assessments and objective metric evaluations. This suggests that the V2IGAN method is adept at enhancing the feature representation in images, refining the details of the generated infrared images, and yielding reliable, high-quality results.

引用

页数：24

共 50 条

[31] Contrastive Learning for Image Captioning
Dai, Bo
Lin, Dahua
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[32] ENHANCING CONTRASTIVE LEARNING WITH TEMPORAL COGNIZANCE FOR AUDIO-VISUAL REPRESENTATION GENERATION
Lavania, Chandrashekhar
Sundaram, Shiva
Srinivasan, Sundararajan
Kirchhoff, Katrin
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4728 - 4732
[33] Framework for Contrastive Learning Phases of Matter Based on Visual Representations
Xiao-Qi Han
Sheng-Song Xu
Zhen Feng
Rong-Qiang He
Zhong-Yi Lu
Chinese Physics Letters, 2023, 40 (02) : 74 - 78
[34] Framework for Contrastive Learning Phases of Matter Based on Visual Representations
Han, Xiao-Qi
Xu, Sheng-Song
Feng, Zhen
He, Rong-Qiang
Lu, Zhong-Yi
CHINESE PHYSICS LETTERS, 2023, 40 (02)
[35] Mutual Contrastive Learning for Visual Representation Learning
Yang, Chuanguang
An, Zhulin
Cai, Linhang
Xu, Yongjun
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3045 - 3053
[36] Multi-focus image fusion with visual state space model and dual adversarial learning
Xie, Xinzhe
Guo, Buyu
Li, Peiliang
He, Shuangyan
Zhou, Sangjun
COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123
[37] Cross-similarity guided contrastive learning for infrared-visible image-to-image translation
Yu, Pan
Zhao, Wei
Huang, Yan
Wang, Guoyou
Proceedings of SPIE - The International Society for Optical Engineering, 2024, 13180
[38] Vectorized Feature Space Embedded Clustering Based on Contrastive Learning
Zheng, Yang
Wu, Yongming
Xu, An
Computer Engineering and Applications, 2024, 60 (04) : 211 - 219
[39] Continuous image anomaly detection based on contrastive lifelong learning
Wentao Fan
Weimin Shangguan
Nizar Bouguila
Applied Intelligence, 2023, 53 : 17693 - 17707
[40] Retinal OCTA Image Segmentation Based on Global Contrastive Learning
Ma, Ziping
Feng, Dongxiu
Wang, Jingyu
Ma, Hu
SENSORS, 2022, 22 (24)

← 1 2 3 4 5 →