VISION-LANGUAGE JOINT LEARNING FOR BOX-SUPERVISED CHANGE DETECTION IN REMOTE SENSING

被引：0

作者：

Yin, Kanghua ^{[1
]}

Liu, Fang ^{[1
]}

Liu, Jia ^{[1
]}

Xiao, Liang ^{[1
]}

机构：

[1] Nanjing Univ Sci & Technol, Jiangsu Prov Engn Res Ctr Airborne Detecting & In, Sch Comp Sci & Engn, Nanjing, Peoples R China

来源：

2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2024) | 2024年

关键词：

Change detection; remote sensing; vision-language; box-supervised;

D O I：

10.1109/IGARSS53475.2024.10641329

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Change detection (CD) in remote sensing aims at revealing land cover changes according to the category of the ground objects. However, the category information is always missing in current popular vision-based CD methods. Considering that language analysis is really good at identifying different categories, a vision-language joint learning method is proposed in this paper, which consists of two vision-language joint representation (VLJR) modules and a changed instance segmentation (CIS) module. The former combines image features and language features with the help of text encoder and Transformer. The latter generates the final pixel-level CD result with only box-level labeled samples by level-set evolution and box matching supervision, which reduces manual-labor to a large extent. Tested on representative WHU datasets, the proposed method achieves comparable results to fully-supervised CD methods and is ahead of the other weakly-supervised methods.

引用

页码：10254 / 10258

页数：5

共 50 条

[41] Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model
Du, Yu
Wei, Fangyun
Zhang, Zihe
Shi, Miaojing
Gao, Yue
Li, Guoqi
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14064 - 14073
[42] Weakly Supervised Learning for Target Detection in Remote Sensing Images
Zhang, Dingwen
Han, Junwei
Cheng, Gong
Liu, Zhenbao
Bu, Shuhui
Guo, Lei
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (04) : 701 - 705
[43] SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Hoyer, Lukas
Tan, David Joseph
Naeem, Muhammad Ferjad
Van Gool, Luc
Tombari, Federico
COMPUTER VISION - ECCV 2024, PT XXXIX, 2025, 15097 : 257 - 275
[44] Prior-Experience-Based Vision-Language Model for Remote Sensing Image-Text Retrieval
Tang, Xu
Huang, Dabiao
Ma, Jingjing
Zhang, Xiangrong
Liu, Fang
Jiao, Licheng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[45] RS5M and GeoRSCLIP: A Large-Scale Vision- Language Dataset and a Large Vision-Language Model for Remote Sensing
Zhang, Zilun
Zhao, Tiancheng
Guo, Yulong
Yin, Jianwei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[46] CrackCLIP: Adapting Vision-Language Models for Weakly Supervised Crack Segmentation
Liang, Fengjiao
Li, Qingyong
Yu, Haomin
Wang, Wen
ENTROPY, 2025, 27 (02)
[47] Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog
Chen, Feilong
Zhang, Duzhen
Chen, Xiuyi
Shi, Jing
Xu, Shuang
Xu, Bo
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4142 - 4153
[48] Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
Wang, Xin
Huang, Qiuyuan
Celikyilmaz, Asli
Gao, Jianfeng
Shen, Dinghan
Wang, Yuan-Fang
Wang, William Yang
Zhang, Lei
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3622 - 6631
[49] S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions
Mo, Sangwoo
Kim, Minkyu
Lee, Kyungmin
Shin, Jinwoo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[50] Self-Supervised Change Detection in Multiview Remote Sensing Images
Chen, Yuxing
Bruzzone, Lorenzo
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

← 1 2 3 4 5 →