VISION-LANGUAGE JOINT LEARNING FOR BOX-SUPERVISED CHANGE DETECTION IN REMOTE SENSING

被引:0
|
作者
Yin, Kanghua [1 ]
Liu, Fang [1 ]
Liu, Jia [1 ]
Xiao, Liang [1 ]
机构
[1] Nanjing Univ Sci & Technol, Jiangsu Prov Engn Res Ctr Airborne Detecting & In, Sch Comp Sci & Engn, Nanjing, Peoples R China
来源
2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2024) | 2024年
关键词
Change detection; remote sensing; vision-language; box-supervised;
D O I
10.1109/IGARSS53475.2024.10641329
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Change detection (CD) in remote sensing aims at revealing land cover changes according to the category of the ground objects. However, the category information is always missing in current popular vision-based CD methods. Considering that language analysis is really good at identifying different categories, a vision-language joint learning method is proposed in this paper, which consists of two vision-language joint representation (VLJR) modules and a changed instance segmentation (CIS) module. The former combines image features and language features with the help of text encoder and Transformer. The latter generates the final pixel-level CD result with only box-level labeled samples by level-set evolution and box matching supervision, which reduces manual-labor to a large extent. Tested on representative WHU datasets, the proposed method achieves comparable results to fully-supervised CD methods and is ahead of the other weakly-supervised methods.
引用
收藏
页码:10254 / 10258
页数:5
相关论文
共 50 条
  • [41] Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model
    Du, Yu
    Wei, Fangyun
    Zhang, Zihe
    Shi, Miaojing
    Gao, Yue
    Li, Guoqi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14064 - 14073
  • [42] Weakly Supervised Learning for Target Detection in Remote Sensing Images
    Zhang, Dingwen
    Han, Junwei
    Cheng, Gong
    Liu, Zhenbao
    Bu, Shuhui
    Guo, Lei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (04) : 701 - 705
  • [43] SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
    Hoyer, Lukas
    Tan, David Joseph
    Naeem, Muhammad Ferjad
    Van Gool, Luc
    Tombari, Federico
    COMPUTER VISION - ECCV 2024, PT XXXIX, 2025, 15097 : 257 - 275
  • [44] Prior-Experience-Based Vision-Language Model for Remote Sensing Image-Text Retrieval
    Tang, Xu
    Huang, Dabiao
    Ma, Jingjing
    Zhang, Xiangrong
    Liu, Fang
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [45] RS5M and GeoRSCLIP: A Large-Scale Vision- Language Dataset and a Large Vision-Language Model for Remote Sensing
    Zhang, Zilun
    Zhao, Tiancheng
    Guo, Yulong
    Yin, Jianwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [46] CrackCLIP: Adapting Vision-Language Models for Weakly Supervised Crack Segmentation
    Liang, Fengjiao
    Li, Qingyong
    Yu, Haomin
    Wang, Wen
    ENTROPY, 2025, 27 (02)
  • [47] Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog
    Chen, Feilong
    Zhang, Duzhen
    Chen, Xiuyi
    Shi, Jing
    Xu, Shuang
    Xu, Bo
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4142 - 4153
  • [48] Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
    Wang, Xin
    Huang, Qiuyuan
    Celikyilmaz, Asli
    Gao, Jianfeng
    Shen, Dinghan
    Wang, Yuan-Fang
    Wang, William Yang
    Zhang, Lei
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3622 - 6631
  • [49] S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions
    Mo, Sangwoo
    Kim, Minkyu
    Lee, Kyungmin
    Shin, Jinwoo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [50] Self-Supervised Change Detection in Multiview Remote Sensing Images
    Chen, Yuxing
    Bruzzone, Lorenzo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60