DED-SAM:Adapting Segment Anything Model 2 for Dual Encoder-Decoder Change Detection

被引:1
|
作者
Qiu, Junlong [1 ]
Liu, Wei [1 ]
Zhang, Xin [2 ]
Li, Erzhu [1 ]
Zhang, Lianpeng [1 ]
Li, Xing [1 ]
机构
[1] Jiangsu Normal Univ, Sch Geog Geomat & Planning, Xuzhou 221116, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Feature extraction; Remote sensing; Decoding; Data models; Visualization; Transformers; Object oriented modeling; Accuracy; Semantics; Semantic segmentation; Change detection; remote sensing; segment anything model; vision foundation model; NETWORK;
D O I
10.1109/JSTARS.2024.3490754
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Change detection has become a crucial topic in the field of remote sensing deep learning due to its extensive application in earth observation. However, real remote sensing images often contain multiple land cover classes with significant intraclass variability and interclass similarity, limiting the performance of change detection in complex scenarios. To address this, we leverage the capabilities of vision foundation models by applying the segment anything model (SAM) to remote sensing change detection, and we name this method dual encoder-decoder SAM (DED-SAM). Specifically, we construct a DED framework, utilizing a small-scale change detection model in both branches to generate mixed prompts including image features, mask prompts, and box prompts. The SAM 2 model is used for fine-grained recognition of dual-temporal images, generating accurate, and stable feature boundaries, which are then used as constraints to generate the final change mask. To validate the effectiveness of DED-SAM across various application scenarios, we conduct quantitative experiments on three public datasets: Levir-CD, SYSU-CD, and CDD, testing its detection capabilities under single change categories, multiple change categories, and seasonal pseudochange interference. The results show that the proposed DED-SAM achieved state-of-the-art F1 scores and IoUs on these three datasets: LEVIR-CD (92.00%, 85.11%), SYSU-CD (84.15%, 72.01%), and CDD (97.72%, 95.47%).
引用
收藏
页码:995 / 1006
页数:12
相关论文
共 50 条
  • [21] Road Semantic Segmentation and Traffic Object Detection Model Based on Encoder-Decoder CNN Architecture
    Wang, Yih-Chen
    Yu, Chao-Wei
    Lu, Xiu-Ying
    Chen, Yen-Lin
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 421 - 422
  • [22] Quantum Mayfly Optimization with Encoder-Decoder Driven LSTM Networks for Malware Detection and Classification Model
    Alzubi, Omar A.
    Alzubi, Jafar A.
    Alzubi, Tareq Mahmod
    Singh, Ashish
    MOBILE NETWORKS & APPLICATIONS, 2023, 28 (02): : 795 - 807
  • [23] Street-view Change Detection via Siamese Encoder-decoder Structured Convolutional Neural Networks
    Zhao, Xinwei
    Li, Haichang
    Wang, Rui
    Zheng, Changwen
    Shi, Song
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 525 - 532
  • [24] Detection of Building Change in Remote Sensing Image Based on Encoder-Decoder Network UNet3+
    Liang Y.
    Yi C.-X.
    Wang G.-Y.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (08): : 1720 - 1732
  • [25] Unsupervised anomaly detection in hourly water demand data using an asymmetric encoder-decoder model
    Yan, Jieru
    Tao, Tao
    JOURNAL OF HYDROLOGY, 2022, 613
  • [26] ASS-CD: Adapting Segment Anything Model and Swin-Transformer for Change Detection in Remote Sensing Images
    Wei, Chenlong
    Wu, Xiaofeng
    Wang, Bin
    REMOTE SENSING, 2025, 17 (03)
  • [27] Adapting the segment anything model for multi-modal retinal anomaly detection and localization
    Li, Jingtao
    Chen, Ting
    Wang, Xinyu
    Zhong, Yanfei
    Xiao, Xuan
    INFORMATION FUSION, 2025, 113
  • [28] SAMNet: Adapting segment anything model for accurate light field salient object detection
    Wang, Xingzheng
    Wu, Jianbin
    Wu, Shaoyong
    Li, Jiahui
    IMAGE AND VISION COMPUTING, 2025, 154
  • [29] Ground-Based Remote Sensing Cloud Detection Using Dual Pyramid Network and Encoder-Decoder Constraint
    Zhang, Zhong
    Yang, Shuzhen
    Liu, Shuang
    Cao, Xiaozhong
    Durrani, Tariq S.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [30] An automated detection model of threat objects for X-Ray baggage inspection based on modified encoder-decoder model
    Sara, Dioline
    Mandava, Ajay Kumar
    NONDESTRUCTIVE TESTING AND EVALUATION, 2024, 39 (08) : 2730 - 2755