RingMo-SAM: A Foundation Model for Segment Anything in Multimodal Remote-Sensing Images

被引:14
|
作者
Yan, Zhiyuan [1 ,2 ]
Li, Junxi [1 ,2 ,3 ,4 ]
Li, Xuexue [1 ,2 ,3 ,4 ]
Zhou, Ruixue [1 ,2 ]
Zhang, Wenkai [1 ,2 ]
Feng, Yingchao [1 ,2 ]
Diao, Wenhui [1 ,2 ]
Fu, Kun [1 ,2 ,3 ,4 ]
Sun, Xian [1 ,2 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] AerospaceInformat Res Inst, Chinese Acad Sci, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
关键词
Remote sensing; Task analysis; Semantic segmentation; Feature extraction; Radar polarimetry; Training; Adaptation models; Multimodal remote-sensing images; prompt learning; segment anything model (SAM); semantic segmentation;
D O I
10.1109/TGRS.2023.3332219
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The proposal of the segment anything model (SAM) has created a new paradigm for the deep-learning-based semantic segmentation field and has shown amazing generalization performance. However, we find it may fail or perform poorly on multimodal remote-sensing scenarios, especially synthetic aperture radar (SAR) images. Besides, SAM does not provide category information for objects. In this article, we propose a foundation model for multimodal remote-sensing image segmentation called RingMo-SAM, which can not only segment anything in optical and SAR remote-sensing data, but also identify object categories. First, a large-scale dataset containing millions of segmentation instances is constructed by collecting multiple open-source datasets in this field to train the model. Then, by constructing an instance-type and terrain-type category-decoupling mask decoder (CDMDecoder), the categorywise segmentation of various objects is achieved. In addition, a prompt encoder embedded with the characteristics of multimodal remote-sensing data is designed. It not only supports multibox prompts to improve the segmentation accuracy of multiobjects in complicated remote-sensing scenes, but also supports SAR characteristics prompts to improve the segmentation performance on SAR images. Extensive experimental results on several datasets including iSAID, ISPRS Vaihingen, ISPRS Potsdam, AIR-PolSAR-Seg, and so on have demonstrated the effectiveness of our method.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [1] RingMo: A Remote Sensing Foundation Model With Masked Image Modeling
    Sun, Xian
    Wang, Peijin
    Lu, Wanxuan
    Zhu, Zicong
    Lu, Xiaonan
    He, Qibin
    Li, Junxi
    Rong, Xuee
    Yang, Zhujun
    Chang, Hao
    He, Qinglin
    Yang, Guang
    Wang, Ruiping
    Lu, Jiwen
    Fu, Kun
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [2] The Segment Anything Model (SAM) for remote sensing applications: From zero to one shot
    Osco, Lucas Prado
    Wu, Qiusheng
    de Lemos, Eduardo Lopes
    Gonsalves, Wesley Nunes
    Ramos, Ana Paula Marques
    Li, Jonathan
    Marcato, Jose
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 124
  • [3] Adapting Segment Anything Model for Change Detection in VHR Remote Sensing Images
    Ding, Lei
    Zhu, Kun
    Peng, Daifeng
    Tang, Hao
    Yang, Kuiwu
    Bruzzone, Lorenzo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 11
  • [4] MeSAM: Multiscale Enhanced Segment Anything Model for Optical Remote Sensing Images
    Zhou, Xichuan
    Liang, Fu
    Chen, Lihui
    Liu, Haijun
    Song, Qianqian
    Vivone, Gemine
    Chanussot, Jocelyn
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [5] The ability of Segmenting Anything Model (SAM) to segment ultrasound images
    Chen, Fang
    Chen, Lingyu
    Han, Haojie
    Zhang, Sainan
    Zhang, Daoqiang
    Liao, Hongen
    [J]. BIOSCIENCE TRENDS, 2023, 17 (03) : 211 - 218
  • [6] SCD-SAM: Adapting Segment Anything Model for Semantic Change Detection in Remote Sensing Imagery
    Mei, Liye
    Ye, Zhaoyi
    Xu, Chuan
    Wang, Hongzhu
    Wang, Ying
    Lei, Cheng
    Yang, Wei
    Li, Yansheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13
  • [7] Semantic segmentation of glaciological features across multiple remote sensing platforms with the Segment Anything Model (SAM)
    Shankar, Siddharth
    Stearns, Leigh A.
    van der Veen, C. J.
    [J]. JOURNAL OF GLACIOLOGY, 2023,
  • [8] MW-SAM:Mangrove wetland remote sensing image segmentation network based on segment anything model
    Zhang, Yu
    Wang, Xin
    Cai, Jingye
    Yang, Qun
    [J]. IET Image Processing, 2024, 18 (14) : 4503 - 4513
  • [9] Road-SAM: Adapting the Segment Anything Model to Road Extraction From Large Very-High-Resolution Optical Remote Sensing Images
    Feng, Wenqing
    Guan, Fangli
    Sun, Chenhao
    Xu, Wei
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [10] Segment Anything Model (SAM) Assisted Remote Sensing Supervision for Mariculture-Using Liaoning Province, China as an Example
    Ren, Yougui
    Yang, Xiaomei
    Wang, Zhihua
    Yu, Ge
    Liu, Yueming
    Liu, Xiaoliang
    Meng, Dan
    Zhang, Qingyang
    Yu, Guo
    [J]. REMOTE SENSING, 2023, 15 (24)