Context-Aggregated and SAM-Guided Network for ViT-Based Instance Segmentation in Remote Sensing Images

Cited by: 0
Authors
Liu, Shuangzhou [1, 2, 3]
Wang, Feng [1, 2]
You, Hongjian [1, 2, 3]
Jiao, Niangang [1, 2]
Zhou, Guangyao [1, 2]
Zhang, Tingtao [1, 2]
Affiliations
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Technol Geospatial Informat Proc & Applica, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 101408, Peoples R China
Keywords
instance segmentation; remote sensing images; SAM; backbone
DOI
10.3390/rs16132472
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Instance segmentation of remote sensing images provides both object-level and pixel-level positioning information. Such pixel-level annotation has a wide range of uses in remote sensing and is of great value for environmental detection and resource management. However, optical images generally contain complex terrain and objects of changeable shape, SAR images are affected by complex scattering phenomena, and the masks that traditional instance segmentation methods produce on remote sensing images are consequently of low quality. Improving mask quality for instance segmentation in remote sensing images is therefore a challenging task. Because a traditional two-stage instance segmentation method consists of a backbone, neck, bbox head, and mask head, the final mask quality depends on the product of the quality of every preceding stage. We therefore address the difficulties that optical and SAR images pose for instance segmentation with targeted improvements to the neck, bbox head, and mask head, and propose the Context-Aggregated and SAM-Guided Network (CSNet). In this network, the plain feature fusion pyramid network (PFFPN) generates a pyramid from the plain feature and provides feature maps at a scale appropriate to each instance for detection and segmentation. A context aggregation bbox head (CABH) uses instance information together with the context information around each instance to reduce missed and false detections. A SAM-Guided mask head (SGMH) learns with SAM as a teacher and uses the learned knowledge to improve mask edges. Experimental results show that CSNet significantly improves the quality of masks generated for optical and SAR images, achieving AP increments of 5.1% and 3.2% compared with other SOTA models.
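The abstract does not specify how PFFPN builds its pyramid, so the following is only a minimal sketch of the general idea of deriving several scales from one plain (single-scale) feature map. It is not the authors' PFFPN. The function names, the toy 4x4 input, the base stride of 16, and the choice of nearest-neighbour upsampling and 2x2 max pooling are all assumptions made for illustration.

```python
from typing import Dict, List

Grid = List[List[float]]  # one channel of a feature map, rows x cols


def upsample2x(feat: Grid) -> Grid:
    """Nearest-neighbour 2x upsampling: each cell becomes a 2x2 block."""
    out: Grid = []
    for row in feat:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def maxpool2x(feat: Grid) -> Grid:
    """2x2 max pooling with stride 2 (assumes even height and width)."""
    h, w = len(feat), len(feat[0])
    return [
        [max(feat[i][j], feat[i][j + 1], feat[i + 1][j], feat[i + 1][j + 1])
         for j in range(0, w, 2)]
        for i in range(0, h, 2)
    ]


def plain_pyramid(feat: Grid, base_stride: int = 16) -> Dict[int, Grid]:
    """Hypothetical helper: derive three pyramid levels from one plain map.

    Keys are the assumed strides relative to the input image.
    """
    return {
        base_stride // 2: upsample2x(feat),     # finer level for small instances
        base_stride: [list(r) for r in feat],   # original resolution
        base_stride * 2: maxpool2x(feat),       # coarser level for large instances
    }


# Toy 4x4 single-channel map standing in for a backbone output.
feat = [[float(4 * i + j) for j in range(4)] for i in range(4)]
pyramid = plain_pyramid(feat)
for stride, level in sorted(pyramid.items()):
    print(stride, len(level), "x", len(level[0]))
```

In a real network each level would typically pass through learned layers rather than fixed resampling; the sketch only shows how one single-scale map can feed detection and segmentation heads at several scales.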
Pages: 27
Related papers
50 in total
  • [31] HCA-Net: An Instance Segmentation Network for High-Consequence Areas Identification From Remote Sensing Images
    Dai, Xiaojun
    Huang, Weiyi
    Xi, Ming
    Zhang, Yaqi
    Ma, Deying
    Wang, Daguo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [32] HCANet: A Hierarchical Context Aggregation Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Bai, Haiwei
    Cheng, Jian
    Huang, Xia
    Liu, Siyu
    Deng, Changjian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [33] An Instance Segmentation Based Framework for Large-Sized High-Resolution Remote Sensing Images Registration
    Lu, Junyan
    Jia, Hongguang
    Li, Tie
    Li, Zhuqiang
    Ma, Jingyu
    Zhu, Ruifei
    REMOTE SENSING, 2021, 13 (09)
  • [34] FAST SINGLE-SHOT SHIP INSTANCE SEGMENTATION BASED ON POLAR TEMPLATE MASK IN REMOTE SENSING IMAGES
    Huang, Zhenhang
    Sun, Shihao
    Li, Ruirui
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1236 - 1239
  • [35] CMLFormer: CNN and Multiscale Local-Context Transformer Network for Remote Sensing Images Semantic Segmentation
    Wu, Honglin
    Zhang, Min
    Huang, Peng
    Tang, Wenlong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 7233 - 7241
  • [36] CGGLNet: Semantic Segmentation Network for Remote Sensing Images Based on Category-Guided Global-Local Feature Interaction
    Ni, Yue
    Liu, Jiahang
    Chi, Weijian
    Wang, Xiaozhen
    Li, Deren
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 17
  • [37] Boundary-Guided Semantic Context Network for Water Body Extraction from Remote Sensing Images
    Yu, Jie
    Cai, Yang
    Lyu, Xin
    Xu, Zhennan
    Wang, Xinyuan
    Fang, Yiwei
    Jiang, Wenxuan
    Li, Xin
    REMOTE SENSING, 2023, 15 (17)
  • [38] An Efficient and Light Transformer-Based Segmentation Network for Remote Sensing Images of Landscapes
    Chen, Lijia
    Chen, Honghui
    Xie, Yanqiu
    He, Tianyou
    Ye, Jing
    Zheng, Yushan
    FORESTS, 2023, 14 (11)
  • [39] Semantic segmentation network for mangrove tree species based on UAV remote sensing images
    Wang, Xin
    Zhang, Yu
    Ca, Jingye
    Qin, Qin
    Feng, Yi
    Yan, Jingke
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [40] CLOUD DETECTION FOR REMOTE SENSING IMAGES BASED ON DIFFERENCE FEATURES AND SEMANTIC SEGMENTATION NETWORK
    Ma, Nan
    Sun, Lin
    Zhou, Chenghu
    He, Yawen
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 603 - 606