Context-Aggregated and SAM-Guided Network for ViT-Based Instance Segmentation in Remote Sensing Images

被引:0
|
作者
Liu, Shuangzhou [1 ,2 ,3 ]
Wang, Feng [1 ,2 ]
You, Hongjian [1 ,2 ,3 ]
Jiao, Niangang [1 ,2 ]
Zhou, Guangyao [1 ,2 ]
Zhang, Tingtao [1 ,2 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Technol Geospatial Informat Proc & Applica, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 101408, Peoples R China
关键词
instance segmentation; remote sensing images; SAM; backbone;
D O I
10.3390/rs16132472
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Instance segmentation of remote sensing images can not only provide object-level positioning information but also provide pixel-level positioning information. This pixel-level information annotation has a wide range of uses in the field of remote sensing, and it is of great value for environmental detection and resource management. Because optical images generally have complex terrain environments and changeable object shapes, SAR images are affected by complex scattering phenomena, and the mask quality obtained by the traditional instance segmentation method used in remote sensing images is not high. Therefore, it is a challenging task to improve the mask quality of instance segmentation in remote sensing images. Since the traditional two-stage instance segmentation method consists of backbone, neck, bbox head, and mask head, the final mask quality depends on the product of all front-end work quality. Therefore, we consider the difficulty of optical and SAR images to bring instance segmentation to the targeted improvement of the neck, bbox head, and mask head, and we propose the Context-Aggregated and SAM-Guided Network (CSNet). In this network, the plain feature fusion pyramid network (PFFPN) can generate a pyramid for the plain feature and provide a feature map of the appropriate instance scale for detection and segmentation. The network also includes a context aggregation bbox head (CABH), which uses the context information and instance information around the instance to solve the problem of missed detection and false detection in detection. The network also has a SAM-Guided mask head (SGMH), which learns by using SAM as a teacher, and uses the knowledge learned to improve the edge of the mask. Experimental results show that CSNet significantly improves the quality of masks generated under optical and SAR images, and CSNet achieves 5.1% and 3.2% AP increments compared with other SOTA models.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] ECGNet: edge and class guided semantic segmentation network for remote sensing urban scene images
    Liu, Hongrong
    Liu, Minghua
    Song, Shuhua
    Guo, Guolong
    Yuan, Zhengyi
    Chen, Kai
    Yang, Shuai
    Yu, Jiangfeng
    Zhang, Hongwei
    JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (03)
  • [22] HRCNet: High-Resolution Context Extraction Network for Semantic Segmentation of Remote Sensing Images
    Xu, Zhiyong
    Zhang, Weicun
    Zhang, Tianxiang
    Li, Jiangyun
    REMOTE SENSING, 2021, 13 (01) : 1 - 23
  • [23] Multiscale Global Context Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Zeng, Qiaolin
    Zhou, Jingxiang
    Tao, Jinhua
    Chen, Liangfu
    Niu, Xuerui
    Zhang, Yumeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13
  • [24] A Spectral-Spatial Context-Boosted Network for Semantic Segmentation of Remote Sensing Images
    Li, Xin
    Yong, Xi
    Li, Tao
    Tong, Yao
    Gao, Hongmin
    Wang, Xinyuan
    Xu, Zhennan
    Fang, Yiwei
    You, Qian
    Lyu, Xin
    REMOTE SENSING, 2024, 16 (07)
  • [25] A New Instance Segmentation Model for High-Resolution Remote Sensing Images Based on Edge Processing
    Zhang, Xiaoying
    Shen, Jie
    Hu, Huaijin
    Yang, Houqun
    MATHEMATICS, 2024, 12 (18)
  • [26] An Improved Semantic Segmentation Method for Remote Sensing Images Based on Neural Network
    Jiang, Na
    Li, Jiyuan
    TRAITEMENT DU SIGNAL, 2020, 37 (02) : 271 - 278
  • [27] Semantic Segmentation for Remote Sensing Images Based on Adaptive Feature Selection Network
    Xiang, Shao
    Xie, Quangqi
    Wang, Mi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [28] Improved SegFormer Network Based Method for Semantic Segmentation of Remote Sensing Images
    Tian, Xuewei
    Wang, Jiali
    Chen, Ming
    Du, Shouqing
    Computer Engineering and Applications, 2023, 59 (08): : 217 - 226
  • [29] A Building Segmentation Network Based on Improved Spatial Pyramid in Remote Sensing Images
    Bai, Hao
    Bai, Tingzhu
    Li, Wei
    Liu, Xun
    APPLIED SCIENCES-BASEL, 2021, 11 (11):
  • [30] Negative Class Guided Spatial Consistency Network for Sparsely Supervised Semantic Segmentation of Remote Sensing Images
    Yang, Chen
    Wang, Junxiao
    Meng, Huixiao
    Yang, Shuyuan
    Feng, Zhixi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 657 - 669