Remote sensing image instance segmentation network with transformer and multi-scale feature representation

被引:9
|
作者
Ye, Wenhui [1 ]
Zhang, Wei [1 ]
Lei, Weimin [1 ]
Zhang, Wenchao [2 ]
Chen, Xinyi [1 ]
Wang, Yanwen [1 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[2] Xian Univ Sci & Technol, Coll Comp Sci & Technol, Xian 710054, Peoples R China
关键词
Remote sensing image; Instance segmentation; Region-based; Long-range dependencies;
D O I
10.1016/j.eswa.2023.121007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of remote sensing image (RSI) instance segmentation is to perform instance-level semantic parsing of its contents. Aside from classifying and locating regions of interest (RoI), it also requires assigning finer pixel -wise annotations to objects. However, RSI often suffers from cluttered backgrounds, variable object scales, and complex object edge contours, making the instance segmentation task more challenging. In this work, we analytically customize an instance segmentation model that is more suitable for RSI. Specifically, we propose three novel modules for a region-based instance segmentation framework, namely Channel-Spatial Attention Module (CSA), Multi-Scale Aware Module (MSA), and Semantic Relation Learning Module (SRL). Among them, feature calibration performed by CSA can alleviate the semantic gap between low-level features and high-level semantics in both channel and spatial dimensions. Inheriting the capabilities of both the convolutional neural network (CNN) and the Transformer, SRL can help the network integrate both neighborhood features and long-range dependencies for instance semantic prediction. The MSA module designs a cascaded residual structure with different receptive fields to model the scale variation of objects in RSI. Experimental results on challenging ISAID, NWPU VHR-10, SSDD, BITTC and HRSID datasets demonstrate the superiority of our method, achieving mask APs of 40.2%, 68.2%, 68.4%, 50.4% and 55.8% respectively. Code and pretrained models are available at https://github.com/Sherlock1018/RSIISN.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Remote sensing image semantic segmentation network based on multi-scale feature enhancement fusion
    Wang, Feiting
    Zhang, Yuan
    Hu, Qiongqiong
    Zhu, Yu
    [J]. GEOCARTO INTERNATIONAL, 2024, 39 (01)
  • [2] Multi-scale network for remote sensing segmentation
    Wang, Gaihua
    Zhai, Qianyu
    Lin, Jinheng
    [J]. IET IMAGE PROCESSING, 2022, 16 (06) : 1742 - 1751
  • [3] Feature ensemble network for medical image segmentation with multi-scale atrous transformer
    Gai, Di
    Geng, Yuhan
    Huang, Xia
    Huang, Zheng
    Xiong, Xin
    Zhou, Ruihua
    Wang, Qi
    [J]. IET IMAGE PROCESSING, 2024, 18 (11) : 3082 - 3092
  • [4] Deep Multi-Scale Transformer for Remote Sensing Image Restoration
    Li, Yanting
    [J]. 2024 5TH INTERNATIONAL CONFERENCE ON GEOLOGY, MAPPING AND REMOTE SENSING, ICGMRS 2024, 2024, : 138 - 142
  • [5] Multi-scale Adaptive Feature Fusion Network for Semantic Segmentation in Remote Sensing Images
    Shang, Ronghua
    Zhang, Jiyu
    Jiao, Licheng
    Li, Yangyang
    Marturi, Naresh
    Stolkin, Rustam
    [J]. REMOTE SENSING, 2020, 12 (05)
  • [6] MFATNet: Multi-Scale Feature Aggregation via Transformer for Remote Sensing Image Change Detection
    Mao, Zan
    Tong, Xinyu
    Luo, Ze
    Zhang, Honghai
    [J]. REMOTE SENSING, 2022, 14 (21)
  • [7] Multi-scale segmentation of the high resolution remote sensing image
    Zhong, C
    Zhao, ZM
    Yan, DM
    Chen, RX
    [J]. IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 3682 - 3684
  • [8] Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images
    Liu, Ye
    Li, Huifang
    Hu, Chao
    Luo, Shuang
    Luo, Yan
    Chen, Chang Wen
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [9] Topological Equivalence in Multi-Scale Remote Sensing Image Segmentation
    Liu, Xiaoli
    Zhu, Guobin
    Li, Jinggang
    Li, Xue
    [J]. SENSORS, MECHATRONICS AND AUTOMATION, 2014, 511-512 : 510 - +
  • [10] Transformer-based multi-scale feature fusion network for remote sensing change detection
    Liang, Shike
    Hua, Zhen
    Li, Jinjiang
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)