MSST-Net: A Multi-Scale Adaptive Network for Building Extraction from Remote Sensing Images Based on Swin Transformer

被引:44
|
作者
Yuan, Wei [1 ,2 ]
Xu, Wenbo [3 ]
机构
[1] Chengdu Univ, Sch Architecture & Civil Engn, Chengdu 610106, Peoples R China
[2] Chengdu Univ, Inst Higher Educ Sichuan Prov, Key Lab Pattern Recognit & Intelligent Informat P, Chengdu 610106, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Resources & Environm, Chengdu 611731, Peoples R China
关键词
deep learning; remote sensing; transformer; semantic segmentation; multi-scale adaptive; SEGMENTATION;
D O I
10.3390/rs13234743
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The segmentation of remote sensing images by deep learning technology is the main method for remote sensing image interpretation. However, the segmentation model based on a convolutional neural network cannot capture the global features very well. A transformer, whose self-attention mechanism can supply each pixel with a global feature, makes up for the deficiency of the convolutional neural network. Therefore, a multi-scale adaptive segmentation network model (MSST-Net) based on a Swin Transformer is proposed in this paper. Firstly, a Swin Transformer is used as the backbone to encode the input image. Then, the feature maps of different levels are decoded separately. Thirdly, the convolution is used for fusion, so that the network can automatically learn the weight of the decoding results of each level. Finally, we adjust the channels to obtain the final prediction map by using the convolution with a kernel of 1 x 1. By comparing this with other segmentation network models on a WHU building data set, the evaluation metrics, mIoU, F1-score and accuracy are all improved. The network model proposed in this paper is a multi-scale adaptive network model that pays more attention to the global features for remote sensing segmentation.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] MDTrans: Multi-scale and dual-branch feature fusion network based on Swin Transformer for building extraction in remote sensing images
    Diao, Kuo
    Zhu, Jinlong
    Liu, Guangjie
    Li, Meng
    IET IMAGE PROCESSING, 2024, 18 (11) : 2930 - 2942
  • [2] Multi-scale Residual Network for Building Extraction from Satellite Remote Sensing Images
    Hou, Xin
    Wang, Pu
    An, Wei
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1348 - 1351
  • [3] Multi-Scale Feature Fusion Attention Network for Building Extraction in Remote Sensing Images
    Liu, Jia
    Gu, Hang
    Li, Zuhe
    Chen, Hongyang
    Chen, Hao
    ELECTRONICS, 2024, 13 (05)
  • [4] Multi-Scale Attention Network for Building Extraction from High-Resolution Remote Sensing Images
    Chang, Jing
    He, Xiaohui
    Li, Panle
    Tian, Ting
    Cheng, Xijie
    Qiao, Mengjia
    Zhou, Tao
    Zhang, Beibei
    Chang, Ziqian
    Fan, Tingwei
    SENSORS, 2024, 24 (03)
  • [5] EMAFF-Net: an enhanced multi-scale attentive feature fusion network for building extraction from VHR remote sensing images
    Vijayan, Lakshmi
    Preethy Byju, Akshara
    REMOTE SENSING LETTERS, 2024, 15 (02) : 157 - 166
  • [6] Swin-Net: A Swin-Transformer-Based Network Combing with Multi-Scale Features for Segmentation of Breast Tumor Ultrasound Images
    Zhu, Chengzhang
    Chai, Xian
    Xiao, Yalong
    Liu, Xu
    Zhang, Renmao
    Yang, Zhangzheng
    Wang, Zhiyuan
    DIAGNOSTICS, 2024, 14 (03)
  • [7] Lightweight multi-scale difference network for remote sensing building extraction
    Li G.
    Wu H.
    Dong C.
    Liu Y.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (22): : 3371 - 3382
  • [8] Multi-scale building instance refinement extraction from remote sensing images by fusing with decentralized adaptive attention mechanism
    Jiang B.
    Hang W.
    Xu S.
    Wu Y.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2023, 52 (09): : 1504 - 1514
  • [9] Building Footprint Extraction from Remote Sensing Images with Residual Attention Multi-Scale Aggregation Fully Convolutional Network
    Ahmadian, Nima
    Sedaghat, Amin
    Mohammadi, Nazila
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2024, : 2417 - 2429
  • [10] FMAM-Net: Fusion Multi-Scale Attention Mechanism Network for Building Segmentation in Remote Sensing Images
    Ye, Huanran
    Zhou, Run
    Wang, Jianhao
    Huang, Zhiliang
    IEEE ACCESS, 2022, 10 : 134241 - 134251