MBT-UNet: Multi-Branch Transform Combined with UNet for Semantic Segmentation of Remote Sensing Images

Cited by: 0
Authors
Liu, Bin [1]
Li, Bing [1]
Sreeram, Victor [2]
Li, Shuofeng [1]
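Affiliations
[1] Harbin Engineering University, College of Intelligent Systems Science and Engineering, Harbin 150001, People's Republic of China
[2] University of Western Australia, School of Electrical, Electronic and Computer Engineering, Perth 6009, Australia
Keywords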
transformer; semantic segmentation; convolutional neural network; remote sensing; NETWORK; BUILDINGS; MODEL;
DOI
10.3390/rs16152776
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Remote sensing (RS) images play an indispensable role in key fields such as environmental monitoring, precision agriculture, and urban resource management. Traditional deep convolutional neural networks (CNNs) suffer from limited receptive fields. To address this problem, this paper introduces MBT-UNet, a hybrid network model that combines the advantages of CNNs and Transformers. First, a multi-branch encoder based on the pyramid vision transformer (PVT) is proposed to capture multi-scale feature information; second, an efficient feature fusion module (FFM) is proposed to integrate features at different scales; finally, in the decoder stage, a multi-scale upsampling module (MSUM) is proposed to refine the segmentation results and improve segmentation accuracy. Experiments are conducted on the ISPRS Vaihingen, ISPRS Potsdam, LoveDA, and UAVid datasets. The results show that MBT-UNet surpasses state-of-the-art algorithms on key performance indicators, confirming its strong performance in high-precision remote sensing image segmentation tasks.
Pages: 25
Related papers
50 in total
  • [1] A Multi-Attention UNet for Semantic Segmentation in Remote Sensing Images
    Sun, Yu
    Bi, Fukun
    Gao, Yangte
    Chen, Liang
    Feng, Suting
    SYMMETRY-BASEL, 2022, 14 (05):
  • [2] Remote sensing image semantic segmentation combining UNET and FPN
    Wang Xi
    Yu Ming
    Ren Hong-e
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2021, 36 (03) : 475 - 483
  • [3] Semantic Segmentation of Hyperspectral Remote Sensing Images Based on PSE-UNet Model
    Li, Jiaju
    Wang, Hefeng
    Zhang, Anbing
    Liu, Yuliang
    SENSORS, 2022, 22 (24)
  • [4] CT-UNet: Context-Transfer-UNet for Building Segmentation in Remote Sensing Images
    Liu, Sheng
    Ye, Huanran
    Jin, Kun
    Cheng, Haohao
    NEURAL PROCESSING LETTERS, 2021, 53 (06) : 4257 - 4277
  • [6] High-resolution remote sensing images semantic segmentation using improved UNet and SegNet
    Wang, Xin
    Jing, Shihan
    Dai, Huifeng
    Shi, Aiye
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 108
  • [7] A Semantic Segmentation Method Based on AS-Unet++ for Power Remote Sensing of Images
    Nan, Guojun
    Li, Haorui
    Du, Haibo
    Liu, Zhuo
    Wang, Min
    Xu, Shuiqing
    SENSORS, 2024, 24 (01)
  • [8] Combining Swin Transformer With UNet for Remote Sensing Image Semantic Segmentation
    Fan, Lili
    Zhou, Yu
    Liu, Hongmei
    Li, Yunjie
    Cao, Dongpu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 11
  • [9] Swin Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation
    He, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Zhang, Di
    Yao, Rui
    Xue, Yong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [10] FPB-UNet++: Semantic Segmentation for Remote Sensing Images of reservoir area via Improved UNet++ with FPN
    Wang, Kaiyue
    Fan, Xiaoye
    Wang, Qi
    6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 100 - 104