MS-UNet: Swin Transformer U-Net with Multi-scale Nested Decoder for Medical Image Segmentation with Small Training Data

被引:0
|
作者
Chen, Haoyuan [1 ]
Han, Yufei [1 ]
Li, Yanyi [1 ]
Xu, Pin [1 ]
Li, Kuan [1 ]
Yin, Jianping [1 ]
机构
[1] Dongguan Univ Technol, Dongguan, Peoples R China
关键词
Medical Image Segmentation; U-Net; Swin Transformer; Multi-scale Nested Decoder;
D O I
10.1007/978-981-99-8558-6_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel U-Net model named MS-UNet for the medical image segmentation task in this study. Instead of the single-layer U-Net decoder structure used in Swin-UNet and TransUnet, we specifically design a multi-scale nested decoder based on the Swin Transformer for U-Net. The new framework is proposed based on the observation that the single-layer decoder structure of U-Net is too "thin" to exploit enough information, resulting in large semantic differences between the encoder and decoder parts. Things get worse if the number of training sets of data is not sufficiently large, which is common in medical image processing tasks where annotated data are more difficult to obtain than other tasks. Overall, the proposed multi-scale nested decoder structure allows the feature mapping between the decoder and encoder to be semantically closer, thus enabling the network to learn more detailed features. Experiment results show that MS-UNet could effectively improve the network performance with more efficient feature learning capability and exhibit more advanced performance, especially in the extreme case with a small amount of training data. The code is publicly available at: https:// github.com/HH446/MS- UNet.
引用
下载
收藏
页码:472 / 483
页数:12
相关论文
共 50 条
  • [41] Training ESRGAN with multi-scale attention U-Net discriminator
    Quan Chen
    Hao Li
    Gehao Lu
    Scientific Reports, 14 (1)
  • [42] Deep Multi-Scale U-Net Architecture and Label-Noise Robust Training Strategies for Histopathological Image Segmentation
    Kurian, Nikhil Cherian
    Lohan, Amit
    Verghese, Gregory
    Dharamshi, Nimish
    Meena, Swati
    Li, Mengyuan
    Liu, Fangfang
    Gillet, Cheryl
    Rane, Swapnil
    Grigoriadis, Anita
    Sethi, Amit
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE 2022), 2022, : 91 - 96
  • [43] Cross Pyramid Transformer makes U-net stronger in medical image segmentation
    Zhu, Jinghua
    Sheng, Yue
    Cui, Hui
    Ma, Jiquan
    Wang, Jijian
    Xi, Heran
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [44] Multiscale transunet +  + : dense hybrid U-Net with transformer for medical image segmentation
    Bo Wang
    ·Fan Wang
    Pengwei Dong
    ·Chongyi Li
    Signal, Image and Video Processing, 2022, 16 : 1607 - 1614
  • [45] MFLUnet: multi-scale fusion lightweight Unet for medical image segmentation
    Cao, Dianlei
    Zhang, Rui
    Zhang, Yunfeng
    BIOMEDICAL OPTICS EXPRESS, 2024, 15 (10): : 5574 - 5591
  • [46] An Automatic Nuclei Image Segmentation Based on Multi-Scale Split-Attention U-Net
    Xu, Qing
    Duan, Wenting
    MICCAI WORKSHOP ON COMPUTATIONAL PATHOLOGY, VOL 156, 2021, 156 : 236 - 245
  • [47] Multi-Scale Semantic Segmentation for Fire Smoke Image Based on Global Information and U-Net
    Zheng, Yuanpan
    Wang, Zhenyu
    Xu, Boyang
    Niu, Yiqing
    ELECTRONICS, 2022, 11 (17)
  • [48] Tumor Segmentation Based on Deeply Supervised Multi-Scale U-Net
    Wang, Lei
    Wang, Bo
    Xu, Zhenghua
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 746 - 749
  • [49] A New Segmentation Method to Preserve the Underlying Image Features: U-Net with Multi-scale Pooling
    Liu, Yuming
    Chen, Gongping
    Dai, Yu
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 1718 - 1722
  • [50] Latent fingerprint segmentation using multi-scale attention U-Net
    Akhila, P.
    Koolagudi, Shashidhar G.
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2024, 16 (02) : 195 - 215