MS-UNet: Swin Transformer U-Net with Multi-scale Nested Decoder for Medical Image Segmentation with Small Training Data

被引:0
|
作者
Chen, Haoyuan [1 ]
Han, Yufei [1 ]
Li, Yanyi [1 ]
Xu, Pin [1 ]
Li, Kuan [1 ]
Yin, Jianping [1 ]
机构
[1] Dongguan Univ Technol, Dongguan, Peoples R China
关键词
Medical Image Segmentation; U-Net; Swin Transformer; Multi-scale Nested Decoder;
D O I
10.1007/978-981-99-8558-6_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel U-Net model named MS-UNet for the medical image segmentation task in this study. Instead of the single-layer U-Net decoder structure used in Swin-UNet and TransUnet, we specifically design a multi-scale nested decoder based on the Swin Transformer for U-Net. The new framework is proposed based on the observation that the single-layer decoder structure of U-Net is too "thin" to exploit enough information, resulting in large semantic differences between the encoder and decoder parts. Things get worse if the number of training sets of data is not sufficiently large, which is common in medical image processing tasks where annotated data are more difficult to obtain than other tasks. Overall, the proposed multi-scale nested decoder structure allows the feature mapping between the decoder and encoder to be semantically closer, thus enabling the network to learn more detailed features. Experiment results show that MS-UNet could effectively improve the network performance with more efficient feature learning capability and exhibit more advanced performance, especially in the extreme case with a small amount of training data. The code is publicly available at: https:// github.com/HH446/MS- UNet.
引用
下载
收藏
页码:472 / 483
页数:12
相关论文
共 50 条
  • [31] A multi-scale large kernel attention with U-Net for medical image registration
    Chen, Yilin
    Hu, Xin
    Lu, Tao
    Zou, Lu
    Liao, Xiangyun
    Journal of Supercomputing, 2025, 81 (01):
  • [32] U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
    Petit, Olivier
    Thome, Nicolas
    Rambour, Clement
    Themyr, Loic
    Collins, Toby
    Soler, Luc
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 267 - 276
  • [33] Multi-Scale Fusion U-Net for the Segmentation of Breast Lesions
    Li, Jingyao
    Cheng, Lianglun
    Xia, Tingjian
    Ni, Haomin
    Li, Jiao
    IEEE ACCESS, 2021, 9 : 137125 - 137139
  • [34] Lightweight Multi-Scale Dilated U-Net for Crop Disease Leaf Image Segmentation
    Xu, Cong
    Yu, Changqing
    Zhang, Shanwen
    ELECTRONICS, 2022, 11 (23)
  • [35] Residual-Attention UNet plus plus : A Nested Residual-Attention U-Net for Medical Image Segmentation
    Li, Zan
    Zhang, Hong
    Li, Zhengzhen
    Ren, Zuyue
    APPLIED SCIENCES-BASEL, 2022, 12 (14):
  • [36] MSRD-Unet: Multiscale Residual Dilated U-Net for Medical Image Segmentation
    Khalaf, Muna
    Dhannoon, Ban N.
    BAGHDAD SCIENCE JOURNAL, 2022, 19 (06) : 1603 - 1611
  • [37] Dilated-UNet: A Fast and Accurate Medical Image Segmentation Approach using a Dilated Transformer and U-Net Architecture
    Saadati, Davoud
    Manzari, Omid Nejati
    Mirzakuchaki, Sattar
    arXiv, 2023,
  • [38] GA-UNet: A Lightweight Ghost and Attention U-Net for Medical Image Segmentation
    Pang, Bo
    Chen, Lianghong
    Tao, Qingchuan
    Wang, Enhui
    Yu, Yanmei
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (04): : 1874 - 1888
  • [39] DCU-Net: Multi-scale U-Net for brain tumor segmentation
    Yang, Tiejun
    Zhou, Yudan
    Li, Lei
    Zhu, Chunhua
    JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2020, 28 (04) : 709 - 726
  • [40] MULTI-SCALE ATTENTION BASED TRANSFORMER U-NET FOR CHANGE DETECTION
    Chen, Hengzhi
    Wu, Xiaofeng
    Zeng, Shan
    Wang, Zhiyong
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1067 - 1070