MS-UNet: Swin Transformer U-Net with Multi-scale Nested Decoder for Medical Image Segmentation with Small Training Data

Authors
Chen, Haoyuan [1]
Han, Yufei [1]
Li, Yanyi [1]
Xu, Pin [1]
Li, Kuan [1]
Yin, Jianping [1]
Affiliation
[1] Dongguan Univ Technol, Dongguan, Peoples R China
Keywords
Medical Image Segmentation; U-Net; Swin Transformer; Multi-scale Nested Decoder;
DOI
10.1007/978-981-99-8558-6_39
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose MS-UNet, a novel U-Net model for medical image segmentation. Instead of the single-layer U-Net decoder structure used in Swin-UNet and TransUnet, we design a multi-scale nested decoder based on the Swin Transformer. The framework is motivated by the observation that the single-layer decoder structure of U-Net is too "thin" to exploit enough information, which leaves large semantic differences between the encoder and decoder parts. The problem worsens when the amount of training data is not sufficiently large, which is common in medical image processing, where annotated data are harder to obtain than in other tasks. The proposed multi-scale nested decoder structure brings the feature maps of the decoder and encoder semantically closer, enabling the network to learn more detailed features. Experimental results show that MS-UNet improves network performance through more efficient feature learning and achieves stronger results, especially in the extreme case of a small amount of training data. The code is publicly available at: https://github.com/HH446/MS-UNet.
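The contrast between a single-path decoder and a nested one can be illustrated with a small wiring sketch. This is not the authors' implementation: it assumes the UNet++-style rule, in which each decoder node concatenates all earlier nodes at its own level with the upsampled node one level deeper, and MS-UNet's exact connectivity may differ. The function names and the `depth` parameter are hypothetical, and no Swin Transformer blocks are modelled; only the graph of connections is shown.

```python
from typing import Dict, List, Tuple

# (level i, column j); column 0 holds the encoder outputs. Hypothetical naming.
Node = Tuple[int, int]


def plain_decoder_inputs(depth: int) -> Dict[Node, List[Node]]:
    """Single-path U-Net decoder: one node per level, one skip connection each."""
    inputs: Dict[Node, List[Node]] = {}
    for j in range(1, depth):
        i = depth - 1 - j
        # Same-level encoder feature plus the upsampled deeper node.
        inputs[(i, j)] = [(i, 0), (i + 1, j - 1)]
    return inputs


def nested_decoder_inputs(depth: int) -> Dict[Node, List[Node]]:
    """Nested decoder under the assumed UNet++-style rule.

    Node X(i, j), j >= 1, consumes X(i, 0..j-1) from its own level
    and the upsampled node X(i+1, j-1) from one level deeper.
    """
    inputs: Dict[Node, List[Node]] = {}
    for j in range(1, depth):
        for i in range(depth - j):
            same_level = [(i, k) for k in range(j)]
            inputs[(i, j)] = same_level + [(i + 1, j - 1)]
    return inputs


if __name__ == "__main__":
    depth = 4  # illustrative value, not taken from the paper
    plain = plain_decoder_inputs(depth)
    nested = nested_decoder_inputs(depth)
    print("plain decoder nodes: ", len(plain))
    print("nested decoder nodes:", len(nested))
    print("inputs of final node:", nested[(0, depth - 1)])
```

Under this assumption, the nested layout adds intermediate nodes between each encoder feature and the final decoder output, which is one way to read the abstract's claim that the decoder and encoder features become semantically closer.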
Pages: 472-483
Number of pages: 12