Improved UNet with Attention for Medical Image Segmentation

被引:11
|
作者
AL Qurri, Ahmed [1 ]
Almekkawy, Mohamed [1 ]
机构
[1] Penn State Univ, Sch Elect Engn & Comp Sci, University Pk, PA 16802 USA
关键词
UNet; UNet plus plus; Transformer; CNN; attention; medical imaging; ultrasound; CT scan; U-NET; PLUS PLUS; ARCHITECTURE; TRANSFORMER;
D O I
10.3390/s23208589
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Medical image segmentation is crucial for medical image processing and the development of computer-aided diagnostics. In recent years, deep Convolutional Neural Networks (CNNs) have been widely adopted for medical image segmentation and have achieved significant success. UNet, which is based on CNNs, is the mainstream method used for medical image segmentation. However, its performance suffers owing to its inability to capture long-range dependencies. Transformers were initially designed for Natural Language Processing (NLP), and sequence-to-sequence applications have demonstrated the ability to capture long-range dependencies. However, their abilities to acquire local information are limited. Hybrid architectures of CNNs and Transformer, such as TransUNet, have been proposed to benefit from Transformer's long-range dependencies and CNNs' low-level details. Nevertheless, automatic medical image segmentation remains a challenging task due to factors such as blurred boundaries, the low-contrast tissue environment, and in the context of ultrasound, issues like speckle noise and attenuation. In this paper, we propose a new model that combines the strengths of both CNNs and Transformer, with network architectural improvements designed to enrich the feature representation captured by the skip connections and the decoder. To this end, we devised a new attention module called Three-Level Attention (TLA). This module is composed of an Attention Gate (AG), channel attention, and spatial normalization mechanism. The AG preserves structural information, whereas channel attention helps to model the interdependencies between channels. Spatial normalization employs the spatial coefficient of the Transformer to improve spatial attention akin to TransNorm. To further improve the skip connection and reduce the semantic gap, skip connections between the encoder and decoder were redesigned in a manner similar to that of the UNet++ dense connection. Moreover, deep supervision using a side-output channel was introduced, analogous to BASNet, which was originally used for saliency predictions. Two datasets from different modalities, a CT scan dataset and an ultrasound dataset, were used to evaluate the proposed UNet architecture. The experimental results showed that our model consistently improved the prediction performance of the UNet across different datasets.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] CSWin-UNet: Transformer UNet with cross-shaped windows for medical image segmentation
    Liu, Xiao
    Gao, Peng
    Yu, Tao
    Wang, Fei
    Yuan, Ru-Yue
    INFORMATION FUSION, 2025, 113
  • [32] VM-UNet++ research on crack image segmentation based on improved VM-UNet
    Wenliang Tang
    Ziyi Wu
    Wei Wang
    Youqin Pan
    Weihua Gan
    Scientific Reports, 15 (1)
  • [33] MAGRes-UNet: Improved Medical Image Segmentation Through a Deep Learning Paradigm of Multi-Attention Gated Residual U-Net
    Hussain, Tahir
    Shouno, Hayaru
    IEEE ACCESS, 2024, 12 : 40290 - 40310
  • [34] Attention UNet3+: a full-scale connected attention-aware UNet for CT image segmentation of liver
    Chen, Congping
    Shi, Jing
    Xu, Zhiwei
    Wang, Zhihan
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (06) : 63012
  • [35] DPA-UNet rectal cancer image segmentation based on visual attention
    Wang, Yuqian
    Ma, JianWei
    Sergey, Axyonov
    Zang, Shaofei
    Zhang, Miao
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (21):
  • [36] An attention mechanism-based lightweight UNet for musculoskeletal ultrasound image segmentation
    Zhang, Yan
    Yu, Xilong
    Hu, Qing
    Zhang, Xianlei
    Yang, Yixin
    Xiao, Han
    Medical Physics, 2025, 52 (01) : 400 - 413
  • [37] LIT-Unet: a lightweight and effective model for medical image segmentation
    Wang, Ru
    Kou, Qiqi
    Dou, Lina
    RADIOLOGICAL PHYSICS AND TECHNOLOGY, 2024, : 878 - 887
  • [38] NAS-Unet: Neural Architecture Search for Medical Image Segmentation
    Weng, Yu
    Zhou, Tianbao
    Li, Yujie
    Qiu, Xiaoyu
    IEEE ACCESS, 2019, 7 : 44247 - 44257
  • [39] GSAC-UFormer: Groupwise Self-Attention Convolutional Transformer-Based UNet for Medical Image Segmentation
    Anass Garbaz
    Yassine Oukdach
    Said Charfi
    Mohamed El Ansari
    Lahcen Koutti
    Mouna Salihoun
    Cognitive Computation, 2025, 17 (2)
  • [40] An Improved Algorithm for Medical Image Segmentation
    Huang, Ting-lei
    Bai, Xue
    SECOND INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING: WGEC 2008, PROCEEDINGS, 2008, : 289 - 292