Enhanced transformer encoder and hybrid cascaded upsampler for medical image segmentation

被引:0
|
作者
Li, Chaoqun [1 ]
Wang, Liejun [1 ]
Cheng, Shuli [1 ]
机构
[1] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830046, Xinjiang, Peoples R China
关键词
Medical image segmentation; Convolution neural network; Enhanced transformer; Hybrid cascaded upsampler; NETWORK; NET;
D O I
10.1016/j.eswa.2023.121965
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
UNet has been highly successful in various medical image segmentation tasks, but the restricted field of perception of convolutional operations has led to the lack of UNet's ability to explicitly model global context information. Vision Transformer captures global relevance through self-attention (SA), thus alleviating the problem of perceived wild locality in convolution neural network (CNN) architectures. However, traditional Transformer typically by means of SA with high computational complexity, and the fusion mechanism is static MLP mode, which is not efficient enough. In addition, the current segmentation methods usually perform simple feature fusion on the decoder side of the U-shaped architecture, which cannot meet the potential demand for important features when generating predictive maps. To solve these problems, we propose the E-TUNet network. On the one hand, we designed the Enhanced Transformer as the encoder by introducing EMSA and DynaMixer MLP. The Enhanced Transformer has high computational efficiency and dynamic mixing weights, which alleviates the problem of single static fusion mechanism. On the other hand, we introduce G-L MLP block with global-local space interaction capability to form hybrid cascaded upsampler for importance computation and matching of decoder side features. The hybrid cascaded upsampler has stronger information representation capabilities and effectively combines CNN and MLP to capture local and global dependencies. We demonstrate the effectiveness of our E-TUNet on two different public available datasets. Extensive experiments have shown that our method is highly competitive compared to other methods. In particular, on publicly available datasets (Synapse and ACDC), the mean DSC (%) is 82.15 and 91.12, respectively. HD95 (mm) is 17.89 on the Synapse dataset. E-TUNet has achieved significant performance improvement in multi-organ segmentation tasks, reaching a advanced level.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] A study of attention information from transformer layers in hybrid medical image segmentation networks
    Hasany, Syed Nouman
    Petitjean, Caroline
    Meriaudeau, Fabrice
    MEDICAL IMAGING 2023, 2023, 12464
  • [32] Hybrid framework for medical image segmentation
    Jiang, CY
    Zhang, XH
    Meinel, C
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2005, 3691 : 264 - 271
  • [33] MEDICAL IMAGE SEGMENTATION WITH HYBRID METHODS
    Liu, Lui
    Rong, Zebo
    Wang, Yue
    Wu, Zhiyong
    Yan, Fusheng
    INDIAN JOURNAL OF PHARMACEUTICAL SCIENCES, 2018, 80 (01) : 37 - 37
  • [34] A Hybrid Technique for Medical Image Segmentation
    Nyma, Alamgir
    Kang, Myeongsu
    Kwon, Yung-Keun
    Kim, Cheol-Hong
    Kim, Jong-Myon
    JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2012,
  • [35] VISION GRAPH U-NET: GEOMETRIC LEARNING ENHANCED ENCODER FOR MEDICAL IMAGE SEGMENTATION AND RESTORATION
    Jiang, Yuanhong
    Ding, Qiaoqiao
    Wang, Yu Guang
    Lio, Pietro
    Zhang, Xiaoqun
    INVERSE PROBLEMS AND IMAGING, 2024, 18 (03) : 672 - 689
  • [36] HMT-Net: Transformer and MLP Hybrid Encoder for Skin Disease Segmentation
    Yang, Sen
    Wang, Liejun
    SENSORS, 2023, 23 (06)
  • [37] Medical Image Segmentation Based on Transformer and HarDNet Structures
    Shen, Tongping
    Xu, Huanqing
    IEEE ACCESS, 2023, 11 : 16621 - 16630
  • [38] Dense deep transformer for medical image segmentation: DDTraMIS
    Joshi, Abhilasha
    Sharma, K. K.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) : 18073 - 18089
  • [39] Advancements in medical image segmentation: A review of transformer models
    Kumar, S.S.
    Computers and Electrical Engineering, 2025, 123
  • [40] Combining frequency transformer and CNNs for medical image segmentation
    Ismayl Labbihi
    Othmane El Meslouhi
    Mohamed Benaddy
    Mustapha Kardouchi
    Moulay Akhloufi
    Multimedia Tools and Applications, 2024, 83 : 21197 - 21212