Enhanced transformer encoder and hybrid cascaded upsampler for medical image segmentation

被引:0
|
作者
Li, Chaoqun [1 ]
Wang, Liejun [1 ]
Cheng, Shuli [1 ]
机构
[1] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830046, Xinjiang, Peoples R China
关键词
Medical image segmentation; Convolution neural network; Enhanced transformer; Hybrid cascaded upsampler; NETWORK; NET;
D O I
10.1016/j.eswa.2023.121965
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
UNet has been highly successful in various medical image segmentation tasks, but the restricted field of perception of convolutional operations has led to the lack of UNet's ability to explicitly model global context information. Vision Transformer captures global relevance through self-attention (SA), thus alleviating the problem of perceived wild locality in convolution neural network (CNN) architectures. However, traditional Transformer typically by means of SA with high computational complexity, and the fusion mechanism is static MLP mode, which is not efficient enough. In addition, the current segmentation methods usually perform simple feature fusion on the decoder side of the U-shaped architecture, which cannot meet the potential demand for important features when generating predictive maps. To solve these problems, we propose the E-TUNet network. On the one hand, we designed the Enhanced Transformer as the encoder by introducing EMSA and DynaMixer MLP. The Enhanced Transformer has high computational efficiency and dynamic mixing weights, which alleviates the problem of single static fusion mechanism. On the other hand, we introduce G-L MLP block with global-local space interaction capability to form hybrid cascaded upsampler for importance computation and matching of decoder side features. The hybrid cascaded upsampler has stronger information representation capabilities and effectively combines CNN and MLP to capture local and global dependencies. We demonstrate the effectiveness of our E-TUNet on two different public available datasets. Extensive experiments have shown that our method is highly competitive compared to other methods. In particular, on publicly available datasets (Synapse and ACDC), the mean DSC (%) is 82.15 and 91.12, respectively. HD95 (mm) is 17.89 on the Synapse dataset. E-TUNet has achieved significant performance improvement in multi-organ segmentation tasks, reaching a advanced level.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Combining frequency transformer and CNNs for medical image segmentation
    Labbihi, Ismayl
    El Meslouhi, Othmane
    Benaddy, Mohamed
    Kardouchi, Mustapha
    Akhloufi, Moulay
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 21197 - 21212
  • [42] Transformer with progressive sampling for medical cellular image segmentation
    Jiang, Shen
    Li, Jinjiang
    Hua, Zhen
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (12) : 12104 - 12126
  • [43] LiteTrans: Reconstruct Transformer with Convolution for Medical Image Segmentation
    Xu, Shuying
    Quan, Hongyan
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 300 - 313
  • [44] A parallelly contextual convolutional transformer for medical image segmentation
    Feng, Yuncong
    Su, Jianyu
    Zheng, Jian
    Zheng, Yupeng
    Zhang, Xiaoli
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 98
  • [45] CONVFORMER: COMBINING CNN AND TRANSFORMER FOR MEDICAL IMAGE SEGMENTATION
    Gu, Pengfei
    Zhang, Yejia
    Wang, Chaoli
    Chen, Danny Z.
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [46] Dense deep transformer for medical image segmentation: DDTraMIS
    Abhilasha Joshi
    K. K. Sharma
    Multimedia Tools and Applications, 2024, 83 : 18073 - 18089
  • [47] DHT: Deformable Hybrid Transformer for Aerial Image Segmentation
    Zhang, Yan
    Gao, Xiyuan
    Duan, Qingyan
    Yuan, Lin
    Gao, Xinbo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [48] UCTNet: Uncertainty-guided CNN-Transformer hybrid networks for medical image segmentation
    Guo, Xiayu
    Lin, Xian
    Yang, Xin
    Yu, Li
    Cheng, Kwang-Ting
    Yan, Zengqiang
    PATTERN RECOGNITION, 2024, 152
  • [49] Few-Shot Medical Image Segmentation via a Region-Enhanced Prototypical Transformer
    Zhu, Yazhou
    Wang, Shidong
    Xin, Tong
    Zhang, Haofeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 271 - 280
  • [50] Semhybridnet: a semantically enhanced hybrid CNN-transformer network for radar pulse image segmentation
    Liu, Hongjia
    Xiao, Yubin
    Wu, Xuan
    Li, Yuanshu
    Zhao, Peng
    Liang, Yanchun
    Wang, Liupu
    Zhou, You
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 2851 - 2868