Hierarchical Decoder with Parallel Transformer and CNN for Medical Image Segmentation

Cited by: 0
Authors
Li, Shijie [1]
Gong, Yu [1]
Xiang, Qingyuan [1]
Li, Zheng [1, 2]
Affiliations
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Sichuan Univ, Tianfu Engn Oriented Numerical Simulat & Software, Chengdu 610207, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Medical image segmentation; Hierarchical decoder; Attention mechanism; PLUS PLUS;
DOI
10.1007/978-981-97-8496-7_10
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the success of Transformers, hybrid Transformer and CNN methods have gained considerable popularity in medical image segmentation. These methods use a hybrid architecture that combines Transformers and CNNs to fuse global and local information, supplemented by a pyramid structure to facilitate multi-scale interaction. However, they encounter two primary limitations: (i) Transformers struggle to capture complete global information due to the sliding-window nature of the convolutional operator, and (ii) the pyramid structure within a single decoder fails to provide the multi-scale interaction necessary for restoring detailed features at higher levels. In this paper, we introduce the Hierarchical Decoder with Parallel Transformer and CNN (HiPar), a novel architecture designed to address these limitations. First, we present a parallel structure of Transformer and CNN to maximize the capture of both global and local features. Second, we propose a hierarchical decoder to model multi-scale information and progressively restore spatial details. Additionally, we incorporate lightweight components to enhance the efficiency of feature representation. Extensive experiments demonstrate that HiPar achieves state-of-the-art results on three popular medical image segmentation benchmarks: Synapse, ACDC, and GlaS.
Pages: 133-147
Number of pages: 15