SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation

被引:0
|
作者
Yu, Bin [1 ]
Zhou, Quan [1 ]
Zhang, Xuming [1 ]
机构
[1] Huazhong Univ Sci & Technol, Dept Biomed Engn, Coll Life Sci & Technol, Wuhan, Peoples R China
来源
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Deep learning; 3D Medical image segmentation; Transformer;
D O I
10.1007/978-981-99-8850-1_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Medical image segmentation is crucial for lesion localization and surgical navigation. Recent advancements in medical image segmentation have been driven by Convolutional Neural Networks (CNNs) and Transformers. However, CNNs have limitations in capturing long-range dependencies due to their weight sharing and localized receptive fields, posing challenges in handling varying organ shapes. While Transformers offer an alternative with global receptive fields, their spatial and computational complexity is particularly high, especially for 3D medical images. To address this issue, we propose a novel series-parallel network that combines convolution and self-attention for 3D medical image segmentation. We utilize a serial 3D CNN as the encoder to extract multi-level feature maps, which are fused via a feature pyramid network. Subsequently, we adopt four parallel Transformer branches to capture global features. To efficiently model long-range information, we introduce patch self-attention, which divides the input into non-overlapping patches and computes attention between corresponding pixels across patches. Experimental evaluations on 3D MRI prostate and left atrial segmentation tasks confirm the superior performance of our network compared to other CNN and Transformer-based networks. Notably, our method achieves higher segmentation accuracy and faster inference speed.
引用
收藏
页码:376 / 387
页数:12
相关论文
共 50 条
  • [41] Abstract: 3D Medical Image Segmentation with Transformer-based Scaling of ConvNets MedNeXt
    Roy, Saikat
    Koehler, Gregor
    Baumgartner, Michael
    Ulrich, Constantin
    Isensee, Fabian
    Jaeger, Paul F.
    Maier-Hein, Klaus
    BILDVERARBEITUNG FUR DIE MEDIZIN 2024, 2024, : 79 - 79
  • [42] 3D bi-directional transformer U-Net for medical image segmentation
    Fu, Xiyao
    Sun, Zhexian
    Tang, Haoteng
    Zou, Eric M.
    Huang, Heng
    Wang, Yong
    Zhan, Liang
    FRONTIERS IN BIG DATA, 2023, 5
  • [43] HCA-former: Hybrid Convolution Attention Transformer for 3D Medical Image Segmentation
    Yang, Fan
    Wang, Fan
    Dong, Pengwei
    Wang, Bo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 90
  • [44] 3D Swin Transformer for Partial Medical Auto Segmentation
    Rangnekar, Aneesh
    Jiang, Jue
    Veeraraghavan, Harini
    FAST, LOW-RESOURCE, AND ACCURATE ORGAN AND PAN-CANCER SEGMENTATION IN ABDOMEN CT, FLARE 2023, 2024, 14544 : 222 - 235
  • [45] Dynamic Linear Transformer for 3D Biomedical Image Segmentation
    Zhang, Zheyuan
    Bagci, Ulas
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2022, 2022, 13583 : 171 - 180
  • [46] EFFICIENT 3D TRANSFORMER WITH CLUSTER-BASED DOMAIN-ADVERSARIAL LEARNING FOR 3D MEDICAL IMAGE SEGMENTATION
    Zhang, Haoran
    Chen, Hao
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [47] CoTrFuse: a novel framework by fusing CNN and transformer for medical image segmentation
    Chen, Yuanbin
    Wang, Tao
    Tang, Hui
    Zhao, Longxuan
    Zhang, Xinlin
    Tan, Tao
    Gao, Qinquan
    Du, Min
    Tong, Tong
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (17):
  • [48] Automated multi-modal Transformer network (AMTNet) for 3D medical images segmentation
    Zheng, Shenhai
    Tan, Jiaxin
    Jiang, Chuangbo
    Li, Laquan
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (02):
  • [49] MixFormer: A Mixed CNN-Transformer Backbone for Medical Image Segmentation
    Liu, Jun
    Li, Kunqi
    Huang, Chun
    Dong, Hua
    Song, Yusheng
    Li, Rihui
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [50] A hybrid framework for 3D medical image segmentation
    Chen, T
    Metaxas, D
    MEDICAL IMAGE ANALYSIS, 2005, 9 (06) : 547 - 565