UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network

被引:5
|
作者
Fang, Kun [1 ,2 ]
He, Baochun [2 ]
Liu, Libo [2 ]
Hu, Haoyu [3 ]
Fang, Chihua [3 ,4 ]
Huang, Xuguang [1 ]
Jia, Fucang [2 ,4 ,5 ]
机构
[1] South China Normal Univ, Sch Informat & Optoelect Sci & Engn, Guangzhou, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Res Ctr Med Artificial Intelligence, Shenzhen, Peoples R China
[3] Southern Med Univ, Dept Hepatobiliary Surg 1, Zhujiang Hosp, Guangzhou, Peoples R China
[4] Pazhou Lab, Guangzhou, Peoples R China
[5] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Minimally Invas Surg Robot & Sys, Shenzhen, Peoples R China
关键词
Pancreas; image segmentation; transformer; deep learning; U-Net;
D O I
10.21037/qims-22-544
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules which help to extract long-range information and encode global context into convolutional representations, and there is a lack of investigation on how to optimally combine self-attention with convolution. Methods: We designed a novel transformer block (MRFormer) that combines a multi-head self-attention layer and a residual depthwise convolutional block as the basic unit to deeply integrate both long-range and local spatial information. The MRFormer block was embedded between the encoder and decoder in U-Net at the last two layers. This framework (UMRFormer-Net) was applied to the segmentation of threedimensional (3D) pancreas, and its ability to effectively capture the characteristic contextual information of the pancreas and surrounding tissues was investigated. Results: Experimental results show that the proposed UMRFormer-Net achieved accuracy in pancreas segmentation that was comparable or superior to that of existing state-of-the-art 3D methods in both the Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma (CPTAC-PDA) dataset and the public Medical Segmentation Decathlon dataset (self-division). UMRFormer-Net statistically significantly outperformed existing transformer-related methods and state-of-the-art 3D methods (P< 0.05, P<0.01, or P< 0.001), with a higher Dice coefficient ( 85.54% and 77.36%, respectively) or a lower 95% Hausdorff distance (4.05 and 8.34 mm, respectively). Conclusions: UMRFormer-Net can obtain more matched and accurate segmentation boundary and region information in pancreas segmentation, thus improving the accuracy of pancreas segmentation. The code is available at https://github.com/supersunshinefk/UMRFormer-Net.
引用
收藏
页码:1619 / 1630
页数:12
相关论文
共 50 条
  • [21] Hypersingular meshless method using double-layer potentials for three-dimensional exterior acoustic problems
    Young, D. L.
    Chen, K. H.
    Liu, T. Y.
    Wu, C. S.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (01): : 529 - 540
  • [22] CCT-Unet: A U-Shaped Network Based on Convolution Coupled Transformer for Segmentation of Peripheral and Transition Zones in Prostate MRI
    Yan, Yifei
    Liu, Rongzong
    Chen, Haobo
    Zhang, Limin
    Zhang, Qi
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (09) : 4341 - 4351
  • [23] Automatic segmentation of head and neck organs at risk based on three-dimensional U-NET deep convolutional neural network
    Dai, Xiangkun
    Wang, Xiaoshen
    Du, Lehui
    Ma, Na
    Xu, Shouping
    Cai, Boning
    Wang, Shuxin
    Wang, Zhonguo
    Qu, Baolin
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2020, 37 (01): : 136 - 141
  • [24] DAC-Net: A light-weight U-shaped network based efficient convolution and attention for thyroid nodule segmentation
    Yang, Yingwei
    Huang, Haiguang
    Shao, Yingsheng
    Chen, Beilei
    Computers in Biology and Medicine, 2024, 180
  • [25] Double-layer Photonic Devices Based on Transfer Printing of Silicon Nanomembranes for Three-dimensional Photonics
    Zhang, Yang
    Carlson, Andrew
    Yang, Sang Y.
    Hosseini, Amir
    Kwong, David
    Rogers, John A.
    Chen, Ray T.
    2012 CONFERENCE ON LASERS AND ELECTRO-OPTICS (CLEO), 2012,
  • [26] A Novel Method Combining U-Net with LSTM for Three-Dimensional Soil Pore Segmentation Based on Computed Tomography Images
    Liu, Lei
    Han, Qiaoling
    Zhao, Yue
    Zhao, Yandong
    APPLIED SCIENCES-BASEL, 2024, 14 (08):
  • [27] DUDA-Net: a double U-shaped dilated attention network for automatic infection area segmentation in COVID-19 lung CT images
    Xie, Feng
    Huang, Zheng
    Shi, Zhengjin
    Wang, Tianyu
    Song, Guoli
    Wang, Bolun
    Liu, Zihong
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2021, 16 (09) : 1425 - 1434
  • [28] DUDA-Net: a double U-shaped dilated attention network for automatic infection area segmentation in COVID-19 lung CT images
    Feng Xie
    Zheng Huang
    Zhengjin Shi
    Tianyu Wang
    Guoli Song
    Bolun Wang
    Zihong Liu
    International Journal of Computer Assisted Radiology and Surgery, 2021, 16 : 1425 - 1434
  • [29] An Improved Generative Adversarial Network-Based and U-Shaped Transformer Method for Glass Curtain Crack Deblurring Using UAVs
    Huang, Jiaxi
    Liu, Guixiong
    SENSORS, 2024, 24 (23)
  • [30] Enhanced three-dimensional U-Net with graph-based refining for segmentation of gastrointestinal stromal tumours
    Wang, Qiong
    Li, Zhipeng
    Zhao, Wanqing
    Wu, Hao
    Xie, Fei
    Guan, Ziyu
    Zhao, Wei
    IET COMPUTER VISION, 2021, 15 (08) : 549 - 560