UNETR plus plus : Delving Into Efficient and Accurate 3D Medical Image Segmentation

被引:4
|
作者
Shaker, Abdelrahman [1 ]
Maaz, Muhammad [1 ]
Rasheed, Hanoona [1 ]
Khan, Salman [1 ]
Yang, Ming-Hsuan [2 ,3 ,4 ]
Khan, Fahad Shahbaz [5 ,6 ]
机构
[1] Mohamed Bin Zayed Univ Artificial Intelligence, Comp Vis Dept, Abu Dhabi, U Arab Emirates
[2] Univ Calif Merced, Elect Engn & Comp Sci Dept, Merced, CA 95343 USA
[3] Yonsei Univ, Coll Comp, Seoul 03722, South Korea
[4] Google, Mountain View, CA 95344 USA
[5] Mohamed Bin Zayed Univ, Abu Dhabi, U Arab Emirates
[6] Linkoping Univ, Elect Engn Dept, S-58183 Linkoping, Sweden
关键词
Image segmentation; Three-dimensional displays; Transformers; Biomedical imaging; Complexity theory; Graphics processing units; Task analysis; Deep learning; efficient attention; hybrid architecture; medical image segmentation; TRANSFORMER;
D O I
10.1109/TMI.2024.3398728
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Owing to the success of transformer models, recent works study their applicability in 3D medical segmentation tasks. Within the transformer models, the self-attention mechanism is one of the main building blocks that strives to capture long-range dependencies, compared to the local convolutional-based design. However, the self-attention operation has quadratic complexity which proves to be a computational bottleneck, especially in volumetric medical imaging, where the inputs are 3D with numerous slices. In this paper, we propose a 3D medical image segmentation approach, named UNETR++, that offers both high-quality segmentation masks as well as efficiency in terms of parameters, compute cost, and inference speed. The core of our design is the introduction of a novel efficient paired attention (EPA) block that efficiently learns spatial and channel-wise discriminative features using a pair of inter-dependent branches based on spatial and channel attention. Our spatial attention formulation is efficient and has linear complexity with respect to the input. To enable communication between spatial and channel-focused branches, we share the weights of query and key mapping functions that provide a complimentary benefit (paired attention), while also reducing the complexity. Our extensive evaluations on five benchmarks, Synapse, BTCV, ACDC, BraTS, and Decathlon-Lung, reveal the effectiveness of our contributions in terms of both efficiency and accuracy. On Synapse, our UNETR++ sets a new state-of-the-art with a Dice Score of 87.2%, while significantly reducing parameters and FLOPs by over 71%, compared to the best method in the literature. Our code and models are available at: https://tinyurl.com/2p87x5xn.
引用
收藏
页码:3377 / 3390
页数:14
相关论文
共 50 条
  • [11] 3D medical image segmentation technique
    El-said, Shaimaa Ahmed
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2015, 17 (03) : 232 - 251
  • [12] 3D Multiple-Contextual ROI-Attention Network for Efficient and Accurate Volumetric Medical Image Segmentation
    Li, He
    Iwamoto, Yutaro
    Han, Xianhua
    Lin, Lanfen
    Furukawa, Akira
    Kanasaki, Shuzo
    Chen, Yen-Wei
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (05) : 1027 - 1037
  • [13] Small Convolutional Neural Networks for Efficient 3D Medical Image Segmentation
    Celaya, A.
    Actor, J.
    Muthusivarajan, R.
    Gates, E.
    Chung, C.
    Schellingerhout, D.
    Riviere, B.
    Fuentes, D.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [14] Efficient 3D Deep Learning Model for Medical Image Semantic Segmentation
    Alalwan, Nasser
    Abozeid, Amr
    ElHabshy, AbdAllah A.
    Alzahrani, Ahmed
    ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (01) : 1231 - 1239
  • [15] SUNet plus plus : A Deep Network with Channel Attention for Small-Scale Object Segmentation on 3D Medical Images
    Zhang, Lan
    Zhang, Kejia
    Pan, Haiwei
    TSINGHUA SCIENCE AND TECHNOLOGY, 2023, 28 (04): : 628 - 638
  • [16] α-MeanShift plus plus : Improving MeanShift plus plus for Image Segmentation
    Park, Hanhoon
    IEEE ACCESS, 2021, 9 : 131430 - 131439
  • [17] GenU-Net plus plus : An Automatic Intracranial Brain Tumors Segmentation Algorithm on 3D Image Series with High Performance
    Zhang, Yan
    Liu, Xi
    Wa, Shiyun
    Liu, Yutong
    Kang, Jiali
    Lv, Chunli
    SYMMETRY-BASEL, 2021, 13 (12):
  • [18] Efficient 3D medical image segmentation algorithm over a secured multimedia network
    Shadi Al-Zu’bi
    Bilal Hawashin
    Ala Mughaid
    Thar Baker
    Multimedia Tools and Applications, 2021, 80 : 16887 - 16905
  • [19] Efficient 3D medical image segmentation algorithm over a secured multimedia network
    Al-Zu'bi, Shadi
    Hawashin, Bilal
    Mughaid, Ala
    Baker, Thar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (11) : 16887 - 16905
  • [20] A hybrid framework for 3D medical image segmentation
    Chen, T
    Metaxas, D
    MEDICAL IMAGE ANALYSIS, 2005, 9 (06) : 547 - 565