UNETR plus plus : Delving Into Efficient and Accurate 3D Medical Image Segmentation

被引:4
|
作者
Shaker, Abdelrahman [1 ]
Maaz, Muhammad [1 ]
Rasheed, Hanoona [1 ]
Khan, Salman [1 ]
Yang, Ming-Hsuan [2 ,3 ,4 ]
Khan, Fahad Shahbaz [5 ,6 ]
机构
[1] Mohamed Bin Zayed Univ Artificial Intelligence, Comp Vis Dept, Abu Dhabi, U Arab Emirates
[2] Univ Calif Merced, Elect Engn & Comp Sci Dept, Merced, CA 95343 USA
[3] Yonsei Univ, Coll Comp, Seoul 03722, South Korea
[4] Google, Mountain View, CA 95344 USA
[5] Mohamed Bin Zayed Univ, Abu Dhabi, U Arab Emirates
[6] Linkoping Univ, Elect Engn Dept, S-58183 Linkoping, Sweden
关键词
Image segmentation; Three-dimensional displays; Transformers; Biomedical imaging; Complexity theory; Graphics processing units; Task analysis; Deep learning; efficient attention; hybrid architecture; medical image segmentation; TRANSFORMER;
D O I
10.1109/TMI.2024.3398728
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Owing to the success of transformer models, recent works study their applicability in 3D medical segmentation tasks. Within the transformer models, the self-attention mechanism is one of the main building blocks that strives to capture long-range dependencies, compared to the local convolutional-based design. However, the self-attention operation has quadratic complexity which proves to be a computational bottleneck, especially in volumetric medical imaging, where the inputs are 3D with numerous slices. In this paper, we propose a 3D medical image segmentation approach, named UNETR++, that offers both high-quality segmentation masks as well as efficiency in terms of parameters, compute cost, and inference speed. The core of our design is the introduction of a novel efficient paired attention (EPA) block that efficiently learns spatial and channel-wise discriminative features using a pair of inter-dependent branches based on spatial and channel attention. Our spatial attention formulation is efficient and has linear complexity with respect to the input. To enable communication between spatial and channel-focused branches, we share the weights of query and key mapping functions that provide a complimentary benefit (paired attention), while also reducing the complexity. Our extensive evaluations on five benchmarks, Synapse, BTCV, ACDC, BraTS, and Decathlon-Lung, reveal the effectiveness of our contributions in terms of both efficiency and accuracy. On Synapse, our UNETR++ sets a new state-of-the-art with a Dice Score of 87.2%, while significantly reducing parameters and FLOPs by over 71%, compared to the best method in the literature. Our code and models are available at: https://tinyurl.com/2p87x5xn.
引用
收藏
页码:3377 / 3390
页数:14
相关论文
共 50 条
  • [31] EMONAS: Efficient Multiobjective Neural Architecture Search Framework for 3D Medical Image Segmentation
    Calisto, Maria G. Baldeon
    Lai-Yuen, Susana K.
    MEDICAL IMAGING 2021: IMAGE PROCESSING, 2021, 11596
  • [32] Efficient combined algorithm of Transformer and U-Net for 3D medical image segmentation
    Zhang, Mingyan
    Wang, Aixia
    Yang, Gang
    Li, Jingjiao
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4377 - 4382
  • [33] ACCURATE AND EFFICIENT SURFACE REPRESENTATION FOR 3D IMAGE FUSION
    YAN, CH
    BEAUPRE, GS
    BREITT, GA
    SUMANAWEERA, TS
    HEMLER, PF
    NAPEL, SA
    RADIOLOGY, 1995, 197 : 223 - 223
  • [34] Active Volume Models for 3D Medical Image Segmentation
    Shen, Tian
    Li, Hongsheng
    Qian, Zhen
    Huang, Xiaolei
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 707 - +
  • [35] Feature clustering algorithm for 3D medical image segmentation
    Li, Xinwu
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2008, 48 (SUPPL.): : 1790 - 1793
  • [36] Swarm Intelligence Approach to 3D Medical Image Segmentation
    Galinska, Marta
    Badura, Pawel
    INFORMATION TECHNOLOGIES IN MEDICINE, ITIB 2016, VOL 1, 2016, 471 : 15 - 24
  • [37] Adaptive metamorphs model for 3D medical image segmentation
    Huang, Junzhou
    Huang, Xiaolei
    Metaxas, Dimitris
    Axel, Leon
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2007, PT 1, PROCEEDINGS, 2007, 4791 : 302 - +
  • [38] Volumetric Attention for 3D Medical Image Segmentation and Detection
    Wang, Xudong
    Han, Shizhong
    Chen, Yunqiang
    Gao, Dashan
    Vasconcelos, Nuno
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT VI, 2019, 11769 : 175 - 184
  • [39] Elastic Boundary Projection for 3D Medical Image Segmentation
    Ni, Tianwei
    Xie, Lingxi
    Zheng, Huangjie
    Fishman, Elliot K.
    Yuille, Alan L.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2104 - 2113
  • [40] Medical Image Segmentation with Imperfect 3D Bounding Boxes
    Redekop, Ekaterina
    Chernyavskiy, Alexey
    DEEP GENERATIVE MODELS, AND DATA AUGMENTATION, LABELLING, AND IMPERFECTIONS, 2021, 13003 : 193 - 200