Abstract: 3D Medical Image Segmentation with Transformer-based Scaling of ConvNets: MedNeXt

Authors
Roy, Saikat [1, 3]
Koehler, Gregor [1, 3, 4]
Baumgartner, Michael [1]
Ulrich, Constantin [1, 5]
Isensee, Fabian [1, 4]
Jaeger, Paul F. [4, 6]
Maier-Hein, Klaus [1, 2]
Affiliations
[1] German Canc Res Ctr, Div Med Image Comp MIC, Heidelberg, Germany
[2] Heidelberg Univ Hosp, Dept Radiat Oncol, Pattern Anal & Learning Grp, Heidelberg, Germany
[3] Heidelberg Univ, Fac Math & Comp Sci, Heidelberg, Germany
[4] German Canc Res Ctr, Helmholtz Imaging, Heidelberg, Germany
[5] NCT Heidelberg, Natl Ctr Tumor Dis NCT, Heidelberg, Germany
[6] German Canc Res Ctr, Interact Machine Learning Grp, Heidelberg, Germany
DOI
10.1007/978-3-658-44037-4_23
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Transformer-based architectures have recently seen widespread adoption for medical image segmentation. However, achieving performance equivalent to that on natural images is challenging due to the absence of large-scale annotated datasets. In contrast, convolutional networks have stronger inductive biases and are consequently easier to train to high performance. Recently, the ConvNeXt architecture attempted to improve the standard ConvNet by upgrading the popular ResNet blocks to mirror Transformer blocks. In this work, we extend this idea to design a modernized and scalable convolutional architecture customized to the challenges of dense segmentation tasks in data-scarce medical settings. We introduce MedNeXt, a Transformer-inspired, scalable large-kernel network for medical image segmentation with 4 key features: 1) a fully ConvNeXt 3D encoder-decoder architecture to leverage network-wide benefits of the block design; 2) residual ConvNeXt blocks for up- and downsampling to preserve semantic richness across scales; 3) Upkern, an algorithm to iteratively increase kernel size by upsampling small-kernel networks, thus preventing performance saturation on limited data; and 4) compound scaling of depth, width, and kernel size to leverage the benefits of large-scale variants of the MedNeXt architecture. With state-of-the-art performance on 4 popular segmentation tasks, across variations in imaging modalities (CT, MRI) and dataset sizes, MedNeXt represents a modernized deep architecture for medical image segmentation. This work was originally published in [1]. Our code is made publicly available at: https://github.com/MIC-DKFZ/MedNeXt.
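The kernel-upsampling idea behind Upkern (feature 3 in the abstract) can be illustrated with a minimal pure-Python sketch. The abstract does not state the interpolation scheme, so this assumes trilinear interpolation with aligned endpoints; the function names `resize_axis_linear` and `upsample_kernel_3d` are hypothetical and are not taken from the MedNeXt repository.

```python
def resize_axis_linear(values, new_size):
    """Linearly resample a 1D sequence to new_size points, keeping both endpoints."""
    old_size = len(values)
    if new_size < 1:
        raise ValueError("new_size must be positive")
    if old_size == 1 or new_size == 1:
        return [float(values[0])] * new_size
    out = []
    for i in range(new_size):
        pos = i * (old_size - 1) / (new_size - 1)  # position in source coordinates
        lo = min(int(pos), old_size - 1)
        hi = min(lo + 1, old_size - 1)
        frac = pos - lo
        out.append(values[lo] * (1.0 - frac) + values[hi] * frac)
    return out


def upsample_kernel_3d(kernel, new_size):
    """Trilinearly resize a cubic kernel stored as nested lists [depth][height][width].

    Assumption: interpolation is applied separably, one axis at a time.
    """
    # Width axis: resample every row.
    k = [[resize_axis_linear(row, new_size) for row in plane] for plane in kernel]
    # Height axis: resample every column, then transpose back to [height][width].
    k = [
        [list(row) for row in zip(*[resize_axis_linear(col, new_size) for col in zip(*plane)])]
        for plane in k
    ]
    # Depth axis: resample every line running through the planes.
    out = [[[0.0] * new_size for _ in range(new_size)] for _ in range(new_size)]
    for h in range(new_size):
        for w in range(new_size):
            line = resize_axis_linear([plane[h][w] for plane in k], new_size)
            for d in range(new_size):
                out[d][h][w] = line[d]
    return out


# Illustrative usage: grow a 3x3x3 kernel into a 5x5x5 initialization.
small = [[[d + h + w for w in range(3)] for h in range(3)] for d in range(3)]
large = upsample_kernel_3d(small, 5)
```

In a full network, a step like this would presumably be applied to each convolution kernel whose size changes, with the resized weights serving only as the starting point for training the larger-kernel model; that usage is an assumption here, since the abstract gives no implementation detail.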
Pages: 79
Number of pages: 1