Abstract: 3D Medical Image Segmentation with Transformer-based Scaling of ConvNets MedNeXt

被引:0
|
作者
Roy, Saikat [1 ,3 ]
Koehler, Gregor [1 ,3 ,4 ]
Baumgartner, Michael [1 ]
Ulrich, Constantin [1 ,5 ]
Isensee, Fabian [1 ,4 ]
Jaeger, Paul F. [4 ,6 ]
Maier-Hein, Klaus [1 ,2 ]
机构
[1] German Canc Res Ctr, Div Med Image Comp MIC, Heidelberg, Germany
[2] Heidelberg Univ Hosp, Dept Radiat Oncol, Pattern Anal & Learning Grp, Heidelberg, Germany
[3] Heidelberg Univ, Fac Math & Comp Sci, Heidelberg, Germany
[4] German Canc Res Ctr, Helmholtz Imaging, Heidelberg, Germany
[5] NCT Heidelberg, Natl Ctr Tumor Dis NCT, Heidelberg, Germany
[6] German Canc Res Ctr, Interact Machine Learning Grp, Heidelberg, Germany
关键词
D O I
10.1007/978-3-658-44037-4_23
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Transformer-based architectures have seen widespread adoption recently for medical image segmentation. However, achieving performances equivalent to those in natural images are challenging due to the absence of large-scale annotated datasets. In contrast, convolutional networks have higher inductive biases and consequently, are easier to train to high performance. Recently, the ConvNeXt architecture attempted to improve the standard ConvNet by upgrading the popular ResNet blocks to mirror Transformer blocks. In this work, we extend upon this to design a modernized and scalable convolutional architecture customized to challenges of dense segmentation tasks in data-scarce medical settings. In this work, we introduce the MedNeXt architecture which is a Transformer-inspired, scalable large-kernel network for medical image segmentation with 4 key features - 1) Fully ConvNeXt 3D Encoder-Decoder architecture to leverage network-wide benefits of the block design, 2) Residual ConvNeXt blocks for up and downsampling to preserve semantic richness across scales, 3) Upkern, an algorithm to iteratively increase kernel size by upsampling small kernel networks, thus preventing performance saturation on limited data, 4) Compound scaling of depth, width and kernel size to leverage the benefits of large-scale variants of the MedNeXt architecture. With state-of-the-art performance on 4 popular segmentation tasks, across variations in imaging modalities (CT, MRI) and dataset sizes, MedNeXt represents a modernized deep architecture for medical image segmentation. This work was originally published in [1]. Our code is made publicly available at: https://github.com/MIC-DKFZ/MedNeXt.
引用
收藏
页码:79 / 79
页数:1
相关论文
共 50 条
  • [1] MedNeXt: Transformer-Driven Scaling of ConvNets for Medical Image Segmentation
    Roy, Saikat
    Koehler, Gregor
    Ulrich, Constantin
    Baumgartner, Michael
    Petersen, Jens
    Isensee, Fabian
    Jaeger, Paul F.
    Maier-Hein, Klaus H.
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 405 - 415
  • [2] A Transformer-Based Network for Anisotropic 3D Medical Image Segmentation
    Guo, Danfeng
    Terzopoulos, Demetri
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8857 - 8861
  • [3] Transformer-based 3D Instance Segmentation With Auxiliary Denoising Learning
    Song, Sung-Ho
    Kim, Incheol
    [J]. Journal of Institute of Control, Robotics and Systems, 2023, 29 (12) : 954 - 965
  • [4] nnFormer: Volumetric Medical Image Segmentation via a 3D Transformer
    Zhou, Hong-Yu
    Guo, Jiansen
    Zhang, Yinghao
    Han, Xiaoguang
    Yu, Lequan
    Wang, Liansheng
    Yu, Yizhou
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4036 - 4045
  • [5] Transformer-Based Annotation Bias-Aware Medical Image Segmentation
    Liao, Zehui
    Hu, Shishuai
    Xie, Yutong
    Xia, Yong
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 24 - 34
  • [6] A 3D Medical Image Segmentation Framework Fusing Convolution and Transformer Features
    Zhu, Fazhan
    Lv, Jiaxing
    Lu, Kun
    Wang, Wenyan
    Cong, Hongshou
    Zhang, Jun
    Chen, Peng
    Zhao, Yuan
    Wu, Ziheng
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION (ICIC 2022), PT I, 2022, 13393 : 772 - 786
  • [7] MCRformer: Morphological constraint reticular transformer for 3D medical image segmentation
    Li, Jun
    Chen, Nan
    Zhou, Han
    Lai, Taotao
    Dong, Heng
    Feng, Chunhui
    Chen, Riqing
    Yang, Changcai
    Cai, Fanggang
    Wei, Lifang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [8] DAST: Differentiable Architecture Search with Transformer for 3D Medical Image Segmentation
    Yang, Dong
    Xu, Ziyue
    He, Yufan
    Nath, Vishwesh
    Li, Wenqi
    Myronenko, Andriy
    Hatamizadeh, Ali
    Zhao, Can
    Roth, Holger R.
    Xu, Daguang
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 747 - 756
  • [9] CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
    Xie, Yutong
    Zhang, Jianpeng
    Shen, Chunhua
    Xia, Yong
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 171 - 180
  • [10] EFFICIENT 3D TRANSFORMER WITH CLUSTER-BASED DOMAIN-ADVERSARIAL LEARNING FOR 3D MEDICAL IMAGE SEGMENTATION
    Zhang, Haoran
    Chen, Hao
    [J]. 2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,