The Fully Convolutional Transformer for Medical Image Segmentation

被引:30
|
作者
Tragakis, Athanasios [1 ]
Kaul, Chaitanya [2 ]
Murray-Smith, Roderick [2 ]
Husmeier, Dirk [1 ]
机构
[1] Univ Glasgow, Math & Stat, Glasgow G12 8QW, Lanark, Scotland
[2] Univ Glasgow, Sch Comp Sci, Glasgow G12 8RZ, Lanark, Scotland
基金
英国工程与自然科学研究理事会; “创新英国”项目;
关键词
PLUS PLUS;
D O I
10.1109/WACV56688.2023.00365
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel transformer, capable of segmenting medical images of varying modalities. Challenges posed by the fine-grained nature of medical image analysis mean that the adaptation of the transformer for their analysis is still at nascent stages. The overwhelming success of the UNet lay in its ability to appreciate the fine-grained nature of the segmentation task, an ability which existing transformer based models do not currently posses. To address this shortcoming, we propose The Fully Convolutional Transformer (FCT), which builds on the proven ability of Convolutional Neural Networks to learn effective image representations, and combines them with the ability of Transformers to effectively capture long-term dependencies in its inputs. The FCT is the first fully convolutional Transformer model in medical imaging literature. It processes its input in two stages, where first, it learns to extract long range semantic dependencies from the input image, and then learns to capture hierarchical global attributes from the features. FCT is compact, accurate and robust. Our results show that it outperforms all existing transformer architectures by large margins across multiple medical image segmentation datasets of varying data modalities without the need for any pretraining. FCT outperforms its immediate competitor on the ACDC dataset by 1.3%, on the Synapse dataset by 4.4%, on the Spleen dataset by 1.2% and on ISIC 2017 dataset by 1.1% on the dice metric, with up to five times fewer parameters. On the ACDC Post-2017-MICCAI-Challenge online test set, our model sets a new state-of-the-art on unseen MRI test cases outperforming large ensemble models as well as nnUNet with considerably fewer parameters. Our code, environments and models will be available via GitHub(+).
引用
收藏
页码:3649 / 3658
页数:10
相关论文
共 50 条
  • [1] A parallelly contextual convolutional transformer for medical image segmentation
    Feng, Yuncong
    Su, Jianyu
    Zheng, Jian
    Zheng, Yupeng
    Zhang, Xiaoli
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 98
  • [2] ConTrans: Improving Transformer with Convolutional Attention for Medical Image Segmentation
    Lin, Ailiang
    Xu, Jiayu
    Li, Jinxing
    Lu, Guangming
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 297 - 307
  • [3] CCTrans: Improving Medical Image Segmentation with Contoured Convolutional Transformer Network
    Wang, Jingling
    Zhang, Haixian
    Yi, Zhang
    [J]. MATHEMATICS, 2023, 11 (09)
  • [4] CTRANSNET: CONVOLUTIONAL NEURAL NETWORK COMBINED WITH TRANSFORMER FOR MEDICAL IMAGE SEGMENTATION
    Zhang, Zhixin
    Jiang, Shuhao
    Pan, Xuhua
    [J]. COMPUTING AND INFORMATICS, 2023, 42 (02) : 392 - 410
  • [5] Ctnet: rethinking convolutional neural networks and vision transformer for medical image segmentation
    Zhang, Zhixin
    Jiang, Shuhao
    Pan, Xuhua
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) : 2265 - 2275
  • [6] Ctnet: rethinking convolutional neural networks and vision transformer for medical image segmentation
    Zhixin Zhang
    Shuhao Jiang
    Xuhua Pan
    [J]. Signal, Image and Video Processing, 2024, 18 : 2265 - 2275
  • [7] MRI Image Segmentation by Fully Convolutional Networks
    Wang, Yabiao
    Sun, Zeyu
    Liu, Chang
    Peng, Wenbo
    Zhang, Juhua
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, 2016, : 1697 - 1702
  • [8] A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer
    Zhang, Zhuo
    Wu, Hongbing
    Zhao, Huan
    Shi, Yicheng
    Wang, Jifang
    Bai, Hua
    Sun, Baoshan
    [J]. INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2023, 15 (04) : 663 - 677
  • [9] A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer
    Zhuo Zhang
    Hongbing Wu
    Huan Zhao
    Yicheng Shi
    Jifang Wang
    Hua Bai
    Baoshan Sun
    [J]. Interdisciplinary Sciences: Computational Life Sciences, 2023, 15 : 663 - 677
  • [10] V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
    Milletari, Fausto
    Navab, Nassir
    Ahmadi, Seyed-Ahmad
    [J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 565 - 571