A 3D Medical Image Segmentation Framework Fusing Convolution and Transformer Features

被引:1
|
作者
Zhu, Fazhan [1 ]
Lv, Jiaxing [1 ]
Lu, Kun [1 ,2 ]
Wang, Wenyan [1 ,2 ,3 ]
Cong, Hongshou [1 ]
Zhang, Jun [4 ]
Chen, Peng [5 ,6 ,7 ]
Zhao, Yuan [1 ,2 ]
Wu, Ziheng [1 ,2 ]
机构
[1] Anhui Univ Technol, Sch Elect & Informat Engn, Maanshan 243032, Anhui, Peoples R China
[2] Anhui Univ Technol, Key Lab Met Emiss Reduct & Resources Recycling, Minist Educ, Maanshan 243002, Peoples R China
[3] Anhui Univ Technol, Sch Mat Sci & Engn, Maanshan 243032, Anhui, Peoples R China
[4] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Anhui, Peoples R China
[5] Anhui Univ, Sch Internet, Natl Engn Res Ctr Agroecol Big Data Anal & Applic, Hefei 230601, Anhui, Peoples R China
[6] Anhui Univ, Inst Phys Sci, Hefei 230601, Anhui, Peoples R China
[7] Anhui Univ, Inst Informat Technol, Hefei 230601, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; 3D; Convolutional neural networks; Transformer;
D O I
10.1007/978-3-031-13870-6_63
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Medical images can be accurately segmented to provide reliable basis for clinical diagnosis and pathology research, and assist doctors to make more accurate diagnosis, as well as deep learning technology can accelerate this process. Convolutional Neural Networks (CNNs) and Transformer have become two mainstream architectures of deep learning in medical image segmentation. However, the Transformer architecture has limited ability to obtain local inductive bias, and the Transformer architecture is at a disadvantage in a small sample data set. Many theories and experiments show that the above problems can be effectively solved by fusing Convolution and Transformer features. In this manuscript, a new U-shaped segmentation model based on Convolution and swin-transformer framework is proposed, which is called CST-UNET. In the encoder part, it combines the advantages of both dilated convolution and Transformer, which can make the model fully obtain semantic inductive bias information and long-term information. At the same time, it has the advantages of fewer parameters and lower Flops. Even if it is trained on a small sample data set, the framework still has strong generalization ability. In addition, on BraTS2021 dataset, the Dice coefficients of ET, TC and WTare 85.46%, 89.38%, 92.35% respectively, and the result of HD95 are 7.95, 5.06 and 4.07 respectively.
引用
收藏
页码:772 / 786
页数:15
相关论文
共 50 条
  • [1] HCA-former: Hybrid Convolution Attention Transformer for 3D Medical Image Segmentation
    Yang, Fan
    Wang, Fan
    Dong, Pengwei
    Wang, Bo
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 90
  • [2] CoTrFuse: a novel framework by fusing CNN and transformer for medical image segmentation
    Chen, Yuanbin
    Wang, Tao
    Tang, Hui
    Zhao, Longxuan
    Zhang, Xinlin
    Tan, Tao
    Gao, Qinquan
    Du, Min
    Tong, Tong
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (17):
  • [3] A hybrid framework for 3D medical image segmentation
    Chen, T
    Metaxas, D
    [J]. MEDICAL IMAGE ANALYSIS, 2005, 9 (06) : 547 - 565
  • [4] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    [J]. 2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [5] nnFormer: Volumetric Medical Image Segmentation via a 3D Transformer
    Zhou, Hong-Yu
    Guo, Jiansen
    Zhang, Yinghao
    Han, Xiaoguang
    Yu, Lequan
    Wang, Liansheng
    Yu, Yizhou
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4036 - 4045
  • [6] Hybrid segmentation framework for 3D medical image analysis
    Chen, T
    Metaxas, D
    [J]. MEDICAL IMAGING 2003: IMAGE PROCESSING, PTS 1-3, 2003, 5032 : 1421 - 1432
  • [7] FUSING ADAPTIVE ATLAS AND INFORMATIVE FEATURES FOR ROBUST 3D BRAIN IMAGE SEGMENTATION
    Liu, Cheng-Yi
    Iglesias, Juan Eugenio
    Toga, Arthur
    Tu, Zhuowen
    [J]. 2010 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, 2010, : 848 - 851
  • [8] LiteTrans: Reconstruct Transformer with Convolution for Medical Image Segmentation
    Xu, Shuying
    Quan, Hongyan
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 300 - 313
  • [9] MCRformer: Morphological constraint reticular transformer for 3D medical image segmentation
    Li, Jun
    Chen, Nan
    Zhou, Han
    Lai, Taotao
    Dong, Heng
    Feng, Chunhui
    Chen, Riqing
    Yang, Changcai
    Cai, Fanggang
    Wei, Lifang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [10] DAST: Differentiable Architecture Search with Transformer for 3D Medical Image Segmentation
    Yang, Dong
    Xu, Ziyue
    He, Yufan
    Nath, Vishwesh
    Li, Wenqi
    Myronenko, Andriy
    Hatamizadeh, Ali
    Zhao, Can
    Roth, Holger R.
    Xu, Daguang
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 747 - 756