Collaborative networks of transformers and convolutional neural networks are powerful and versatile learners for accurate 3D medical image segmentation

Cited by: 3
Authors
Chen, Yong [1]
Lu, Xuesong [1]
Xie, Qinlan [1]
Affiliations
[1] South Cent Minzu Univ, Sch Biomed Engn, Wuhan 430074, Hubei, Peoples R China
Keywords
Convolutional neural networks; Transformers; Interlaced collaboration; Versatile models; 3D medical image segmentation;
DOI
10.1016/j.compbiomed.2023.107228
Chinese Library Classification number
Q [Biological sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Integrating transformers and convolutional neural networks is a crucial, cutting-edge approach to medical image segmentation. Nonetheless, existing hybrid methods fail to fully leverage the strengths of both operators. During patch embedding, the patch projection method ignores the two-dimensional structure and local spatial information within each patch, while the fixed patch size cannot effectively capture features with rich representation. Moreover, the calculation of self-attention results in attention diffusion, which hinders providing precise details to the decoder while maintaining feature consistency. Lastly, none of the existing methods establishes an efficient multi-scale modeling concept. To address these issues, we design the Collaborative Networks of Transformers and Convolutional neural networks (TC-CoNet), which is generally applicable to accurate 3D medical image segmentation. First, we carefully design a precise patch embedding to generate 3D features with accurate spatial position information, laying a solid foundation for subsequent learning. TC-CoNet then constructs the encoder-decoder backbone network in an interlaced combination to properly incorporate long-range dependencies and hierarchical object concepts at various scales. Furthermore, we employ a constricted attention bridge to constrict attention to local features, allowing us to accurately guide the recovery of detailed information while maintaining feature consistency. Finally, atrous spatial pyramid pooling is applied to the high-level feature map to establish the concept of multi-scale objects. Extensive experiments on five challenging datasets (Synapse, ACDC, brain tumor segmentation, cardiac left atrium segmentation, and lung tumor segmentation) demonstrate that TC-CoNet outperforms state-of-the-art approaches and exhibits strong transferability and generalization. These results fully illustrate the efficacy of the proposed combination of transformers and convolutional neural networks for medical image segmentation. Our code is freely available at: https://github.com/YongChen-Exact/TC-CoNet.
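The abstract mentions atrous spatial pyramid pooling on the high-level feature map. As an illustration only, the sketch below shows the general idea in plain NumPy: the same 3D volume is filtered in parallel with dilated (atrous) kernels at several rates, and the branch outputs are stacked. This is not the authors' implementation (see their repository for that); the function names `dilated_conv3d` and `aspp3d`, the single-channel input, the cubic kernels, and the rates used are all assumptions made for the example, and a full ASPP module would also learn the kernels and fuse the branches.

```python
import numpy as np


def dilated_conv3d(volume, kernel, rate):
    """Zero-padded, same-size 3D cross-correlation with a dilated kernel.

    Hypothetical helper: assumes a single-channel volume and a cubic
    kernel with odd side length.
    """
    k = kernel.shape[0]
    pad = (k // 2) * rate  # padding that keeps the output size unchanged
    padded = np.pad(volume, pad, mode="constant")
    d, h, w = volume.shape
    out = np.zeros(volume.shape, dtype=float)
    # Accumulate one shifted copy of the volume per kernel tap; taps are
    # spaced `rate` voxels apart, which enlarges the receptive field.
    for i in range(k):
        for j in range(k):
            for l in range(k):
                z, y, x = i * rate, j * rate, l * rate
                out += kernel[i, j, l] * padded[z:z + d, y:y + h, x:x + w]
    return out


def aspp3d(volume, kernels, rates):
    """Run one dilated branch per rate and stack the results on axis 0."""
    branches = [dilated_conv3d(volume, kern, r) for kern, r in zip(kernels, rates)]
    return np.stack(branches, axis=0)


# Illustrative usage with assumed sizes and rates.
volume = np.ones((8, 8, 8))
kernels = [np.ones((3, 3, 3)) for _ in range(3)]
features = aspp3d(volume, kernels, rates=(1, 2, 4))
print(features.shape)  # (3, 8, 8, 8)
```

Each branch sees the same input at a different effective receptive field (3, 5, and 9 voxels per axis for the rates above), which is the property the abstract refers to as modeling objects at multiple scales.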
Pages: 14
Related papers
50 items in total
  • [31] Convolutional Neural Networks for SAR Image Segmentation
    Malmgren-Hansen, David
    Nobel-Jorgensen, Morten
    2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 231 - 236
  • [32] 3D Medical image segmentation using parallel transformers
    Yan, Qingsen
    Liu, Shengqiang
    Xu, Songhua
    Dong, Caixia
    Li, Zongfang
    Shi, Javen Qinfeng
    Zhang, Yanning
    Dai, Duwei
    PATTERN RECOGNITION, 2023, 138
  • [33] Dense graph convolutional neural networks on 3D meshes for 3D object segmentation and classification
    Tang, Wenming
    Qiu, Guoping
    IMAGE AND VISION COMPUTING, 2021, 114
  • [34] Directionally Convolutional Networks for 3D Shape Segmentation
    Xu, Haotian
    Dong, Ming
    Zhong, Zichun
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2717 - 2726
  • [35] 3D Shape Segmentation with Projective Convolutional Networks
    Kalogerakis, Evangelos
    Averkiou, Melinos
    Maji, Subhransu
    Chaudhuri, Siddhartha
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6630 - 6639
  • [36] 3D Brain Image Segmentation Using 3D Tiled Convolution Neural Networks
    Haque, Md Mahibul
    Ria, Jobeda Khanam
    Al Mannan, Fahad
    Majumder, Sadman
    Uddin, Reaz
    Abed, Mahjabeen Tamanna
    Alam, Md Ashraful
    PATTERN RECOGNITION AND PREDICTION XXXV, 2024, 13040
  • [37] Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review
    Takahashi, Satoshi
    Sakaguchi, Yusuke
    Kouno, Nobuji
    Takasawa, Ken
    Ishizu, Kenichi
    Akagi, Yu
    Aoyama, Rina
    Teraya, Naoki
    Bolatkan, Amina
    Shinkai, Norio
    Machino, Hidenori
    Kobayashi, Kazuma
    Asada, Ken
    Komatsu, Masaaki
    Kaneko, Syuzo
    Sugiyama, Masashi
    Hamamoto, Ryuji
    JOURNAL OF MEDICAL SYSTEMS, 2024, 48 (01)
  • [38] Ctnet: rethinking convolutional neural networks and vision transformer for medical image segmentation
    Zhang, Zhixin
    Jiang, Shuhao
    Pan, Xuhua
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) : 2265 - 2275
  • [39] Convolutional Neural Networks and 3D Gabor Filtering for Hyperspectral Image Classification
    Wei X.
    Yu X.
    Tan X.
    Liu B.
    Zhi L.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (01) : 90 - 98
  • [40] MIScnn: a framework for medical image segmentation with convolutional neural networks and deep learning
    Mueller, Dominik
    Kramer, Frank
    BMC MEDICAL IMAGING, 2021, 21 (01)