Collaborative networks of transformers and convolutional neural networks are powerful and versatile learners for accurate 3D medical image segmentation

Cited by: 3

Authors:

Chen, Yong [1]

Lu, Xuesong [1]

Xie, Qinlan [1]

Affiliations:

[1] South Cent Minzu Univ, Sch Biomed Engn, Wuhan 430074, Hubei, Peoples R China

Source:

COMPUTERS IN BIOLOGY AND MEDICINE | 2023 / Vol. 164

Keywords:

Convolutional neural networks; Transformers; Interlaced collaboration; Versatile models; 3D medical image segmentation;

DOI:

10.1016/j.compbiomed.2023.107228

Chinese Library Classification (CLC) number:

Q [Biological Sciences];

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

Integrating transformers and convolutional neural networks is a crucial, cutting-edge approach to medical image segmentation. Nonetheless, existing hybrid methods fail to fully leverage the strengths of both operators. During patch embedding, the patch projection method ignores the two-dimensional structure and local spatial information within each patch, while the fixed patch size cannot effectively capture richly represented features. Moreover, the calculation of self-attention results in attention diffusion, which hinders providing precise details to the decoder while maintaining feature consistency. Lastly, none of the existing methods establishes an efficient multi-scale modeling concept. To address these issues, we design the Collaborative Networks of Transformers and Convolutional neural networks (TC-CoNet), a general-purpose model for accurate 3D medical image segmentation. First, we design a precise patch embedding to generate 3D features with accurate spatial position information, laying a solid foundation for subsequent learning. The encoder-decoder backbone of TC-CoNet is then built from an interlaced combination of the two operators to properly incorporate long-range dependencies and hierarchical object concepts at various scales. Furthermore, we employ a constricted attention bridge to constrict attention to local features, which accurately guides the recovery of detailed information while maintaining feature consistency. Finally, atrous spatial pyramid pooling is applied to the high-level feature map to establish the concept of multi-scale objects. Extensive experiments on five challenging datasets (Synapse, ACDC, brain tumor segmentation, cardiac left atrium segmentation, and lung tumor segmentation) demonstrate that TC-CoNet outperforms state-of-the-art approaches in accuracy, transferability, and generalization. These results demonstrate the efficacy of the proposed combination of transformers and convolutional neural networks for medical image segmentation. Our code is freely available at: https://github.com/YongChen-Exact/TC-CoNet.

Pages: 14
