A Fusion Deep Learning Model of ResNet and Vision Transformer for 3D CT Images

被引：1

作者：

Liu, Chiyu ^{[1
,2
]}

Sun, Cunjie ^{[1
,3
]}

机构：

[1] Xuzhou Med Univ, Dept Med Imaging, Xuzhou 221004, Peoples R China

[2] First Peoples Hosp Xuzhou, Imaging Ctr, Xuzhou 221002, Peoples R China

[3] Xuzhou Med Univ, Affiliated Hosp, Informat Dept, Xuzhou 221006, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Deep learning; fusion model; 3D CT images; COVID-19; Resnet; 3D; video swin transformer;

D O I：

10.1109/ACCESS.2024.3423689

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The outbreak of COVID-19 has had a serious impact on the safety of human life and property. Rapid and effective diagnosis is the key to the prevention and treatment of the virus. In this study, we introduce a new fusion model called "Reswin", which was trained by 3D CT data to diagnose COVID-19. The model combines two mainstream computer vision models, Resnet 3D (a convolutional neural network) and Video Swin Transformer (a vision transformer neural network), which use a soft voting method. We compared our proposed model Reswin with ResNet 3D-50, Swin-T, MViT, R2+1 D-50, SlowFast-50, X3D, and CSN101, which are state-of-the-art deep learning models used for the classification of 3D images. The Reswin model achieved an accuracy of 0.9099, precision of 0.9266, F1 score of 0.9425, AUC of 0.9541, and AUPR of 0.9861 in binary classification, and an accuracy of 0.8655, precision of 0.8580, and F1 score of 0.8620 in triple classification. Reswin provides a new solution for 3D CT image classification tasks and new ideas for the development of deep learning in 3D medical imaging.

引用

页码：93389 / 93397

页数：9

共 50 条

[41] A combined learning algorithm for prostate segmentation on 3D CT images
Ma, Ling
Guo, Rongrong
Zhang, Guoyi
Schuster, David M.
Fei, Baowei
MEDICAL PHYSICS, 2017, 44 (11) : 5768 - 5781
[42] Deep learning of 3D Computed Tomography (CT) images for organ segmentation using 2D multi-channel SegNet model
Liu, Yingzhou
Fu, Wanyi
Selvakumaran, Vignesh
Phelan, Matthew
Segars, W. Paul
Samei, Ehsan
Mazurowski, Maciej
Lo, Joseph Y.
Rubin, Geoffrey D.
Henao, Ricardo
MEDICAL IMAGING 2019: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2019, 10954
[43] Deep learning NTCP model for late dysphagia based on 3D dose, CT and segmentations
de Vette, S. P.
Chu, H.
Neh, H.
Steenbakkers, R. J.
van Ooijen, P. M.
Fuller, C. D.
Langendijk, J. A.
Sijtsema, N. M.
van Dijk, L. V.
RADIOTHERAPY AND ONCOLOGY, 2023, 182 : S51 - S52
[44] Deep Learning 3D Shape Surfaces Using Geometry Images
Sinha, Ayan
Bai, Jing
Ramani, Karthik
COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 223 - 240
[45] A Federated Deep Learning Framework for 3D Brain MRI Images
Fan, Zhipeng
Su, Jianpo
Gao, Kai
Hu, Dewen
Ling-Li Zeng
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[46] Geometries of panoramic images and 3D vision
Huang, Fay
Torii, Akihiko
Klette, Reinhard
Machine Graphics and Vision, 2010, 19 (04): : 463 - 477
[47] Studies of Vision Recognition of 3D Images
Huang, Je-Yi
Fang, Yi-Chin
Tsai, Chen-Mu
Chen, Ling-Fei
IDW'11: PROCEEDINGS OF THE 18TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2011, : 961 - 962
[48] Fusion of infrared images with 3D GIS model for environmental imaging
Bukowska-Belniak, B.
Lupa, M.
Lesniak, A.
13TH QUANTITATIVE INFRARED THERMOGRAPHY CONFERENCE, 2016, : 170 - 171
[49] Deep Learning-Based Landmark Localization in 3D CT Images of the Heart: Method and Dataset Comparison
Skrlj, Luka
Jelenc, Matija
Vrtovec, Tomaz
MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
[50] 3D Reconstruction for Super-Resolution CT Images in the Internet of Health Things Using Deep Learning
Zhang, Jing
Gong, Ling-Rui
Yu, Keping
Qi, Xin
Wen, Zheng
Hua, Qiaozhi
Myint, San Hlaing
IEEE ACCESS, 2020, 8 : 121513 - 121525

← 1 2 3 4 5 →