A Fusion Deep Learning Model of ResNet and Vision Transformer for 3D CT Images

被引：1

作者：

Liu, Chiyu ^{[1
,2
]}

Sun, Cunjie ^{[1
,3
]}

机构：

[1] Xuzhou Med Univ, Dept Med Imaging, Xuzhou 221004, Peoples R China

[2] First Peoples Hosp Xuzhou, Imaging Ctr, Xuzhou 221002, Peoples R China

[3] Xuzhou Med Univ, Affiliated Hosp, Informat Dept, Xuzhou 221006, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Deep learning; fusion model; 3D CT images; COVID-19; Resnet; 3D; video swin transformer;

D O I：

10.1109/ACCESS.2024.3423689

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The outbreak of COVID-19 has had a serious impact on the safety of human life and property. Rapid and effective diagnosis is the key to the prevention and treatment of the virus. In this study, we introduce a new fusion model called "Reswin", which was trained by 3D CT data to diagnose COVID-19. The model combines two mainstream computer vision models, Resnet 3D (a convolutional neural network) and Video Swin Transformer (a vision transformer neural network), which use a soft voting method. We compared our proposed model Reswin with ResNet 3D-50, Swin-T, MViT, R2+1 D-50, SlowFast-50, X3D, and CSN101, which are state-of-the-art deep learning models used for the classification of 3D images. The Reswin model achieved an accuracy of 0.9099, precision of 0.9266, F1 score of 0.9425, AUC of 0.9541, and AUPR of 0.9861 in binary classification, and an accuracy of 0.8655, precision of 0.8580, and F1 score of 0.8620 in triple classification. Reswin provides a new solution for 3D CT image classification tasks and new ideas for the development of deep learning in 3D medical imaging.

引用

页码：93389 / 93397

页数：9

共 50 条

[21] 3D image fusion using MRI/CT and infrared images
Fusão 3D de imagens de MRI/CT e termografia
2013, Sociedade Brasileira de Engenharia Biomedica, Caixa Postal 68510, Rio de Janeiro, RJ, 21941-972, Brazil (29):
[22] Vision transformer and deep learning based weighted ensemble model for automated spine fracture type identification with GAN generated CT images
Sindhura D.N.
Radhika M. Pai
Shyamasunder N. Bhat
Manohara M. M. Pai
Scientific Reports, 15 (1)
[23] Learning 3D Face Representation with Vision Transformer for Masked Face Recognition
Wang, Yuan
Yang, Zhen
Zhang, Zhiqiang
Zang, Huaijuan
Zhu, Qiang
Zhan, Shu
2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 505 - 511
[24] DEEP LEARNING FOR OBJECTIVE QUALITY ASSESSMENT OF 3D IMAGES
Mocanu, Decebal Constantin
Exarchakos, Georgios
Liotta, Antonio
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 758 - 762
[25] Research on 3D Face Reconstruction Algorithm Based on ResNet and Transformer
Yaermaimaiti, Yilihamu
Yan, Tianxing
Zhao, Yuhang
Kari, Tusongjiang
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2024, 23 (01)
[26] 3D robotic navigation using a vision-based deep reinforcement learning model
Zieliński, P.
Markowska-Kaczmar, U.
Applied Soft Computing, 2021, 110
[27] 3D robotic navigation using a vision-based deep reinforcement learning model
Zielinski, P.
Markowska-Kaczmar, U.
APPLIED SOFT COMPUTING, 2021, 110
[28] Topologically preserved registration of 3D CT images with deep networks
Liu, Huaying
Gong, Guanzhong
Zou, Wei
Hu, Nan
Wang, Jiajun
PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (03):
[29] Histological Subtypes Classification of Lung Cancers on CT Images Using 3D Deep Learning and Radiomics
Guo, Yixian
Song, Qiong
Jiang, Mengmeng
Guo, Yinglong
Xu, Peng
Zhang, Yiqian
Fu, Chi-Cheng
Fang, Qu
Zeng, Mengsu
Yao, Xiuzhong
ACADEMIC RADIOLOGY, 2021, 28 (09) : E258 - E266
[30] Deep learning techniques to process 3D chest CT
Solar, Mauricio
Aguirre, Pablo
JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2024, 30 (06) : 758 - 778

← 1 2 3 4 5 →