A Fusion Deep Learning Model of ResNet and Vision Transformer for 3D CT Images

被引:1
|
作者
Liu, Chiyu [1 ,2 ]
Sun, Cunjie [1 ,3 ]
机构
[1] Xuzhou Med Univ, Dept Med Imaging, Xuzhou 221004, Peoples R China
[2] First Peoples Hosp Xuzhou, Imaging Ctr, Xuzhou 221002, Peoples R China
[3] Xuzhou Med Univ, Affiliated Hosp, Informat Dept, Xuzhou 221006, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Deep learning; fusion model; 3D CT images; COVID-19; Resnet; 3D; video swin transformer;
D O I
10.1109/ACCESS.2024.3423689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The outbreak of COVID-19 has had a serious impact on the safety of human life and property. Rapid and effective diagnosis is the key to the prevention and treatment of the virus. In this study, we introduce a new fusion model called "Reswin", which was trained by 3D CT data to diagnose COVID-19. The model combines two mainstream computer vision models, Resnet 3D (a convolutional neural network) and Video Swin Transformer (a vision transformer neural network), which use a soft voting method. We compared our proposed model Reswin with ResNet 3D-50, Swin-T, MViT, R2+1 D-50, SlowFast-50, X3D, and CSN101, which are state-of-the-art deep learning models used for the classification of 3D images. The Reswin model achieved an accuracy of 0.9099, precision of 0.9266, F1 score of 0.9425, AUC of 0.9541, and AUPR of 0.9861 in binary classification, and an accuracy of 0.8655, precision of 0.8580, and F1 score of 0.8620 in triple classification. Reswin provides a new solution for 3D CT image classification tasks and new ideas for the development of deep learning in 3D medical imaging.
引用
收藏
页码:93389 / 93397
页数:9
相关论文
共 50 条
  • [21] 3D image fusion using MRI/CT and infrared images
    Fusão 3D de imagens de MRI/CT e termografia
    2013, Sociedade Brasileira de Engenharia Biomedica, Caixa Postal 68510, Rio de Janeiro, RJ, 21941-972, Brazil (29):
  • [22] Vision transformer and deep learning based weighted ensemble model for automated spine fracture type identification with GAN generated CT images
    Sindhura D.N.
    Radhika M. Pai
    Shyamasunder N. Bhat
    Manohara M. M. Pai
    Scientific Reports, 15 (1)
  • [23] Learning 3D Face Representation with Vision Transformer for Masked Face Recognition
    Wang, Yuan
    Yang, Zhen
    Zhang, Zhiqiang
    Zang, Huaijuan
    Zhu, Qiang
    Zhan, Shu
    2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 505 - 511
  • [24] DEEP LEARNING FOR OBJECTIVE QUALITY ASSESSMENT OF 3D IMAGES
    Mocanu, Decebal Constantin
    Exarchakos, Georgios
    Liotta, Antonio
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 758 - 762
  • [25] Research on 3D Face Reconstruction Algorithm Based on ResNet and Transformer
    Yaermaimaiti, Yilihamu
    Yan, Tianxing
    Zhao, Yuhang
    Kari, Tusongjiang
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2024, 23 (01)
  • [26] 3D robotic navigation using a vision-based deep reinforcement learning model
    Zieliński, P.
    Markowska-Kaczmar, U.
    Applied Soft Computing, 2021, 110
  • [27] 3D robotic navigation using a vision-based deep reinforcement learning model
    Zielinski, P.
    Markowska-Kaczmar, U.
    APPLIED SOFT COMPUTING, 2021, 110
  • [28] Topologically preserved registration of 3D CT images with deep networks
    Liu, Huaying
    Gong, Guanzhong
    Zou, Wei
    Hu, Nan
    Wang, Jiajun
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (03):
  • [29] Histological Subtypes Classification of Lung Cancers on CT Images Using 3D Deep Learning and Radiomics
    Guo, Yixian
    Song, Qiong
    Jiang, Mengmeng
    Guo, Yinglong
    Xu, Peng
    Zhang, Yiqian
    Fu, Chi-Cheng
    Fang, Qu
    Zeng, Mengsu
    Yao, Xiuzhong
    ACADEMIC RADIOLOGY, 2021, 28 (09) : E258 - E266
  • [30] Deep learning techniques to process 3D chest CT
    Solar, Mauricio
    Aguirre, Pablo
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2024, 30 (06) : 758 - 778