A Fusion Deep Learning Model of ResNet and Vision Transformer for 3D CT Images

Cited by: 1
Authors
Liu, Chiyu [1, 2]
Sun, Cunjie [1, 3]
Affiliations
[1] Xuzhou Med Univ, Dept Med Imaging, Xuzhou 221004, Peoples R China
[2] First Peoples Hosp Xuzhou, Imaging Ctr, Xuzhou 221002, Peoples R China
[3] Xuzhou Med Univ, Affiliated Hosp, Informat Dept, Xuzhou 221006, Peoples R China
Source
IEEE Access | 2024 / Vol. 12
Keywords
Deep learning; fusion model; 3D CT images; COVID-19; ResNet 3D; Video Swin Transformer
DOI
10.1109/ACCESS.2024.3423689
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The outbreak of COVID-19 has had a serious impact on the safety of human life and property. Rapid and effective diagnosis is key to the prevention and treatment of the virus. In this study, we introduce a new fusion model called "Reswin", which was trained on 3D CT data to diagnose COVID-19. The model combines two mainstream computer vision models, ResNet 3D (a convolutional neural network) and Video Swin Transformer (a vision transformer neural network), and fuses their predictions by soft voting. We compared Reswin with ResNet 3D-50, Swin-T, MViT, R(2+1)D-50, SlowFast-50, X3D, and CSN101, which are state-of-the-art deep learning models for 3D image classification. Reswin achieved an accuracy of 0.9099, precision of 0.9266, F1 score of 0.9425, AUC of 0.9541, and AUPR of 0.9861 in binary classification, and an accuracy of 0.8655, precision of 0.8580, and F1 score of 0.8620 in three-class classification. Reswin provides a new solution for 3D CT image classification tasks and new ideas for the development of deep learning in 3D medical imaging.
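The soft voting step mentioned in the abstract can be illustrated with a minimal sketch. It assumes each branch outputs per-class logits, and that the two probability vectors are averaged with equal weight; the abstract does not state the weighting, so `weight_a`, the function names, and the example logits below are all hypothetical and not taken from the paper.

```python
import numpy as np


def softmax(logits):
    """Convert per-class logits of shape (n_samples, n_classes) to probabilities."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # for numerical stability
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def soft_vote(prob_a, prob_b, weight_a=0.5):
    """Fuse two models by a weighted average of their class probabilities.

    weight_a=0.5 (equal weighting) is an assumption, not a value from the paper.
    Returns the fused probabilities and the predicted class index per sample.
    """
    fused = weight_a * prob_a + (1.0 - weight_a) * prob_b
    return fused, fused.argmax(axis=1)


# Made-up logits for three scans and two classes (illustrative only).
resnet_logits = np.array([[2.0, 0.0], [0.0, 1.0], [0.5, 0.0]])
swin_logits = np.array([[1.0, 0.0], [0.0, 3.0], [0.0, 2.0]])

fused, labels = soft_vote(softmax(resnet_logits), softmax(swin_logits))
print(labels)
```

In the third made-up scan the two branches disagree, and the more confident branch decides the fused label. This is the property that distinguishes soft voting from hard (majority) voting, which would need a tie-break rule with only two voters.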
Pages: 93389-93397
Number of pages: 9