A Fusion Deep Learning Model of ResNet and Vision Transformer for 3D CT Images

被引:1
|
作者
Liu, Chiyu [1 ,2 ]
Sun, Cunjie [1 ,3 ]
机构
[1] Xuzhou Med Univ, Dept Med Imaging, Xuzhou 221004, Peoples R China
[2] First Peoples Hosp Xuzhou, Imaging Ctr, Xuzhou 221002, Peoples R China
[3] Xuzhou Med Univ, Affiliated Hosp, Informat Dept, Xuzhou 221006, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Deep learning; fusion model; 3D CT images; COVID-19; Resnet; 3D; video swin transformer;
D O I
10.1109/ACCESS.2024.3423689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The outbreak of COVID-19 has had a serious impact on the safety of human life and property. Rapid and effective diagnosis is the key to the prevention and treatment of the virus. In this study, we introduce a new fusion model called "Reswin", which was trained by 3D CT data to diagnose COVID-19. The model combines two mainstream computer vision models, Resnet 3D (a convolutional neural network) and Video Swin Transformer (a vision transformer neural network), which use a soft voting method. We compared our proposed model Reswin with ResNet 3D-50, Swin-T, MViT, R2+1 D-50, SlowFast-50, X3D, and CSN101, which are state-of-the-art deep learning models used for the classification of 3D images. The Reswin model achieved an accuracy of 0.9099, precision of 0.9266, F1 score of 0.9425, AUC of 0.9541, and AUPR of 0.9861 in binary classification, and an accuracy of 0.8655, precision of 0.8580, and F1 score of 0.8620 in triple classification. Reswin provides a new solution for 3D CT image classification tasks and new ideas for the development of deep learning in 3D medical imaging.
引用
收藏
页码:93389 / 93397
页数:9
相关论文
共 50 条
  • [41] A combined learning algorithm for prostate segmentation on 3D CT images
    Ma, Ling
    Guo, Rongrong
    Zhang, Guoyi
    Schuster, David M.
    Fei, Baowei
    MEDICAL PHYSICS, 2017, 44 (11) : 5768 - 5781
  • [42] Deep learning of 3D Computed Tomography (CT) images for organ segmentation using 2D multi-channel SegNet model
    Liu, Yingzhou
    Fu, Wanyi
    Selvakumaran, Vignesh
    Phelan, Matthew
    Segars, W. Paul
    Samei, Ehsan
    Mazurowski, Maciej
    Lo, Joseph Y.
    Rubin, Geoffrey D.
    Henao, Ricardo
    MEDICAL IMAGING 2019: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2019, 10954
  • [43] Deep learning NTCP model for late dysphagia based on 3D dose, CT and segmentations
    de Vette, S. P.
    Chu, H.
    Neh, H.
    Steenbakkers, R. J.
    van Ooijen, P. M.
    Fuller, C. D.
    Langendijk, J. A.
    Sijtsema, N. M.
    van Dijk, L. V.
    RADIOTHERAPY AND ONCOLOGY, 2023, 182 : S51 - S52
  • [44] Deep Learning 3D Shape Surfaces Using Geometry Images
    Sinha, Ayan
    Bai, Jing
    Ramani, Karthik
    COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 223 - 240
  • [45] A Federated Deep Learning Framework for 3D Brain MRI Images
    Fan, Zhipeng
    Su, Jianpo
    Gao, Kai
    Hu, Dewen
    Ling-Li Zeng
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [46] Geometries of panoramic images and 3D vision
    Huang, Fay
    Torii, Akihiko
    Klette, Reinhard
    Machine Graphics and Vision, 2010, 19 (04): : 463 - 477
  • [47] Studies of Vision Recognition of 3D Images
    Huang, Je-Yi
    Fang, Yi-Chin
    Tsai, Chen-Mu
    Chen, Ling-Fei
    IDW'11: PROCEEDINGS OF THE 18TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2011, : 961 - 962
  • [48] Fusion of infrared images with 3D GIS model for environmental imaging
    Bukowska-Belniak, B.
    Lupa, M.
    Lesniak, A.
    13TH QUANTITATIVE INFRARED THERMOGRAPHY CONFERENCE, 2016, : 170 - 171
  • [49] Deep Learning-Based Landmark Localization in 3D CT Images of the Heart: Method and Dataset Comparison
    Skrlj, Luka
    Jelenc, Matija
    Vrtovec, Tomaz
    MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [50] 3D Reconstruction for Super-Resolution CT Images in the Internet of Health Things Using Deep Learning
    Zhang, Jing
    Gong, Ling-Rui
    Yu, Keping
    Qi, Xin
    Wen, Zheng
    Hua, Qiaozhi
    Myint, San Hlaing
    IEEE ACCESS, 2020, 8 : 121513 - 121525