Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review

被引:7
|
作者
Takahashi, Satoshi [1 ,2 ]
Sakaguchi, Yusuke [1 ,3 ]
Kouno, Nobuji [1 ,2 ,4 ]
Takasawa, Ken [1 ,2 ]
Ishizu, Kenichi [1 ]
Akagi, Yu [5 ]
Aoyama, Rina [1 ,6 ]
Teraya, Naoki [1 ,6 ]
Bolatkan, Amina [1 ,2 ]
Shinkai, Norio [1 ,2 ]
Machino, Hidenori [1 ,2 ]
Kobayashi, Kazuma [1 ,2 ]
Asada, Ken [1 ,2 ]
Komatsu, Masaaki [1 ,2 ]
Kaneko, Syuzo [1 ]
Sugiyama, Masashi [7 ]
Hamamoto, Ryuji [1 ,2 ]
机构
[1] Natl Canc Ctr, Res Inst, Div Med AI Res & Dev, 5-1-1 Tsukiji,Chuo Ku, Tokyo 1040045, Japan
[2] RIKEN, Ctr Adv Intelligence Project, Canc Translat Res Team, 1-4-1 Nihonbashi,Chuo Ku, Tokyo 1030027, Japan
[3] Univ Tokyo, Grad Sch Med, Dept Neurosurg, 7-3-1 Hongo Bunkyo ku, Tokyo 1138655, Japan
[4] Kyoto Univ, Grad Sch Med, Dept Surg, Yoshida konoe cho,Sakyo ku, Kyoto 6068303, Japan
[5] Univ Tokyo, Grad Sch Med, Dept Biomed Informat, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138655, Japan
[6] Showa Univ, Sch Med, Dept Obstet & Gynecol, 1-5-8 Hatanodai Shinagawa ku, Tokyo 1428666, Japan
[7] RIKEN, Ctr Adv Intelligence Project, Tokyo 1030027, Japan
基金
日本学术振兴会;
关键词
Artificial intelligence; Vision transformer; Convolutional neural network; Medical image analysis; Prior learning; SEGMENTATION;
D O I
10.1007/s10916-024-02105-8
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
In the rapidly evolving field of medical image analysis utilizing artificial intelligence (AI), the selection of appropriate computational models is critical for accurate diagnosis and patient care. This literature review provides a comprehensive comparison of vision transformers (ViTs) and convolutional neural networks (CNNs), the two leading techniques in the field of deep learning in medical imaging. We conducted a survey systematically. Particular attention was given to the robustness, computational efficiency, scalability, and accuracy of these models in handling complex medical datasets. The review incorporates findings from 36 studies and indicates a collective trend that transformer-based models, particularly ViTs, exhibit significant potential in diverse medical imaging tasks, showcasing superior performance when contrasted with conventional CNN models. Additionally, it is evident that pre-training is important for transformer applications. We expect this work to help researchers and practitioners select the most appropriate model for specific medical image analysis tasks, accounting for the current state of the art and future trends in the field.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Pooling in convolutional neural networks for medical image analysis: a survey and an empirical study
    Rajendran Nirthika
    Siyamalan Manivannan
    Amirthalingam Ramanan
    Ruixuan Wang
    Neural Computing and Applications, 2022, 34 : 5321 - 5347
  • [42] Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning?
    Tajbakhsh, Nima
    Shin, Jae Y.
    Gurudu, Suryakanth R.
    Hurst, R. Todd
    Kendall, Christopher B.
    Gotway, Michael B.
    Liang, Jianming
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) : 1299 - 1312
  • [43] Uncertainty-Aware Vision Transformers for Medical Image Analysis
    Erick, Franciskus Xaverius
    Rezaei, Mina
    Mueller, Johanna Paula
    Kainz, Bernhard
    UNCERTAINTY FOR SAFE UTILIZATION OF MACHINE LEARNING IN MEDICAL IMAGING, UNSURE 2024, 2025, 15167 : 171 - 180
  • [44] Collaborative networks of transformers and convolutional neural networks are powerful and versatile learners for accurate 3D medical image segmentation
    Chen, Yong
    Lu, Xuesong
    Xie, Qinlan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164
  • [45] Automatic Microstructural Classification of Ultrahigh Carbon Steel with Vision Transformers and Convolutional Neural Networks
    Liu, Xiu
    Aldrich, Chris
    IFAC PAPERSONLINE, 2024, 58 (22): : 119 - 123
  • [46] EXPLORING THE COLLABORATION BETWEEN CONVOLUTIONAL NEURAL NETWORKS AND TRANSFORMERS IN HYPERSPECTRAL IMAGE CLASSIFICATION
    Gao, Hongmin
    Zhang, Yiyan
    Chen, Zhonghao
    Wu, Hongyi
    Zhang, Weibo
    Li, Chenming
    2022 12TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2022,
  • [47] A comparative study of vision transformers and convolutional neural networks: sugarcane leaf diseases identification
    Süleyman Öğrekçi
    Yavuz Ünal
    Muhammet Nuri Dudak
    European Food Research and Technology, 2023, 249 : 1833 - 1843
  • [48] A comparative study of vision transformers and convolutional neural networks: sugarcane leaf diseases identification
    Ogrekci, Suleyman
    Unal, Yavuz
    Dudak, Muhammet Nuri
    EUROPEAN FOOD RESEARCH AND TECHNOLOGY, 2023, 249 (07) : 1833 - 1843
  • [49] Utilizing convolutional neural networks and vision transformers for precise corn leaf disease identification
    Ishak Pacal
    Gültekin Işık
    Neural Computing and Applications, 2025, 37 (4) : 2479 - 2496
  • [50] An Analysis of Convolutional Neural Networks for Image Recognition
    He, Jun
    Liu, Yue
    Li, Shuai
    Shen, Jin-ming
    2017 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELING, SIMULATION AND APPLIED MATHEMATICS (CMSAM), 2017, : 524 - 528