Recent progress in transformer-based medical image analysis

被引:21
|
作者
Liu, Zhaoshan [1 ]
Lv, Qiujie [1 ,2 ]
Yang, Ziduo [1 ,2 ]
Li, Yifan [1 ]
Lee, Chau Hung [1 ,3 ]
Shen, Lei [1 ]
机构
[1] Natl Univ Singapore, Dept Mech Engn, 9 Engn Dr 1, Singapore 117575, Singapore
[2] Sun Yat Sen Univ, Sch Intelligent Syst Engn, 66 Gongchang Rd, Singapore 518107, Singapore
[3] Tan Tock Seng Hosp, Dept Radiol, 11 Jalan Tan Tock Seng, Singapore 308433, Singapore
关键词
Deep learning; Transformer; Attention mechanism; Convolutional neural network; Medical image analysis; CHEST-X-RAY; VISION TRANSFORMER; CT IMAGE; TUMOR SEGMENTATION; CNN-TRANSFORMER; NEURAL-NETWORK; ATTENTION; COVID-19; NET; ULTRASOUND;
D O I
10.1016/j.compbiomed.2023.107268
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The transformer is primarily used in the field of natural language processing. Recently, it has been adopted and shows promise in the computer vision (CV) field. Medical image analysis (MIA), as a critical branch of CV, also greatly benefits from this state-of-the-art technique. In this review, we first recap the core component of the transformer, the attention mechanism, and the detailed structures of the transformer. After that, we depict the recent progress of the transformer in the field of MIA. We organize the applications in a sequence of different tasks, including classification, segmentation, captioning, registration, detection, enhancement, localization, and synthesis. The mainstream classification and segmentation tasks are further divided into eleven medical image modalities. A large number of experiments studied in this review illustrate that the transformer-based method outperforms existing methods through comparisons with multiple evaluation metrics. Finally, we discuss the open challenges and future opportunities in this field. This task-modality review with the latest contents, detailed information, and comprehensive comparison may greatly benefit the broad MIA community.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] A Transformer-Based Network for Deformable Medical Image Registration
    Wang, Yibo
    Qian, Wen
    Li, Mengqi
    Zhang, Xuming
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 502 - 513
  • [2] Transformer-based Image Compression
    Lu, Ming
    Guo, Peiyao
    Shi, Huiqing
    Cao, Chuntong
    Ma, Zhan
    [J]. DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 469 - 469
  • [3] Transformer-Based Annotation Bias-Aware Medical Image Segmentation
    Liao, Zehui
    Hu, Shishuai
    Xie, Yutong
    Xia, Yong
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 24 - 34
  • [4] Performance Comparison of Vision Transformer-Based Models in Medical Image Classification
    Kanca, Elif
    Ayas, Selen
    Kablan, Elif Baykal
    Ekinci, Murat
    [J]. 2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
  • [5] TanrsColour: Transformer-based medical image colourization with content and structure preservation
    Liu, Qinghai
    Zhao, Dengping
    Tang, Lun
    Xu, Limin
    [J]. IET IMAGE PROCESSING, 2024, 18 (10) : 2702 - 2714
  • [6] TRANSFORMER-BASED SAR IMAGE DESPECKLING
    Perera, Malsha V.
    Bandara, Wele Gedara Chaminda
    Valanarasu, Jeya Maria Jose
    Patel, Vishal M.
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 751 - 754
  • [7] Explaining transformer-based image captioning models: An empirical analysis
    Cornia, Marcella
    Baraldi, Lorenzo
    Cucchiara, Rita
    [J]. AI COMMUNICATIONS, 2022, 35 (02) : 111 - 129
  • [8] Swin transformer-based GAN for multi-modal medical image translation
    Yan, Shouang
    Wang, Chengyan
    Chen, Weibo
    Lyu, Jun
    [J]. FRONTIERS IN ONCOLOGY, 2022, 12
  • [9] DTMFormer: Dynamic Token Merging for Boosting Transformer-Based Medical Image Segmentation
    Wang, Zhehao
    Lin, Xian
    Wu, Nannan
    Yu, Li
    Cheng, Kwang-Ting
    Yan, Zengqiang
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5814 - 5822
  • [10] A Transformer-Based Network for Anisotropic 3D Medical Image Segmentation
    Guo, Danfeng
    Terzopoulos, Demetri
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8857 - 8861