3D multi-scale vision transformer for lung nodule detection in chest CT images

被引:0
|
作者
Hassan Mkindu
Longwen Wu
Yaqin Zhao
机构
[1] Harbin Institute of Technology,School of Electronics and Information Engineering
来源
关键词
Computer-aided diagnosis; Computed tomography; Vision transformer; Lung nodule; 3D-MSViT;
D O I
暂无
中图分类号
学科分类号
摘要
Lung cancer becomes the most prominent cause of cancer-related death in society. Normally, radiologists use computed tomography (CT) to diagnose lung nodules in lung cancer patients. A single CT scan for a patient produces hundreds of images that are manually analyzed by radiologists which is a big burden and sometimes leads to inaccuracy. Recently, many computer-aided diagnosis (CAD) systems integrated with deep learning architectures have been proposed to assist radiologists. This study proposes the CAD scheme based on a 3D multi-scale vision transformer (3D-MSViT) to enhance multi-scale feature extraction and improves lung nodule prediction efficiency from 3D CT images. The 3D-MSViT architecture adopted a local–global transformer block structure whereby the local transformer stage individually processes each scale patch and forwards it to the global transformer level for merging multi-scale features. The transformer blocks fully relied on the attention mechanism without the inclusion of the convolutional neural network to reduce the network parameters. The proposed CAD scheme was validated on 888 CT images of the Lung Nodule Analysis 2016 (LUNA16) public dataset. Free-response receiver operating characteristics analysis was adopted to evaluate the proposed method. The 3D-MSViT algorithm obtained the highest sensitivity of 97.81% and competition performance metrics of 0.911. Therefore, the 3D-MSViT scheme obtained comparable results with low network complexity related to the counterpart deep learning approaches in prior studies.
引用
收藏
页码:2473 / 2480
页数:7
相关论文
共 50 条
  • [41] LUNG NODULE DETECTION IN CT USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Huang, Xiaojie
    Shan, Junjie
    Vaidya, Vivek
    2017 IEEE 14TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2017), 2017, : 379 - 383
  • [42] Nodule detection performance in compressed chest CT images
    Feldman, U
    Judy, PF
    Seltzer, SE
    Topal, U
    Wester, C
    IMAGE PERCEPTION - MEDICAL IMAGING 1996, 1996, 2712 : 123 - 127
  • [43] Pulmonary nodule detection using chest CT images
    Kim, DY
    Kim, JH
    Noh, SM
    Park, JW
    ACTA RADIOLOGICA, 2003, 44 (03) : 252 - 257
  • [44] A hybrid multi-scale model for thyroid nodule boundary detection on ultrasound images
    Tsantis, S.
    Dimitropoulos, N.
    Cauouras, D.
    Nikiforidis, G.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2006, 84 (2-3) : 86 - 98
  • [45] MSIT-Det: Multi-Scale Feature Aggregation with Iterative Transformer Networks for 3D Object Detection
    Li, Xi
    Chen, Yuanyuan
    Lv, Yisheng
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 5510 - 5515
  • [46] Generative EO/IR multi-scale vision transformer for improved object detection
    Christian, Jonathan
    Bright, Max
    Summers, Jason
    Olson, Ashley
    Havens, Tim
    SYNTHETIC DATA FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING: TOOLS, TECHNIQUES, AND APPLICATIONS II, 2024, 13035
  • [47] Computerized lung nodule detection in helical CT images using adaptive 3D clustering and artificial neural network
    Sahiner, B
    Chan, H
    Petrick, NA
    Hadjiiski, LM
    Kazerooni, EA
    Cascade, PN
    RADIOLOGY, 2002, 225 : 534 - 534
  • [48] NODULe: Combining constrained multi-scale LoG filters with densely dilated 3D deep convolutional neural network for pulmonary nodule detection
    Zhang, Junjie
    Xia, Yong
    Zeng, Haoyue
    Zhang, Yanning
    NEUROCOMPUTING, 2018, 317 : 159 - 167
  • [49] Multi-scale spatial-temporal transformer for 3D human pose estimation
    Wu, Yongpeng
    Gao, Junna
    2021 5TH INTERNATIONAL CONFERENCE ON VISION, IMAGE AND SIGNAL PROCESSING (ICVISP 2021), 2021, : 242 - 247
  • [50] A multi-scale shape description tool for 3D MR brain images
    Schnabel, JA
    Arridge, SR
    CAR '96: COMPUTER ASSISTED RADIOLOGY, 1996, 1124 : 292 - 297