3D multi-scale vision transformer for lung nodule detection in chest CT images

被引:0
|
作者
Hassan Mkindu
Longwen Wu
Yaqin Zhao
机构
[1] Harbin Institute of Technology,School of Electronics and Information Engineering
来源
关键词
Computer-aided diagnosis; Computed tomography; Vision transformer; Lung nodule; 3D-MSViT;
D O I
暂无
中图分类号
学科分类号
摘要
Lung cancer becomes the most prominent cause of cancer-related death in society. Normally, radiologists use computed tomography (CT) to diagnose lung nodules in lung cancer patients. A single CT scan for a patient produces hundreds of images that are manually analyzed by radiologists which is a big burden and sometimes leads to inaccuracy. Recently, many computer-aided diagnosis (CAD) systems integrated with deep learning architectures have been proposed to assist radiologists. This study proposes the CAD scheme based on a 3D multi-scale vision transformer (3D-MSViT) to enhance multi-scale feature extraction and improves lung nodule prediction efficiency from 3D CT images. The 3D-MSViT architecture adopted a local–global transformer block structure whereby the local transformer stage individually processes each scale patch and forwards it to the global transformer level for merging multi-scale features. The transformer blocks fully relied on the attention mechanism without the inclusion of the convolutional neural network to reduce the network parameters. The proposed CAD scheme was validated on 888 CT images of the Lung Nodule Analysis 2016 (LUNA16) public dataset. Free-response receiver operating characteristics analysis was adopted to evaluate the proposed method. The 3D-MSViT algorithm obtained the highest sensitivity of 97.81% and competition performance metrics of 0.911. Therefore, the 3D-MSViT scheme obtained comparable results with low network complexity related to the counterpart deep learning approaches in prior studies.
引用
收藏
页码:2473 / 2480
页数:7
相关论文
共 50 条
  • [31] Nodule detection methods using autocorrelation features on 3D chest CT scans
    Hara, Takeshi
    Zhou, Xiangrong
    Okura, Shoji
    Fujita, Hiroshi
    Kiryu, Takuji
    Hoshi, Hiroaki
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2007, 2 : S361 - S362
  • [32] Lung Nodule Detection With Deep Learning in 3D Thoracic MR Images
    Li, Yanfeng
    Zhang, Linlin
    Chen, Houjin
    Yang, Na
    IEEE ACCESS, 2019, 7 (37822-37832) : 37822 - 37832
  • [33] Fast lung nodule detection in chest CT images using cylindrical nodule-enhancement filter
    Teramoto, Atsushi
    Fujita, Hiroshi
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2013, 8 (02) : 193 - 205
  • [34] Fast lung nodule detection in chest CT images using cylindrical nodule-enhancement filter
    Atsushi Teramoto
    Hiroshi Fujita
    International Journal of Computer Assisted Radiology and Surgery, 2013, 8 : 193 - 205
  • [35] A Fusion Deep Learning Model of ResNet and Vision Transformer for 3D CT Images
    Liu, Chiyu
    Sun, Cunjie
    IEEE ACCESS, 2024, 12 : 93389 - 93397
  • [36] An Invariant Multi-Scale Saliency Detection for 3D Mesh
    El Chakik, Abdallah
    El Sayed, Abdul Rahman
    Nohra, Shadi
    PROCEEDINGS OF THE 2018 4TH INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT - 2018), 2018, : 310 - 314
  • [37] Multi-Scale PointPillars 3D Object Detection Network
    Ya, Hang
    Luo, Guiming
    PROCEEDINGS OF THE 2019 IEEE 18TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2019), 2019, : 174 - 179
  • [38] Automatic Orientation of Multi-Scale Terrestrial Images for 3D Reconstruction
    Tommaselli, Antonio M. G.
    Berveglieri, Adilson
    REMOTE SENSING, 2014, 6 (04) : 3020 - 3040
  • [39] HMTN: Hierarchical Multi-scale Transformer Network for 3D Shape Recognition
    Zhao, Yue
    Nie, Weizhi
    Gao, Zan
    Liu, An-an
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [40] Location bias in nodule detection on chest CT images
    Judy, PF
    Seitzer, SE
    Jacobson, FL
    Feldman, U
    RADIOLOGY, 1996, 201 : 1069 - 1069