OCTFormer: An Efficient Hierarchical Transformer Network Specialized for Retinal Optical Coherence Tomography Image Recognition

被引:1
|
作者
Wang, Haoran [1 ]
Guo, Xinyu [1 ]
Song, Kaiwen [1 ]
Sun, Mingyang [1 ]
Shao, Yanbin [1 ]
Xue, Songfeng [1 ]
Zhang, Hongwei [1 ]
Zhang, Tianyu [1 ]
机构
[1] Jilin Univ, Coll Instrumentat & Elect Engn, Key Lab Geophys Explorat Equipment, Minist Educ, Changchun 130000, Peoples R China
关键词
Computer-aided diagnosis; deep learning; image classification; optical coherence tomography (OCT); vision transformer (ViT); MACULAR EDEMA; MECHANISMS;
D O I
10.1109/TIM.2023.3329106
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Diabetic retinopathy (DR) is a common complication of diabetes and one of the main causes of blindness in humans, which can be prevented by early-stage detection and treatment. Clinically, ophthalmologists use optical coherence tomography (OCT) image analysis as a basis for diagnosing DR. The existing medical resources can no longer meet the needs of the escalating patient population. Therefore, deep-learning technology has become a mainstream solution for medical image analysis. Vision transformer (ViT), a new neural network structure, has demonstrated great performance in analyzing images. However, due to the lack of inductive bias and prohibition of input image changes in size, ViT cannot avoid over-fitting problems on small datasets and limits the model to biological tissue characteristics. Thus, we propose an OCT multihead self-attention (OMHSA) block that especially calculates OCT image information based on a hybrid CNN-Transformer strategy. Compared to traditional MHSA, OMHSA integrates local information extraction differences into the calculation of self-attention and adds local information to the transformer model without relying on a multibranch network establishment. We built a neural network architecture (OCTFormer) by stacking convolutional layers and OMHSA blocks repeatedly in each stage. Similar to CNN, OCTFormer allows input size change at each stage to achieve a hierarchical structure effect. The model diagnosis effectiveness on the collected retinal OCT dataset was evaluated, and the accuracy reached 98.60%, surpassing the state-of-the-art (SOTA) model. The OCTFormer deployment to mobile terminals through knowledge distillation technology was shown, which presented a reference for deploying transformer models to actual clinical environments.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [31] Optical coherence tomography angiography retinal vascular network assessment in multiple sclerosis
    Lanzillo, R.
    Cennamo, G.
    Criscuolo, C.
    Carotenuto, A.
    Velotti, N.
    Sparnelli, F.
    Cianflone, A.
    Moccia, M.
    Morra, V. Brescia
    EUROPEAN JOURNAL OF NEUROLOGY, 2017, 24 : 491 - 491
  • [32] Erectile dysfunction and retinal microvascular network: an optical coherence tomography angiography study
    Yilmaz, Hayati
    Gultekin, Mehmet Hamza
    Yalcin, Ahmet
    INTERNATIONAL JOURNAL OF IMPOTENCE RESEARCH, 2021, 33 (03) : 318 - 324
  • [33] Optical coherence tomography angiography retinal vascular network assessment in multiple sclerosis
    Lanzillo, Roberta
    Cennamo, Gilda
    Criscuolo, Chiara
    Carotenuto, Antonio
    Velotti, Nunzio
    Sparnelli, Federica
    Cianflone, Alessandra
    Moccia, Marcello
    Morra, Vincenzo Brescia
    MULTIPLE SCLEROSIS JOURNAL, 2018, 24 (13) : 1706 - 1714
  • [34] Hierarchical enhancement of optical coherence tomography images
    Turani, Zahra
    Fatemizadeh, Emad
    Avanaki, Mohammadreza Nasiri
    2017 24TH NATIONAL AND 2ND INTERNATIONAL IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING (ICBME), 2017, : 209 - 212
  • [35] A hierarchical Transformer network for smoke video recognition
    Cheng, Guangtao
    Xian, Baoyi
    Liu, Yifan
    Chen, Xue
    Hu, Lianjun
    Song, Zhanjie
    DIGITAL SIGNAL PROCESSING, 2025, 158
  • [36] Recognition of individual scatterers against the noise background in the optical coherence tomography image
    Shilyagin, P. A.
    Novozhilov, A. A.
    Dilenyan, A. L.
    Vasilenkova, T. V.
    Moiseev, A. A.
    Kasatkina, I., V
    Gelikonov, V. M.
    Gelikonov, G., V
    QUANTUM ELECTRONICS, 2021, 51 (05) : 371 - 376
  • [37] Image Averaging, a Powerful Tool in Optical Coherence Tomography and Optical Coherence Tomography Angiography
    Waheed, Nadia K.
    Duker, Jay S.
    JAMA OPHTHALMOLOGY, 2017, 135 (11) : 1204 - 1205
  • [38] Detection of retinal crystals by Optical Coherence Tomography
    Tran, MK
    Jain, A
    Abbago, M
    Sarraf, D
    Schwartz, SD
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2004, 45 : U927 - U927
  • [39] In vivo functional retinal optical coherence tomography
    Schmoll, Tilman
    Kolbitsch, Christoph
    Leitgeb, Rainer A.
    JOURNAL OF BIOMEDICAL OPTICS, 2010, 15 (04)
  • [40] RETINAL OPTICAL COHERENCE TOMOGRAPHY IMAGING BIOMARKERS
    Pandya, Bhadra U.
    Grinton, Michael
    Mandelcorn, Efrem D.
    Felfeli, Tina
    RETINA-THE JOURNAL OF RETINAL AND VITREOUS DISEASES, 2024, 44 (03): : 369 - 380