OCTFormer: An Efficient Hierarchical Transformer Network Specialized for Retinal Optical Coherence Tomography Image Recognition

被引:1
|
作者
Wang, Haoran [1 ]
Guo, Xinyu [1 ]
Song, Kaiwen [1 ]
Sun, Mingyang [1 ]
Shao, Yanbin [1 ]
Xue, Songfeng [1 ]
Zhang, Hongwei [1 ]
Zhang, Tianyu [1 ]
机构
[1] Jilin Univ, Coll Instrumentat & Elect Engn, Key Lab Geophys Explorat Equipment, Minist Educ, Changchun 130000, Peoples R China
关键词
Computer-aided diagnosis; deep learning; image classification; optical coherence tomography (OCT); vision transformer (ViT); MACULAR EDEMA; MECHANISMS;
D O I
10.1109/TIM.2023.3329106
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Diabetic retinopathy (DR) is a common complication of diabetes and one of the main causes of blindness in humans, which can be prevented by early-stage detection and treatment. Clinically, ophthalmologists use optical coherence tomography (OCT) image analysis as a basis for diagnosing DR. The existing medical resources can no longer meet the needs of the escalating patient population. Therefore, deep-learning technology has become a mainstream solution for medical image analysis. Vision transformer (ViT), a new neural network structure, has demonstrated great performance in analyzing images. However, due to the lack of inductive bias and prohibition of input image changes in size, ViT cannot avoid over-fitting problems on small datasets and limits the model to biological tissue characteristics. Thus, we propose an OCT multihead self-attention (OMHSA) block that especially calculates OCT image information based on a hybrid CNN-Transformer strategy. Compared to traditional MHSA, OMHSA integrates local information extraction differences into the calculation of self-attention and adds local information to the transformer model without relying on a multibranch network establishment. We built a neural network architecture (OCTFormer) by stacking convolutional layers and OMHSA blocks repeatedly in each stage. Similar to CNN, OCTFormer allows input size change at each stage to achieve a hierarchical structure effect. The model diagnosis effectiveness on the collected retinal OCT dataset was evaluated, and the accuracy reached 98.60%, surpassing the state-of-the-art (SOTA) model. The OCTFormer deployment to mobile terminals through knowledge distillation technology was shown, which presented a reference for deploying transformer models to actual clinical environments.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [41] Retinal nonperfusion in optical coherence tomography angiography
    Liu, Limin
    Xia, Fan
    Hua, Rui
    PHOTODIAGNOSIS AND PHOTODYNAMIC THERAPY, 2021, 33
  • [42] Retinal optical coherence tomography biomarkers in dementia
    Goerdt, L.
    Holz, F. G.
    Finger, R. P.
    OPHTHALMOLOGIE, 2024, 121 (02): : 84 - 92
  • [43] Statistical Modeling of Retinal Optical Coherence Tomography
    Amini, Zahra
    Rabbani, Hossein
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (06) : 1544 - 1554
  • [44] Image Distortion of Optical Coherence Tomography
    安源
    姚建铨
    Transactions of Tianjin University, 2004, (01) : 81 - 84
  • [45] OPTICAL COHERENCE TOMOGRAPHY IMAGE SEGMENTATION
    Duan, Jinming
    Tench, Christopher
    Gottlob, Irene
    Proudlock, Frank
    Bai, Li
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 4278 - 4282
  • [46] An Efficient Retinal Fluid Segmentation Network Based on Large Receptive Field Context Capture for Optical Coherence Tomography Images
    Qi, Hang
    Wang, Weijiang
    Dang, Hua
    Chen, Yueyang
    Jia, Minli
    Wang, Xiaohua
    ENTROPY, 2025, 27 (01)
  • [47] Optical Coherence Tomography of Retinal and Choroidal Tumors
    Say, Emil Anthony T.
    Shah, Sanket U.
    Ferenczy, Sandor
    Shields, Carol L.
    JOURNAL OF OPHTHALMOLOGY, 2011, 2011
  • [48] Retinal assessment using optical coherence tomography
    Costa, Rogerio A.
    Skaf, Mirian
    Melo, Luiz A. S., Jr.
    Calucci, Daniela
    Cardillo, Jose A.
    Castro, Jarbas C.
    Huang, David
    Wojtkowski, Maciej
    PROGRESS IN RETINAL AND EYE RESEARCH, 2006, 25 (03) : 325 - 353
  • [49] Functional optical coherence tomography of retinal photoreceptors
    Yao, Xincheng
    Son, Taeyoon
    Kim, Tae-Hoon
    Lu, Yiming
    EXPERIMENTAL BIOLOGY AND MEDICINE, 2018, 243 (17-18) : 1256 - 1264
  • [50] Doppler Optical Coherence Tomography of Retinal Circulation
    Tan, Ou
    Wang, Yimin
    Konduru, Ranjith K.
    Zhang, Xinbo
    Sadda, SriniVas R.
    Huang, David
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2012, (67):