An interpretable transformer network for the retinal disease classification using optical coherence tomography

被引:18
|
作者
He, Jingzhen [1 ]
Wang, Junxia [2 ]
Han, Zeyu [3 ]
Ma, Jun [4 ]
Wang, Chongjing [5 ]
Qi, Meng [2 ]
机构
[1] Shandong Univ, Dept Radiol, Qilu Hosp, Jinan 250012, Peoples R China
[2] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[3] Shandong Univ, Sch Math & Stat, Weihai 264209, Peoples R China
[4] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China
[5] China Acad Informat & Commun Technol, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
DIABETIC MACULAR EDEMA;
D O I
10.1038/s41598-023-30853-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Retinal illnesses such as age-related macular degeneration and diabetic macular edema will lead to irreversible blindness. With optical coherence tomography (OCT), doctors are able to see cross-sections of the retinal layers and provide patients with a diagnosis. Manual reading of OCT images is time-consuming, labor-intensive and even error-prone. Computer-aided diagnosis algorithms improve efficiency by automatically analyzing and diagnosing retinal OCT images. However, the accuracy and interpretability of these algorithms can be further improved through effective feature extraction, loss optimization and visualization analysis. In this paper, we propose an interpretable Swin-Poly Transformer network for performing automatically retinal OCT image classification. By shifting the window partition, the Swin-Poly Transformer constructs connections between neighboring non-overlapping windows in the previous layer and thus has the flexibility to model multi-scale features. Besides, the Swin-Poly Transformer modifies the importance of polynomial bases to refine cross entropy for better retinal OCT image classification. In addition, the proposed method also provides confidence score maps, assisting medical practitioners to understand the models' decision-making process. Experiments in OCT2017 and OCT-C8 reveal that the proposed method outperforms both the convolutional neural network approach and ViT, with an accuracy of 99.80% and an AUC of 99.99%.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] An interpretable transformer network for the retinal disease classification using optical coherence tomography
    Jingzhen He
    Junxia Wang
    Zeyu Han
    Jun Ma
    Chongjing Wang
    Meng Qi
    [J]. Scientific Reports, 13
  • [2] Automated retinal disease classification using hybrid transformer model (SViT) using optical coherence tomography images
    G. R. Hemalakshmi
    M. Murugappan
    Mohamed Yacin Sikkandar
    S. Sabarunisha Begum
    N. B. Prakash
    [J]. Neural Computing and Applications, 2024, 36 : 9171 - 9188
  • [3] Automated retinal disease classification using hybrid transformer model (SViT) using optical coherence tomography images
    Hemalakshmi, G. R.
    Murugappan, M.
    Sikkandar, Mohamed Yacin
    Begum, S. Sabarunisha
    Prakash, N. B.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (16): : 9171 - 9188
  • [4] HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification
    Ma, Zongqing
    Xie, Qiaoxue
    Xie, Pinxue
    Fan, Fan
    Gao, Xinxiao
    Zhu, Jiang
    [J]. BIOSENSORS-BASEL, 2022, 12 (07):
  • [5] Retinal disease classification based on optical coherence tomography images using convolutional neural networks
    Stanojevic, Masa
    Draskovic, Drazen
    Nikolic, Bosko
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
  • [6] Classification of optical coherence tomography images using a capsule network
    Takumasa Tsuji
    Yuta Hirose
    Kohei Fujimori
    Takuya Hirose
    Asuka Oyama
    Yusuke Saikawa
    Tatsuya Mimura
    Kenshiro Shiraishi
    Takenori Kobayashi
    Atsushi Mizota
    Jun’ichi Kotoku
    [J]. BMC Ophthalmology, 20
  • [7] Classification of optical coherence tomography images using a capsule network
    Tsuji, Takumasa
    Hirose, Yuta
    Fujimori, Kohei
    Hirose, Takuya
    Oyama, Asuka
    Saikawa, Yusuke
    Mimura, Tatsuya
    Shiraishi, Kenshiro
    Kobayashi, Takenori
    Mizota, Atsushi
    Kotoku, Jun'ichi
    [J]. BMC OPHTHALMOLOGY, 2020, 20 (01)
  • [8] OCTFormer: An Efficient Hierarchical Transformer Network Specialized for Retinal Optical Coherence Tomography Image Recognition
    Wang, Haoran
    Guo, Xinyu
    Song, Kaiwen
    Sun, Mingyang
    Shao, Yanbin
    Xue, Songfeng
    Zhang, Hongwei
    Zhang, Tianyu
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72 : 1 - 17
  • [9] Texture Classification of Retinal Layers in Optical Coherence Tomography
    Baroni, M.
    Diciotti, S.
    Evangelisti, A.
    Fortunato, P.
    La Torre, A.
    [J]. 11TH MEDITERRANEAN CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING AND COMPUTING 2007, VOLS 1 AND 2, 2007, 16 (1-2): : 847 - +
  • [10] Retinal optical coherence tomography image classification with label smoothing generative adversarial network
    He, Xingxin
    Fang, Leyuan
    Rabbani, Hossein
    Chen, Xiangdong
    Liu, Zhimin
    [J]. NEUROCOMPUTING, 2020, 405 : 37 - 47