HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images

被引:2
|
作者
Laouarem A. [1 ]
Kara-Mohamed C. [1 ]
Bourennane E.-B. [2 ]
Hamdi-Cherif A. [1 ]
机构
[1] Department of Computer Science, University of Ferhat Abbas 1, Setif
[2] ImViA Laboratory, University of Burgundy, Dijon
来源
关键词
Convolutional Neural Network; Deep learning; Hybridization; Optical coherence tomography; Retinal disease; Vision transformer;
D O I
10.1016/j.compbiomed.2024.108726
中图分类号
学科分类号
摘要
Retinal diseases are among nowadays major public health issues, deservedly needing advanced computer-aided diagnosis. We propose a hybrid model for multi label classification, whereby seven retinal diseases are automatically classified from Optical Coherence Tomography (OCT) images. We show that, by combining the strengths of Convolutional Neural Networks (CNNs) and Visual Transformers (ViTs), we can produce a more powerful type of model for medical image classification, especially when considering local lesion information such as retinal diseases. CNNs are indeed proved to be efficient at parameter utilization and provide the ability to extract local features and multi-scale feature maps through convolutional operations. On the other hand, ViT's self-attention procedure allows processing long-range and global dependencies within an image. The paper clearly shows that the hybridization of these complementary capabilities (CNNs-ViTs) presents a high image processing potential that is more robust and efficient. The proposed model adopts a hierarchical CNN module called Convolutional Patch and Token Embedding (CPTE) instead of employing a direct tokenization approach using the raw input OCT image in the transformer. The CPTE module's role is to incorporate an inductive bias, to reduce the reliance on large-scale datasets, and to address the low-level feature extraction challenges of the ViT. In addition, considering the importance of local lesion information in OCT images, the model relies on a parallel module called Residual Depthwise-Pointwise ConvNet (RDP-ConvNet) for extracting high-level features. RDP-ConvNet utilizes depthwise and pointwise convolution layers within a residual network architecture. The overall performance of the HTC-Retina model was evaluated on three datasets: the OCT-2017, OCT-C8, and OCT-2014; outperforming previous established models, achieving accuracy rates of 99.40%, 97.00%, and 99.77%, respectively; and sensitivity rates of 99.41%, 97.00%, and 99.77%, respectively. Notably, the model showed high performance while maintaining computational efficiency. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [31] Speckle denoising in optical coherence tomography images using residual deep convolutional neural network
    Gour, Neha
    Khanna, Pritee
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 15679 - 15695
  • [32] Macular hole detection and staging on optical coherence tomography images using convolutional neural network
    Ojima, Akira
    Sekiryu, Tetsuju
    Tomita, Ryutaro
    Sugano, Yukinori
    Kato, Yutaka
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2018, 59 (09)
  • [33] Speckle denoising in optical coherence tomography images using residual deep convolutional neural network
    Neha Gour
    Pritee Khanna
    Multimedia Tools and Applications, 2020, 79 : 15679 - 15695
  • [34] Convolutional Neural Network-Based Classification of Multiple Retinal Diseases Using Fundus Images
    Aslam, Aqsa
    Farhan, Saima
    Khaliq, Momina Abdul
    Anjum, Fatima
    Afzaal, Ayesha
    Kanwal, Faria
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 2607 - 2622
  • [35] The classification of stages of epiretinal membrane using convolutional neural network on optical coherence tomography image
    Hung, Che-Lun
    Lin, Keng-Hung
    Lee, Yu-Kai
    Mrozek, Dariusz
    Tsai, Yin-Te
    Lin, Chun Hsien
    METHODS, 2023, 214 : 28 - 34
  • [36] Classification of optical coherence tomography images using a capsule network
    Takumasa Tsuji
    Yuta Hirose
    Kohei Fujimori
    Takuya Hirose
    Asuka Oyama
    Yusuke Saikawa
    Tatsuya Mimura
    Kenshiro Shiraishi
    Takenori Kobayashi
    Atsushi Mizota
    Jun’ichi Kotoku
    BMC Ophthalmology, 20
  • [37] Classification of optical coherence tomography images using a capsule network
    Tsuji, Takumasa
    Hirose, Yuta
    Fujimori, Kohei
    Hirose, Takuya
    Oyama, Asuka
    Saikawa, Yusuke
    Mimura, Tatsuya
    Shiraishi, Kenshiro
    Kobayashi, Takenori
    Mizota, Atsushi
    Kotoku, Jun'ichi
    BMC OPHTHALMOLOGY, 2020, 20 (01)
  • [38] Automatic detection of microaneurysms in optical coherence tomography images of retina using convolutional neural networks and transfer learning
    Ramin Almasi
    Abbas Vafaei
    Elahe Kazeminasab
    Hossein Rabbani
    Scientific Reports, 12
  • [39] Automatic detection of microaneurysms in optical coherence tomography images of retina using convolutional neural networks and transfer learning
    Almasi, Ramin
    Vafaei, Abbas
    Kazeminasab, Elahe
    Rabbani, Hossein
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [40] Classification of diabetic maculopathy based on optical coherence tomography images using a Vision Transformer model
    Cai, Liwei
    Wen, Chi
    Jiang, Jingwen
    Liang, Congbi
    Zheng, Hongmei
    Su, Yu
    Chen, Changzheng
    BMJ OPEN OPHTHALMOLOGY, 2023, 8 (01):