HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images

被引:2
|
作者
Laouarem A. [1 ]
Kara-Mohamed C. [1 ]
Bourennane E.-B. [2 ]
Hamdi-Cherif A. [1 ]
机构
[1] Department of Computer Science, University of Ferhat Abbas 1, Setif
[2] ImViA Laboratory, University of Burgundy, Dijon
来源
关键词
Convolutional Neural Network; Deep learning; Hybridization; Optical coherence tomography; Retinal disease; Vision transformer;
D O I
10.1016/j.compbiomed.2024.108726
中图分类号
学科分类号
摘要
Retinal diseases are among nowadays major public health issues, deservedly needing advanced computer-aided diagnosis. We propose a hybrid model for multi label classification, whereby seven retinal diseases are automatically classified from Optical Coherence Tomography (OCT) images. We show that, by combining the strengths of Convolutional Neural Networks (CNNs) and Visual Transformers (ViTs), we can produce a more powerful type of model for medical image classification, especially when considering local lesion information such as retinal diseases. CNNs are indeed proved to be efficient at parameter utilization and provide the ability to extract local features and multi-scale feature maps through convolutional operations. On the other hand, ViT's self-attention procedure allows processing long-range and global dependencies within an image. The paper clearly shows that the hybridization of these complementary capabilities (CNNs-ViTs) presents a high image processing potential that is more robust and efficient. The proposed model adopts a hierarchical CNN module called Convolutional Patch and Token Embedding (CPTE) instead of employing a direct tokenization approach using the raw input OCT image in the transformer. The CPTE module's role is to incorporate an inductive bias, to reduce the reliance on large-scale datasets, and to address the low-level feature extraction challenges of the ViT. In addition, considering the importance of local lesion information in OCT images, the model relies on a parallel module called Residual Depthwise-Pointwise ConvNet (RDP-ConvNet) for extracting high-level features. RDP-ConvNet utilizes depthwise and pointwise convolution layers within a residual network architecture. The overall performance of the HTC-Retina model was evaluated on three datasets: the OCT-2017, OCT-C8, and OCT-2014; outperforming previous established models, achieving accuracy rates of 99.40%, 97.00%, and 99.77%, respectively; and sensitivity rates of 99.41%, 97.00%, and 99.77%, respectively. Notably, the model showed high performance while maintaining computational efficiency. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [21] Efficient knowledge distillation for hybrid models: A vision transformer-convolutional neural network to convolutional neural network approach for classifying remote sensing images
    Song, Huaxiang
    Yuan, Yuxuan
    Ouyang, Zhiwei
    Yang, Yu
    Xiang, Hui
    IET CYBER-SYSTEMS AND ROBOTICS, 2024, 6 (03)
  • [22] Single-image super-resolution using lightweight transformer-convolutional neural network hybrid model
    Liu, Yuanyuan
    Yue, Mengtao
    Yan, Han
    Zhu, Lu
    IET IMAGE PROCESSING, 2023, 17 (10) : 2881 - 2893
  • [23] Deep Residual Network for Diagnosis of Retinal Diseases Using Optical Coherence Tomography Images
    Sohaib Asif
    Kamran Amjad
    Interdisciplinary Sciences: Computational Life Sciences, 2022, 14 : 906 - 916
  • [24] Deep Residual Network for Diagnosis of Retinal Diseases Using Optical Coherence Tomography Images
    Asif, Sohaib
    Amjad, Kamran
    Qurrat-ul-Ain
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2022, 14 (04) : 906 - 916
  • [25] Classification of Optical Coherence Tomography using Convolutional Neural Networks
    Saraiva, A. A.
    Santos, D. B. S.
    Pedro, Pimentel
    Moura Sousa, Jose Vigno
    Fonseca Ferreira, N. M.
    Batista Neto, J. E. S.
    Soares, Salviano
    Valente, Antonio
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3: BIOINFORMATICS, 2020, : 168 - 175
  • [26] Diagnosis of Eye Retinal Diseases Based on Convolutional Neural Networks Using Optical Coherence Images
    Sertkaya, Mehmet Emre
    Ergen, Burhan
    Togacar, Mesut
    PROCEEDINGS OF THE 2019 23RD INTERNATIONAL CONFERENCE ELECTRONICS (ELECTRONICS 2019), 2019,
  • [27] Multi-Fundus Diseases Classification Using Retinal Optical Coherence Tomography Images with Swin Transformer V2
    Li, Zhenwei
    Han, Yanqi
    Yang, Xiaoli
    JOURNAL OF IMAGING, 2023, 9 (10)
  • [28] Robust total retina thickness segmentation in optical coherence tomography images using convolutional neural networks
    Venhuizen, Freerk G.
    van Ginneken, Bram
    Liefers, Bart
    van Grinsven, Mark J. J. P.
    Fauser, Sascha
    Hoyng, Carel
    Theelen, Thomas
    Sanchez, Clara I.
    BIOMEDICAL OPTICS EXPRESS, 2017, 8 (07): : 3292 - 3316
  • [29] Classification of Retinal Diseases in Optical Coherence Tomography Images Using Artificial Intelligence and Firefly Algorithm
    Ozdas, Mehmet Batuhan
    Uysal, Fatih
    Hardalac, Firat
    DIAGNOSTICS, 2023, 13 (03)
  • [30] Iterative fusion convolutional neural networks for classification of optical coherence tomography images
    Fang, Leyuan
    Jin, Yuxuan
    Huang, Laifeng
    Guo, Siyu
    Zhao, Guangzhe
    Chen, Xiangdong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 327 - 333