HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images

Cited by: 2

Authors:

Laouarem A. [1]

Kara-Mohamed C. [1]

Bourennane E.-B. [2]

Hamdi-Cherif A. [1]

Affiliations:

[1] Department of Computer Science, University of Ferhat Abbas 1, Setif

[2] ImViA Laboratory, University of Burgundy, Dijon

Source:

Comput. Biol. Med. | 2024

Keywords:

Convolutional Neural Network; Deep learning; Hybridization; Optical coherence tomography; Retinal disease; Vision transformer

DOI:

10.1016/j.compbiomed.2024.108726

Abstract:

Retinal diseases are among today's major public health issues and call for advanced computer-aided diagnosis. We propose a hybrid model for multi-label classification that automatically classifies seven retinal diseases from Optical Coherence Tomography (OCT) images. We show that combining the strengths of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) yields a more powerful model for medical image classification, especially where local lesion information matters, as in retinal diseases. CNNs use parameters efficiently and extract local features and multi-scale feature maps through convolutional operations. The ViT's self-attention mechanism, in contrast, captures long-range and global dependencies within an image. The paper shows that hybridizing these complementary capabilities (CNN-ViT) gives image processing that is more robust and efficient. Rather than tokenizing the raw input OCT image directly for the transformer, the proposed model adopts a hierarchical CNN module called Convolutional Patch and Token Embedding (CPTE). The CPTE module incorporates an inductive bias, reduces the reliance on large-scale datasets, and addresses the low-level feature extraction challenges of the ViT. In addition, given the importance of local lesion information in OCT images, the model relies on a parallel module called Residual Depthwise-Pointwise ConvNet (RDP-ConvNet) for extracting high-level features. RDP-ConvNet uses depthwise and pointwise convolution layers within a residual network architecture.
The overall performance of the HTC-Retina model was evaluated on three datasets, OCT-2017, OCT-C8, and OCT-2014, where it outperformed previously established models, achieving accuracy rates of 99.40%, 97.00%, and 99.77%, and sensitivity rates of 99.41%, 97.00%, and 99.77%, respectively. Notably, the model achieved this performance while maintaining computational efficiency. © 2024 Elsevier Ltd
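
The abstract names depthwise and pointwise convolutions inside a residual architecture but gives no layer details. The following is a minimal pure-Python sketch of a generic depthwise-pointwise residual block, not the authors' RDP-ConvNet: the function names, the 3x3 kernel size, the zero padding, and the omission of normalization, activation, and bias terms are all assumptions made for illustration. It also counts weights to show why such layers are parameter-efficient.

```python
def depthwise_conv3x3(x, kernels):
    """Apply one 3x3 kernel per channel (no channel mixing).

    x: nested list [C][H][W]; kernels: nested list [C][3][3].
    Zero padding keeps the spatial size unchanged (an assumption).
    """
    channels, height, width = len(x), len(x[0]), len(x[0][0])
    out = [[[0.0] * width for _ in range(height)] for _ in range(channels)]
    for c in range(channels):
        for i in range(height):
            for j in range(width):
                acc = 0.0
                for di in range(3):
                    for dj in range(3):
                        r, s = i + di - 1, j + dj - 1
                        if 0 <= r < height and 0 <= s < width:
                            acc += x[c][r][s] * kernels[c][di][dj]
                out[c][i][j] = acc
    return out


def pointwise_conv(x, weights):
    """Apply a 1x1 convolution: mix channels independently at every pixel.

    weights: nested list [C_out][C_in].
    """
    c_in, height, width = len(x), len(x[0]), len(x[0][0])
    return [
        [
            [sum(w_row[c] * x[c][i][j] for c in range(c_in)) for j in range(width)]
            for i in range(height)
        ]
        for w_row in weights
    ]


def residual_dp_block(x, dw_kernels, pw_weights):
    """Return x + pointwise(depthwise(x)); needs C_out == C_in for the skip."""
    y = pointwise_conv(depthwise_conv3x3(x, dw_kernels), pw_weights)
    return [
        [[a + b for a, b in zip(row_x, row_y)] for row_x, row_y in zip(ch_x, ch_y)]
        for ch_x, ch_y in zip(x, y)
    ]


def standard_conv_params(c_in, c_out, k=3):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def separable_conv_params(c_in, c_out, k=3):
    """Weights in a depthwise k x k plus pointwise 1x1 pair (bias ignored)."""
    return k * k * c_in + c_in * c_out


# Hypothetical 64-channel layer, chosen only to illustrate the ratio.
print(standard_conv_params(64, 64), separable_conv_params(64, 64))
```

For the hypothetical 64-channel case, the standard 3x3 convolution uses 36864 weights and the depthwise-pointwise pair uses 4672, roughly an eightfold reduction. In practice a framework layer would replace these loops; the nested lists are used here only to keep the sketch dependency-free.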

