HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images

被引：2

作者：

Laouarem A. ^{[1
]}

Kara-Mohamed C. ^{[1
]}

Bourennane E.-B. ^{[2
]}

Hamdi-Cherif A. ^{[1
]}

机构：

[1] Department of Computer Science, University of Ferhat Abbas 1, Setif

[2] ImViA Laboratory, University of Burgundy, Dijon

来源：

Comput. Biol. Med. | 2024年

关键词：

Convolutional Neural Network; Deep learning; Hybridization; Optical coherence tomography; Retinal disease; Vision transformer;

D O I：

10.1016/j.compbiomed.2024.108726

中图分类号：

学科分类号：

摘要：

Retinal diseases are among nowadays major public health issues, deservedly needing advanced computer-aided diagnosis. We propose a hybrid model for multi label classification, whereby seven retinal diseases are automatically classified from Optical Coherence Tomography (OCT) images. We show that, by combining the strengths of Convolutional Neural Networks (CNNs) and Visual Transformers (ViTs), we can produce a more powerful type of model for medical image classification, especially when considering local lesion information such as retinal diseases. CNNs are indeed proved to be efficient at parameter utilization and provide the ability to extract local features and multi-scale feature maps through convolutional operations. On the other hand, ViT's self-attention procedure allows processing long-range and global dependencies within an image. The paper clearly shows that the hybridization of these complementary capabilities (CNNs-ViTs) presents a high image processing potential that is more robust and efficient. The proposed model adopts a hierarchical CNN module called Convolutional Patch and Token Embedding (CPTE) instead of employing a direct tokenization approach using the raw input OCT image in the transformer. The CPTE module's role is to incorporate an inductive bias, to reduce the reliance on large-scale datasets, and to address the low-level feature extraction challenges of the ViT. In addition, considering the importance of local lesion information in OCT images, the model relies on a parallel module called Residual Depthwise-Pointwise ConvNet (RDP-ConvNet) for extracting high-level features. RDP-ConvNet utilizes depthwise and pointwise convolution layers within a residual network architecture. The overall performance of the HTC-Retina model was evaluated on three datasets: the OCT-2017, OCT-C8, and OCT-2014; outperforming previous established models, achieving accuracy rates of 99.40%, 97.00%, and 99.77%, respectively; and sensitivity rates of 99.41%, 97.00%, and 99.77%, respectively. Notably, the model showed high performance while maintaining computational efficiency. © 2024 Elsevier Ltd

引用

共 50 条

[31] Speckle denoising in optical coherence tomography images using residual deep convolutional neural network
Gour, Neha
Khanna, Pritee
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 15679 - 15695
[32] Macular hole detection and staging on optical coherence tomography images using convolutional neural network
Ojima, Akira
Sekiryu, Tetsuju
Tomita, Ryutaro
Sugano, Yukinori
Kato, Yutaka
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2018, 59 (09)
[33] Speckle denoising in optical coherence tomography images using residual deep convolutional neural network
Neha Gour
Pritee Khanna
Multimedia Tools and Applications, 2020, 79 : 15679 - 15695
[34] Convolutional Neural Network-Based Classification of Multiple Retinal Diseases Using Fundus Images
Aslam, Aqsa
Farhan, Saima
Khaliq, Momina Abdul
Anjum, Fatima
Afzaal, Ayesha
Kanwal, Faria
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 2607 - 2622
[35] The classification of stages of epiretinal membrane using convolutional neural network on optical coherence tomography image
Hung, Che-Lun
Lin, Keng-Hung
Lee, Yu-Kai
Mrozek, Dariusz
Tsai, Yin-Te
Lin, Chun Hsien
METHODS, 2023, 214 : 28 - 34
[36] Classification of optical coherence tomography images using a capsule network
Takumasa Tsuji
Yuta Hirose
Kohei Fujimori
Takuya Hirose
Asuka Oyama
Yusuke Saikawa
Tatsuya Mimura
Kenshiro Shiraishi
Takenori Kobayashi
Atsushi Mizota
Jun’ichi Kotoku
BMC Ophthalmology, 20
[37] Classification of optical coherence tomography images using a capsule network
Tsuji, Takumasa
Hirose, Yuta
Fujimori, Kohei
Hirose, Takuya
Oyama, Asuka
Saikawa, Yusuke
Mimura, Tatsuya
Shiraishi, Kenshiro
Kobayashi, Takenori
Mizota, Atsushi
Kotoku, Jun'ichi
BMC OPHTHALMOLOGY, 2020, 20 (01)
[38] Automatic detection of microaneurysms in optical coherence tomography images of retina using convolutional neural networks and transfer learning
Ramin Almasi
Abbas Vafaei
Elahe Kazeminasab
Hossein Rabbani
Scientific Reports, 12
[39] Automatic detection of microaneurysms in optical coherence tomography images of retina using convolutional neural networks and transfer learning
Almasi, Ramin
Vafaei, Abbas
Kazeminasab, Elahe
Rabbani, Hossein
SCIENTIFIC REPORTS, 2022, 12 (01)
[40] Classification of diabetic maculopathy based on optical coherence tomography images using a Vision Transformer model
Cai, Liwei
Wen, Chi
Jiang, Jingwen
Liang, Congbi
Zheng, Hongmei
Su, Yu
Chen, Changzheng
BMJ OPEN OPHTHALMOLOGY, 2023, 8 (01):

← 1 2 3 4 5 →