HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification

被引：13

作者：

Ma, Zongqing ^{[1
,2
]}

Xie, Qiaoxue ^{[1
,2
]}

Xie, Pinxue ^{[3
]}

Fan, Fan ^{[1
,2
]}

Gao, Xinxiao ^{[3
]}

Zhu, Jiang ^{[1
,2
]}

机构：

[1] Beijing Informat Sci & Technol Univ, Key Lab, Minist Educ Optoelect Measurement Technol & Instr, Beijing 100192, Peoples R China

[2] Beijing Informat Sci & Technol Univ, Beijing Lab Biomed Testing Technol & Instruments, Beijing 100192, Peoples R China

[3] Capital Med Univ, Beijing Anzhen Hosp, Beijing 100029, Peoples R China

来源：

BIOSENSORS-BASEL | 2022年 / 12卷 / 07期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

convolutional neural network; vision transformer; optical coherence tomography; image classification; DIABETIC MACULAR EDEMA; DEGENERATION; ATTENTION;

D O I：

10.3390/bios12070542

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Automatic and accurate optical coherence tomography (OCT) image classification is of great significance to computer-assisted diagnosis of retinal disease. In this study, we propose a hybrid ConvNet-Transformer network (HCTNet) and verify the feasibility of a Transformer-based method for retinal OCT image classification. The HCTNet first utilizes a low-level feature extraction module based on the residual dense block to generate low-level features for facilitating the network training. Then, two parallel branches of the Transformer and the ConvNet are designed to exploit the global and local context of the OCT images. Finally, a feature fusion module based on an adaptive re-weighting mechanism is employed to combine the extracted global and local features for predicting the category of OCT images in the testing datasets. The HCTNet combines the advantage of the convolutional neural network in extracting local features and the advantage of the vision Transformer in establishing long-range dependencies. A verification on two public retinal OCT datasets shows that our HCTNet method achieves an overall accuracy of 91.56% and 86.18%, respectively, outperforming the pure ViT and several ConvNet-based classification methods.

引用

页数：15

共 50 条

[1] An interpretable transformer network for the retinal disease classification using optical coherence tomography
Jingzhen He
Junxia Wang
Zeyu Han
Jun Ma
Chongjing Wang
Meng Qi
Scientific Reports, 13
[2] An interpretable transformer network for the retinal disease classification using optical coherence tomography
He, Jingzhen
Wang, Junxia
Han, Zeyu
Ma, Jun
Wang, Chongjing
Qi, Meng
SCIENTIFIC REPORTS, 2023, 13 (01)
[3] OCTFormer: An Efficient Hierarchical Transformer Network Specialized for Retinal Optical Coherence Tomography Image Recognition
Wang, Haoran
Guo, Xinyu
Song, Kaiwen
Sun, Mingyang
Shao, Yanbin
Xue, Songfeng
Zhang, Hongwei
Zhang, Tianyu
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72 : 1 - 17
[4] Retinal optical coherence tomography image classification with label smoothing generative adversarial network
He, Xingxin
Fang, Leyuan
Rabbani, Hossein
Chen, Xiangdong
Liu, Zhimin
NEUROCOMPUTING, 2020, 405 : 37 - 47
[5] HCTNet: A hybrid CNN-transformer network for breast ultrasound image segmentation
He, Qiqi
Yang, Qiuju
Xie, Minghao
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 155
[6] MBT: Model-Based Transformer for retinal optical coherence tomography image and video multi-classification
Hammou, Badr Ait
Antaki, Fares
Boucher, Marie-Carole
Duval, Renaud
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 178
[7] Automated retinal disease classification using hybrid transformer model (SViT) using optical coherence tomography images
G. R. Hemalakshmi
M. Murugappan
Mohamed Yacin Sikkandar
S. Sabarunisha Begum
N. B. Prakash
Neural Computing and Applications, 2024, 36 : 9171 - 9188
[8] Automated retinal disease classification using hybrid transformer model (SViT) using optical coherence tomography images
Hemalakshmi, G. R.
Murugappan, M.
Sikkandar, Mohamed Yacin
Begum, S. Sabarunisha
Prakash, N. B.
NEURAL COMPUTING & APPLICATIONS, 2024, 36 (16): : 9171 - 9188
[9] Optical coherence tomography retinal classification using deep neural network
Khudhur, Ahmed Mahmood
JOURNAL OF OPTICS-INDIA, 2025,
[10] A lightweight deep learning model for retinal optical coherence tomography image classification
Mathews, Mili Rosline
Anzar, Sharafudeen Thaha Mohammed
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2023, 33 (01) : 204 - 216

← 1 2 3 4 5 →