HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification

被引:13
|
作者
Ma, Zongqing [1 ,2 ]
Xie, Qiaoxue [1 ,2 ]
Xie, Pinxue [3 ]
Fan, Fan [1 ,2 ]
Gao, Xinxiao [3 ]
Zhu, Jiang [1 ,2 ]
机构
[1] Beijing Informat Sci & Technol Univ, Key Lab, Minist Educ Optoelect Measurement Technol & Instr, Beijing 100192, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Beijing Lab Biomed Testing Technol & Instruments, Beijing 100192, Peoples R China
[3] Capital Med Univ, Beijing Anzhen Hosp, Beijing 100029, Peoples R China
来源
BIOSENSORS-BASEL | 2022年 / 12卷 / 07期
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
convolutional neural network; vision transformer; optical coherence tomography; image classification; DIABETIC MACULAR EDEMA; DEGENERATION; ATTENTION;
D O I
10.3390/bios12070542
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Automatic and accurate optical coherence tomography (OCT) image classification is of great significance to computer-assisted diagnosis of retinal disease. In this study, we propose a hybrid ConvNet-Transformer network (HCTNet) and verify the feasibility of a Transformer-based method for retinal OCT image classification. The HCTNet first utilizes a low-level feature extraction module based on the residual dense block to generate low-level features for facilitating the network training. Then, two parallel branches of the Transformer and the ConvNet are designed to exploit the global and local context of the OCT images. Finally, a feature fusion module based on an adaptive re-weighting mechanism is employed to combine the extracted global and local features for predicting the category of OCT images in the testing datasets. The HCTNet combines the advantage of the convolutional neural network in extracting local features and the advantage of the vision Transformer in establishing long-range dependencies. A verification on two public retinal OCT datasets shows that our HCTNet method achieves an overall accuracy of 91.56% and 86.18%, respectively, outperforming the pure ViT and several ConvNet-based classification methods.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] An interpretable transformer network for the retinal disease classification using optical coherence tomography
    Jingzhen He
    Junxia Wang
    Zeyu Han
    Jun Ma
    Chongjing Wang
    Meng Qi
    Scientific Reports, 13
  • [2] An interpretable transformer network for the retinal disease classification using optical coherence tomography
    He, Jingzhen
    Wang, Junxia
    Han, Zeyu
    Ma, Jun
    Wang, Chongjing
    Qi, Meng
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [3] OCTFormer: An Efficient Hierarchical Transformer Network Specialized for Retinal Optical Coherence Tomography Image Recognition
    Wang, Haoran
    Guo, Xinyu
    Song, Kaiwen
    Sun, Mingyang
    Shao, Yanbin
    Xue, Songfeng
    Zhang, Hongwei
    Zhang, Tianyu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72 : 1 - 17
  • [4] Retinal optical coherence tomography image classification with label smoothing generative adversarial network
    He, Xingxin
    Fang, Leyuan
    Rabbani, Hossein
    Chen, Xiangdong
    Liu, Zhimin
    NEUROCOMPUTING, 2020, 405 : 37 - 47
  • [5] HCTNet: A hybrid CNN-transformer network for breast ultrasound image segmentation
    He, Qiqi
    Yang, Qiuju
    Xie, Minghao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 155
  • [6] MBT: Model-Based Transformer for retinal optical coherence tomography image and video multi-classification
    Hammou, Badr Ait
    Antaki, Fares
    Boucher, Marie-Carole
    Duval, Renaud
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 178
  • [7] Automated retinal disease classification using hybrid transformer model (SViT) using optical coherence tomography images
    G. R. Hemalakshmi
    M. Murugappan
    Mohamed Yacin Sikkandar
    S. Sabarunisha Begum
    N. B. Prakash
    Neural Computing and Applications, 2024, 36 : 9171 - 9188
  • [8] Automated retinal disease classification using hybrid transformer model (SViT) using optical coherence tomography images
    Hemalakshmi, G. R.
    Murugappan, M.
    Sikkandar, Mohamed Yacin
    Begum, S. Sabarunisha
    Prakash, N. B.
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (16): : 9171 - 9188
  • [9] Optical coherence tomography retinal classification using deep neural network
    Khudhur, Ahmed Mahmood
    JOURNAL OF OPTICS-INDIA, 2025,
  • [10] A lightweight deep learning model for retinal optical coherence tomography image classification
    Mathews, Mili Rosline
    Anzar, Sharafudeen Thaha Mohammed
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2023, 33 (01) : 204 - 216