A Novel Transformer Model With Multiple Instance Learning for Diabetic Retinopathy Classification

Cited by: 2
Authors
Yang, Yaoming [1]
Cai, Zhili [1]
Qiu, Shuxia [1, 2]
Xu, Peng [1, 2]
Affiliations
[1] China Jiliang Univ, Coll Sci, Hangzhou 310018, Peoples R China
[2] Key Lab Intelligent Mfg Qual Big Data Tracing & An, Hangzhou 310018, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Vision Transformer; multiple instance learning; diabetic retinopathy; high-resolution fundus retinal images; medical image classification; DISEASE; IMAGES;
DOI
10.1109/ACCESS.2024.3351473
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Diabetic retinopathy (DR) is an irreversible fundus retinopathy. A deep learning-based automated DR diagnosis system can save diagnostic time. Although Transformers have shown superior performance compared to convolutional neural networks (CNNs), they typically require pre-training on large amounts of data. Transformer-based DR diagnosis methods can alleviate the problem of limited performance on small-scale retinal datasets by loading pre-trained weights, but doing so restricts the input image size to 224 x 224. The resolution of retinal images captured by fundus cameras is much higher than 224 x 224, and reducing the resolution for training loses valuable information. To use high-resolution retinal images efficiently, a new Transformer model with multiple instance learning (TMIL) is proposed for DR classification. A multiple instance learning approach is first applied to the retinal images to segment these high-resolution images into 224 x 224 image patches. A Vision Transformer (ViT) then extracts features from each patch. Next, a Global Instance Computing Block (GICB) is designed to calculate the inter-instance features. After global information from the GICB is introduced, the features are used to output the classification results. When using high-resolution retinal images, TMIL can load pre-trained Transformer weights without model performance being affected by weight interpolation. Experimental results on the APTOS dataset and the Messidor-1 dataset demonstrate that TMIL achieves better classification performance and reduces inference time by 62% compared with directly inputting high-resolution images into ViT. TMIL also shows the highest classification accuracy compared with the current state-of-the-art results.
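The patching step described in the abstract can be illustrated with a minimal sketch. It only computes where 224 x 224 instances would fall on a high-resolution image. The function name `patch_boxes`, the non-overlapping grid, and the cropping of leftover edges are assumptions made for illustration; the abstract does not state how the authors handle overlap, padding, or resizing, and the ViT feature extractor and GICB are not reproduced here.

```python
from typing import List, Tuple

# (top, left, bottom, right) in pixels, end-exclusive. Hypothetical type alias.
Box = Tuple[int, int, int, int]


def patch_boxes(height: int, width: int, patch: int = 224) -> List[Box]:
    """Return a grid of non-overlapping patch x patch boxes for one image.

    Assumption: rows and columns that do not fill a whole patch are dropped.
    The paper may pad or resize instead; the abstract does not say.
    """
    if patch <= 0:
        raise ValueError("patch must be positive")
    rows, cols = height // patch, width // patch
    return [
        (r * patch, c * patch, (r + 1) * patch, (c + 1) * patch)
        for r in range(rows)
        for c in range(cols)
    ]


if __name__ == "__main__":
    # An 896 x 1120 image yields a 4 x 5 grid, i.e. a bag of 20 instances.
    boxes = patch_boxes(896, 1120)
    print(len(boxes))  # 20
    print(boxes[0], boxes[-1])  # (0, 0, 224, 224) (672, 896, 896, 1120)
```

Each box can then be used to crop one instance from the image array, so that every instance already has the input size expected by the pre-trained weights and no weight interpolation is needed.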
Pages: 6768-6776
Number of pages: 9