A Novel Transformer Model With Multiple Instance Learning for Diabetic Retinopathy Classification

被引:3
|
作者
Yang, Yaoming [1 ]
Cai, Zhili [1 ]
Qiu, Shuxia [1 ,2 ]
Xu, Peng [1 ,2 ]
机构
[1] China Jiliang Univ, Coll Sci, Hangzhou 310018, Peoples R China
[2] Key Lab Intelligent Mfg Qual Big Data Tracing & An, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Vision Transformer; multiple instance learning; diabetic retinopathy; high-resolution fundus retinal images; medical image classification; DISEASE; IMAGES;
D O I
10.1109/ACCESS.2024.3351473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Diabetic retinopathy (DR) is an irreversible fundus retinopathy. A deep learning-based auto-mated DR diagnosis system can save diagnostic time. While Transformer has shown superior performance compared to Convolutional Neural Network (CNN), it typically requires pre-training with large amounts of data. Although Transformer-based DR diagnosis method may alleviate the problem of limited performance on small-scale retinal datasets by loading pre-trained weights, the size of input images is restricted to 224 x 224. The resolution of retinal images captured by fundus cameras is much higher than 224 x 224, reducing resolution in training will result in the loss of valuable information. In order to efficiently utilize high-resolution retinal images, a new Transformer model with multiple instance learning (TMIL) is proposed for DR classification. A multiple instance learning approach is firstly applied on the retinal images to segment these high-resolution images into 224 x 224 image patches. Subsequently, Vision Transformer (ViT) is used to extract features from each patch. Then, Global Instance Computing Block (GICB) is designed to calculate the inter-instance features. After introducing global information from GICB, the features are used to output the classification results. When using high-resolution retinal images, TMIL can load pre-trained weights of Transformer without being affected by weight interpolation on model performance. Experimental results using the APTOS dataset and the Messidor-1 dataset demonstrate that TMIL achieves better classification performance and reduces inference time by 62% compared with that directly inputting high-resolution images into ViT. And TMIL shows highest classification accuracy compared with the current state-of-the-art results.
引用
收藏
页码:6768 / 6776
页数:9
相关论文
共 50 条
  • [21] MIL-VT: Multiple Instance Learning Enhanced Vision Transformer for Fundus Image Classification
    Yu, Shuang
    Ma, Kai
    Bi, Qi
    Bian, Cheng
    Ning, Munan
    He, Nanjun
    Li, Yuexiang
    Liu, Hanruo
    Zheng, Yefeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 45 - 54
  • [22] MULTIPLE-INSTANCE LEARNING WITH EFFICIENT TRANSFORMER FOR BREAST TUMOR IMAGE CLASSIFICATION IN BRIGHT CHALLENGE
    Feng Wentai
    Kuang Jinbo
    Ji Zheng
    Xu Shuoyu
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING CHALLENGES (IEEE ISBI 2022), 2022,
  • [23] An Interpretable Ensemble Deep Learning Model for Diabetic Retinopathy Disease Classification
    Jiang, Hongyang
    Yang, Kang
    Gao, Mengdi
    Zhang, Dongdong
    Ma, He
    Qian, Wei
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 2045 - 2048
  • [24] Evolutionary Intelligence and Deep Learning Enabled Diabetic Retinopathy Classification Model
    Alqaralleh, Bassam A. Y.
    Aldhaban, Fahad
    Abukaraki, Anas
    AlQaralleh, Esam A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 86 - 100
  • [25] Deep learning model using classification for diabetic retinopathy detection: an overview
    Muthusamy, Dharmalingam
    Palani, Parimala
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (07)
  • [26] A diagnosis model for detection and classification of diabetic retinopathy using deep learning
    Syed, Saba Raoof
    Durai, M. A. Saleem
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2023, 12 (01):
  • [27] A diagnosis model for detection and classification of diabetic retinopathy using deep learning
    Saba Raoof Syed
    Saleem Durai M A
    Network Modeling Analysis in Health Informatics and Bioinformatics, 12
  • [28] DRCCT: Enhancing Diabetic Retinopathy Classification with a Compact Convolutional Transformer
    Touati, Mohamed
    Touati, Rabeb
    Nana, Laurent
    Benzarti, Faouzi
    Ben Yahia, Sadok
    BIG DATA AND COGNITIVE COMPUTING, 2025, 9 (01)
  • [29] Multilevel Classification Model for Diabetic Retinopathy
    Lotlekar, Keerthi. S.
    Desai, Shrinivas. D.
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES, ELECTRONICS AND MECHANICAL SYSTEMS (CTEMS), 2018, : 326 - 331
  • [30] Sparse multiple instance learning as document classification
    Yan, Shengye
    Zhu, Xiaodong
    Liu, Guoqing
    Wu, Jianxin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (03) : 4553 - 4570