Semantic-Oriented Visual Prompt Learning for Diabetic Retinopathy Grading on Fundus Images

被引:0
|
作者
Zhang, Yuhan [1 ]
Ma, Xiao [2 ]
Huang, Kun [2 ]
Li, Mingchao [2 ]
Heng, Pheng-Ann [3 ,4 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen Res Inst, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Nanjing Univ Sci & Technol, Dept Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Inst Med Intelligence & XR, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Task analysis; Semantics; Biomedical imaging; Lesions; Tuning; Training; Diabetic retinopathy; prompt learning; pre-trained model; vision transformer; fundus images; CLASSIFICATION; DIAGNOSIS;
D O I
10.1109/TMI.2024.3383827
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Diabetic retinopathy (DR) is a serious ocular condition that requires effective monitoring and treatment by ophthalmologists. However, constructing a reliable DR grading model remains a challenging and costly task, heavily reliant on high-quality training sets and adequate hardware resources. In this paper, we investigate the knowledge transferability of large-scale pre-trained models (LPMs) to fundus images based on prompt learning to construct a DR grading model efficiently. Unlike full-tuning which fine-tunes all parameters of LPMs, prompt learning only involves a minimal number of additional learnable parameters while achieving a competitive effect as full-tuning. Inspired by visual prompt tuning, we propose Semantic-oriented Visual Prompt Learning (SVPL) to enhance the semantic perception ability for better extracting task-specific knowledge from LPMs, without any additional annotations. Specifically, SVPL assigns a group of learnable prompts for each DR level to fit the complex pathological manifestations and then aligns each prompt group to task-specific semantic space via a contrastive group alignment (CGA) module. We also propose a plug-and-play adapter module, Hierarchical Semantic Delivery (HSD), which allows the semantic transition of prompt groups from shallow to deep layers to facilitate efficient knowledge mining and model convergence. Our extensive experiments on three public DR grading datasets demonstrate that SVPL achieves superior results compared to other transfer tuning and DR grading methods. Further analysis suggests that the generalized knowledge from LPMs is advantageous for constructing the DR grading model on fundus images.
引用
收藏
页码:2960 / 2969
页数:10
相关论文
共 50 条
  • [21] Optimal hybrid feature selection technique for diabetic retinopathy grading using fundus images
    Mohan, N. Jagan
    Murugan, R.
    Goel, Tripti
    Mirjalili, Seyedali
    Singh, Y. K.
    Deb, Debasis
    Roy, Parthapratim
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2023, 48 (03):
  • [22] Analysis of Foveal Avascular Zone in Colour Fundus Images for Grading of Diabetic Retinopathy Severity
    Hani, Ahmad Fadzil M.
    Ngah, Nor Fariza
    George, Tara M.
    Izhar, Lila I.
    Nugroho, Hermawan
    Nugroho, Hanung Adi
    2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 5632 - 5635
  • [23] Deep Learning Fundus Image Analysis for Diabetic Retinopathy and Macular Edema Grading
    Jaakko Sahlsten
    Joel Jaskari
    Jyri Kivinen
    Lauri Turunen
    Esa Jaanio
    Kustaa Hietala
    Kimmo Kaski
    Scientific Reports, 9
  • [24] Deep Learning Fundus Image Analysis for Diabetic Retinopathy and Macular Edema Grading
    Sahlsten, Jaakko
    Jaskari, Joel
    Kivinen, Jyri
    Turunen, Lauri
    Jaanio, Esa
    Hietala, Kustaa
    Kaski, Kimmo
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [25] REPRODUCIBILITY OF RETINAL THICKENING GRADING ON STEREO COLOUR FUNDUS AND OCT IMAGES IN DIABETIC RETINOPATHY
    Picton, F.
    Hamill, B.
    Peto, T.
    Mudrew, K. A.
    Quinn, M. J.
    EUROPEAN JOURNAL OF OPHTHALMOLOGY, 2012, 22 (03) : 526 - 526
  • [26] Screening Fundus Images for Diabetic Retinopathy
    Roychowdhury, Sohini
    Koozekanani, Dara D.
    Parhi, Keshab K.
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1641 - 1645
  • [27] Detection and Classification of Microaneurysms and Haemorrhages from Fundus Images for Efficient Grading of Diabetic Retinopathy
    Patil, Preethi
    Sheelavant, Savita
    2018 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT - 2018), 2018, : 727 - 730
  • [28] A Cross-Lesion Attention Network for Accurate Diabetic Retinopathy Grading With Fundus Images
    Liu, Xiang
    Chi, Wei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [29] Attention-Driven Cascaded Network for Diabetic Retinopathy Grading from Fundus Images
    Yue, Guanghui
    Li, Yuan
    Zhou, Tianwei
    Zhou, Xiaoyan
    Liu, Yun
    Wang, Tianfu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 80
  • [30] Optimal hybrid feature selection technique for diabetic retinopathy grading using fundus images
    N Jagan Mohan
    R Murugan
    Tripti Goel
    Seyedali Mirjalili
    Y K Singh
    Debasis Deb
    Parthapratim Roy
    Sādhanā, 48