Semantic-Oriented Visual Prompt Learning for Diabetic Retinopathy Grading on Fundus Images

被引:0
|
作者
Zhang, Yuhan [1 ]
Ma, Xiao [2 ]
Huang, Kun [2 ]
Li, Mingchao [2 ]
Heng, Pheng-Ann [3 ,4 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen Res Inst, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Nanjing Univ Sci & Technol, Dept Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Inst Med Intelligence & XR, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Task analysis; Semantics; Biomedical imaging; Lesions; Tuning; Training; Diabetic retinopathy; prompt learning; pre-trained model; vision transformer; fundus images; CLASSIFICATION; DIAGNOSIS;
D O I
10.1109/TMI.2024.3383827
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Diabetic retinopathy (DR) is a serious ocular condition that requires effective monitoring and treatment by ophthalmologists. However, constructing a reliable DR grading model remains a challenging and costly task, heavily reliant on high-quality training sets and adequate hardware resources. In this paper, we investigate the knowledge transferability of large-scale pre-trained models (LPMs) to fundus images based on prompt learning to construct a DR grading model efficiently. Unlike full-tuning which fine-tunes all parameters of LPMs, prompt learning only involves a minimal number of additional learnable parameters while achieving a competitive effect as full-tuning. Inspired by visual prompt tuning, we propose Semantic-oriented Visual Prompt Learning (SVPL) to enhance the semantic perception ability for better extracting task-specific knowledge from LPMs, without any additional annotations. Specifically, SVPL assigns a group of learnable prompts for each DR level to fit the complex pathological manifestations and then aligns each prompt group to task-specific semantic space via a contrastive group alignment (CGA) module. We also propose a plug-and-play adapter module, Hierarchical Semantic Delivery (HSD), which allows the semantic transition of prompt groups from shallow to deep layers to facilitate efficient knowledge mining and model convergence. Our extensive experiments on three public DR grading datasets demonstrate that SVPL achieves superior results compared to other transfer tuning and DR grading methods. Further analysis suggests that the generalized knowledge from LPMs is advantageous for constructing the DR grading model on fundus images.
引用
收藏
页码:2960 / 2969
页数:10
相关论文
共 50 条
  • [31] DR|GRADUATE: Uncertainty-aware deep learning-based diabetic retinopathy grading in eye fundus images
    Araujo, Teresa
    Aresta, Guilherme
    Mendonca, Luis
    Penas, Susana
    Maia, Carolina
    Carneiro, Angela
    Maria Mendonca, Ana
    Campilho, Aurelio
    MEDICAL IMAGE ANALYSIS, 2020, 63 (63)
  • [32] Annotating Retina Fundus Images for Teaching and Learning Diabetic Retinopathy Software
    Alam, Rakibul
    Poon, Bruce
    Mohammed, Mahmood Kazi
    Amin, M. Ashraful
    Yan, Hong
    INFORMATION TECHNOLOGY IN INDUSTRY, 2014, 2 (02): : 62 - 67
  • [33] Comparison of ophthalmoscopy and fundus photography in the grading of diabetic retinopathy
    Sarraf, D
    Coleman, AL
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1999, 40 (04) : S305 - S305
  • [34] Deep Learning for Predicting the Progression of Diabetic Retinopathy using Fundus Images
    Bora, Ashish
    Babenko, Boris
    Virmani, Sunny
    Cuadros, Jorge
    Balasubramanian, Siva
    Varadarajan, Avinash V.
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2020, 61 (07)
  • [35] A Dictionary Learning Based Method for Detection of Diabetic Retinopathy in Color Fundus Images
    Karami, Narjes
    Rabbani, Hossein
    2017 10TH IRANIAN CONFERENCE ON MACHINE VISION AND IMAGE PROCESSING (MVIP), 2017, : 119 - 122
  • [36] Fundus Image Quality-Guided Diabetic Retinopathy Grading
    Zhou, Kang
    Gu, Zaiwang
    Li, Annan
    Cheng, Jun
    Gao, Shenghua
    Liu, Jiang
    COMPUTATIONAL PATHOLOGY AND OPHTHALMIC MEDICAL IMAGE ANALYSIS, 2018, 11039 : 245 - 252
  • [37] Rapid Grading of Fundus Photographs for Diabetic Retinopathy Using Crowdsourcing
    Brady, Christopher J.
    Villanti, Andrea C.
    Pearson, Jennifer L.
    Kirchner, Thomas R.
    Gupta, Omesh P.
    Shah, Chirag P.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2014, 16 (10) : 175 - 184
  • [38] Deep learning for diabetic retinopathy detection and classification based on fundus images: A review
    Tsiknakis, Nikos
    Theodoropoulos, Dimitris
    Manikis, Georgios
    Ktistakis, Emmanouil
    Boutsora, Ourania
    Berto, Alexa
    Scarpa, Fabio
    Scarpa, Alberto
    Fotiadis, Dimitrios, I
    Marias, Kostas
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135
  • [39] Classification of Fundus Images for Diabetic Retinopathy Using Machine Learning: a Brief Review
    Bala, Ruchika
    Sharma, Arun
    Goel, Nidhi
    PROCEEDINGS OF ACADEMIA-INDUSTRY CONSORTIUM FOR DATA SCIENCE (AICDS 2020), 2022, 1411 : 37 - 45
  • [40] Investigation of Fundus Images for Detection of Diabetic Retinopathy Stage Using Deep Learning
    Basarab, M. R.
    Ivanko, K. O.
    VISNYK NTUU KPI SERIIA-RADIOTEKHNIKA RADIOAPARATOBUDUVANNIA, 2023, (94): : 49 - 57