Semantic-Oriented Visual Prompt Learning for Diabetic Retinopathy Grading on Fundus Images

被引:0
|
作者
Zhang, Yuhan [1 ]
Ma, Xiao [2 ]
Huang, Kun [2 ]
Li, Mingchao [2 ]
Heng, Pheng-Ann [3 ,4 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen Res Inst, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Nanjing Univ Sci & Technol, Dept Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Inst Med Intelligence & XR, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Task analysis; Semantics; Biomedical imaging; Lesions; Tuning; Training; Diabetic retinopathy; prompt learning; pre-trained model; vision transformer; fundus images; CLASSIFICATION; DIAGNOSIS;
D O I
10.1109/TMI.2024.3383827
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Diabetic retinopathy (DR) is a serious ocular condition that requires effective monitoring and treatment by ophthalmologists. However, constructing a reliable DR grading model remains a challenging and costly task, heavily reliant on high-quality training sets and adequate hardware resources. In this paper, we investigate the knowledge transferability of large-scale pre-trained models (LPMs) to fundus images based on prompt learning to construct a DR grading model efficiently. Unlike full-tuning which fine-tunes all parameters of LPMs, prompt learning only involves a minimal number of additional learnable parameters while achieving a competitive effect as full-tuning. Inspired by visual prompt tuning, we propose Semantic-oriented Visual Prompt Learning (SVPL) to enhance the semantic perception ability for better extracting task-specific knowledge from LPMs, without any additional annotations. Specifically, SVPL assigns a group of learnable prompts for each DR level to fit the complex pathological manifestations and then aligns each prompt group to task-specific semantic space via a contrastive group alignment (CGA) module. We also propose a plug-and-play adapter module, Hierarchical Semantic Delivery (HSD), which allows the semantic transition of prompt groups from shallow to deep layers to facilitate efficient knowledge mining and model convergence. Our extensive experiments on three public DR grading datasets demonstrate that SVPL achieves superior results compared to other transfer tuning and DR grading methods. Further analysis suggests that the generalized knowledge from LPMs is advantageous for constructing the DR grading model on fundus images.
引用
收藏
页码:2960 / 2969
页数:10
相关论文
共 50 条
  • [41] Suitability Classification of Retinal Fundus Images for Diabetic Retinopathy Using Deep Learning
    Pinedo-Diaz, German
    Ortega-Cisneros, Susana
    Moya-Sanchez, Eduardo Ulises
    Rivera, Jorge
    Mejia-Alvarez, Pedro
    Rodriguez-Navarrete, Francisco J.
    Sanchez, Abraham
    ELECTRONICS, 2022, 11 (16)
  • [42] Diabetic Retinopathy Classification With Deep Learning via Fundus Images: A Short Survey
    Zhu, Shanshan
    Xiong, Changchun
    Zhong, Qingshan
    Yao, Yudong
    IEEE ACCESS, 2024, 12 : 20540 - 20558
  • [43] Rapid grading of fundus photos for diabetic retinopathy using crowdsourcing
    Brady, Christopher J.
    Villanti, Andrea C.
    Pearson, Jennifer L.
    Kirchner, Thomas R.
    Gup, Omesh
    Shah, Chirag
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2014, 55 (13)
  • [44] Automatic Detection of Diabetic Hypertensive Retinopathy in Fundus Images Using Transfer Learning
    Nagpal, Dimple
    Alsubaie, Najah
    Soufiene, Ben Othman
    Alqahtani, Mohammed S.
    Abbas, Mohamed
    Almohiy, Hussain M.
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [45] Automatic Screening of Diabetic Retinopathy Using Fundus Images and Machine Learning Algorithms
    Rahman, K. K. Mujeeb
    Nasor, Mohamed
    Imran, Ahmed
    DIAGNOSTICS, 2022, 12 (09)
  • [46] Hemorrhage semantic segmentation in fundus images for the diagnosis of diabetic retinopathy by using a convolutional neural network
    Skouta, Ayoub
    Elmoufidi, Abdelali
    Jai-Andaloussi, Said
    Ouchetto, Ouail
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [47] An Automated System for the Grading of Diabetic Maculopathy in Fundus Images
    Akram, Muhammad Usman
    Akhtar, Mahmood
    Javed, M. Younus
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT IV, 2012, 7666 : 36 - 43
  • [48] Deep attentive convolutional neural network for automatic grading of imbalanced diabetic retinopathy in retinal fundus images
    Li, Feng
    Tang, Shiqing
    Chen, Yuyang
    Zou, Haidong
    BIOMEDICAL OPTICS EXPRESS, 2022, 13 (11) : 5813 - 5835
  • [49] Hemorrhage semantic segmentation in fundus images for the diagnosis of diabetic retinopathy by using a convolutional neural network
    Ayoub Skouta
    Abdelali Elmoufidi
    Said Jai-Andaloussi
    Ouail Ouchetto
    Journal of Big Data, 9
  • [50] Automated Grading of Diabetic Retinopathy by Simulating Human's Attention with Deep Learning in Fundus Image
    Jiang, Yanyun
    Sui, Xiaodan
    Jia, Weikuan
    Lian, Jian
    Jiao, Wanzhen
    Zheng, Yuanjie
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2020, 61 (07)