Lesion classification and diabetic retinopathy grading by integrating softmax and pooling operators into vision transformer

Authors
Liu, Chong [1]
Wang, Weiguang [2]
Lian, Jian [1]
Jiao, Wanzhen [2]
Affiliations
[1] Shandong Management Univ, Sch Intelligence Engn, Jinan, Peoples R China
[2] Shandong First Med Univ, Dept Ophthalmol, Shandong Prov Hosp, Jinan, Peoples R China
Keywords
medical image analysis; image classification; deep learning; Bi-LSTM; transformer
DOI
10.3389/fpubh.2024.1442114
Chinese Library Classification number
R1 [Preventive Medicine, Hygiene]
Discipline classification code
1004; 120402
Abstract
Introduction: Diabetic retinopathy grading plays a vital role in the diagnosis and treatment of patients. In practice, this task relies mainly on manual inspection by human graders. However, manual screening is labor-intensive, time-consuming, and error-prone. Therefore, many automated screening techniques have been developed to address this task.

Methods: Among these techniques, deep learning models have demonstrated promising outcomes in various machine vision tasks. However, most deep learning approaches for medical image analysis are built on convolutional operations, which may neglect global dependencies between distant pixels in medical images. Vision transformer models, which can capture associations between pixels across the whole image, have therefore been gradually adopted in medical image analysis. However, the quadratic computational complexity of the attention mechanism has hindered the deployment of vision transformers in clinical practice. With this in mind, this study introduces an integrated self-attention mechanism with both softmax and linear modules, aiming for efficiency and expressiveness at the same time. Specifically, a set of proxy tokens is added to the attention module, so that attention is computed with far fewer query and key tokens than the original set. The proxy tokens are intended to exploit the advantages of both softmax and linear attention.

Results: To evaluate the presented approach, comparison experiments between state-of-the-art algorithms and the proposed approach were conducted. Experimental results demonstrate that the proposed approach outperforms the state-of-the-art algorithms on publicly available datasets.

Discussion: Accordingly, the proposed approach can be regarded as a potentially valuable instrument in clinical practice.
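The proxy-token idea in the Methods paragraph can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how the proxy tokens are built, so the sketch assumes they are obtained by average-pooling the queries (a guess suggested by the "pooling operators" in the title), and the names `proxy_attention` and `num_proxies` are hypothetical. With N tokens and n proxies, both softmax steps cost on the order of N·n rather than N².

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def proxy_attention(q, k, v, num_proxies):
    """Two-step attention routed through a small set of proxy tokens.

    q, k: arrays of shape (n, d); v: array of shape (n, d_v).
    Assumption: proxies are average-pooled queries (hypothetical choice).
    """
    n, d = q.shape
    if n % num_proxies != 0:
        raise ValueError("n must be divisible by num_proxies in this sketch")
    # Pool contiguous groups of query tokens into num_proxies proxy tokens.
    proxies = q.reshape(num_proxies, n // num_proxies, d).mean(axis=1)
    scale = 1.0 / np.sqrt(d)
    # Step 1: each proxy aggregates values from all keys, shape (num_proxies, d_v).
    proxy_values = softmax(proxies @ k.T * scale) @ v
    # Step 2: each query reads from the proxies, shape (n, d_v).
    return softmax(q @ proxies.T * scale) @ proxy_values

rng = np.random.default_rng(0)
q = rng.standard_normal((16, 8))
k = rng.standard_normal((16, 8))
v = rng.standard_normal((16, 8))
out = proxy_attention(q, k, v, num_proxies=4)
print(out.shape)
```

Neither attention matrix here is N×N: one is n×N and the other N×n, which is where the saving over standard softmax attention would come from when n is much smaller than N.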
Pages: 13