Lesion classification and diabetic retinopathy grading by integrating softmax and pooling operators into vision transformer

被引:0
|
作者
Liu, Chong [1 ]
Wang, Weiguang [2 ]
Lian, Jian [1 ]
Jiao, Wanzhen [2 ]
机构
[1] Shandong Management Univ, Sch Intelligence Engn, Jinan, Peoples R China
[2] Shandong First Med Univ, Dept Ophthalmol, Shandong Prov Hosp, Jinan, Peoples R China
关键词
medical image analysis; image classification; deep learning; Bi-LSTM; transformer;
D O I
10.3389/fpubh.2024.1442114
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Introduction Diabetic retinopathy grading plays a vital role in the diagnosis and treatment of patients. In practice, this task mainly relies on manual inspection using human visual system. However, the human visual system-based screening process is labor-intensive, time-consuming, and error-prone. Therefore, plenty of automated screening technique have been developed to address this task.Methods Among these techniques, the deep learning models have demonstrated promising outcomes in various types of machine vision tasks. However, most of the medical image analysis-oriented deep learning approaches are built upon the convolutional operations, which might neglect the global dependencies between long-range pixels in the medical images. Therefore, the vision transformer models, which can unveil the associations between global pixels, have been gradually employed in medical image analysis. However, the quadratic computation complexity of attention mechanism has hindered the deployment of vision transformer in clinical practices. Bearing the analysis above in mind, this study introduces an integrated self-attention mechanism with both softmax and linear modules to guarantee efficiency and expressiveness, simultaneously. To be specific, a portion of query and key tokens, which are much less than the original query and key tokens, are adopted in the attention module by adding a set of proxy tokens. Note that the proxy tokens can fully utilize both the advantages of softmax and linear attention.Results To evaluate the performance of the presented approach, the comparison experiments between state-of-the-art algorithms and the proposed approach are conducted. Experimental results demonstrate that the proposed approach achieves superior outcome over the state-of-the-art algorithms on the publicly available datasets.Discussion Accordingly, the proposed approach can be taken as a potentially valuable instrument in clinical practices.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Diabetic Retinopathy Classification using Vision Transformer
    Mutawa, A. M.
    Sruthi, Sai
    2022 6TH EUROPEAN CONFERENCE ON ELECTRICAL ENGINEERING & COMPUTER SCIENCE, ELECS, 2022, : 25 - 30
  • [2] ViT features for diabetic retinopathy grading and lesion segmentation
    Kay, Olivia
    Miller, Keith
    Nguyen, Mickey
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (09)
  • [3] Lesion-Aware Transformers for Diabetic Retinopathy Grading
    Sun, Rui
    Li, Yihao
    Zhang, Tianzhu
    Mao, Zhendong
    Wu, Feng
    Zhang, Yongdong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10933 - 10942
  • [4] Diabetic retinopathy grading based on Lesion correlation graph
    Luo, Daming
    Kamata, Sei-ichiro
    2020 JOINT 9TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2020 4TH INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2020,
  • [5] Integrating convolution and transformer for enhanced diabetic retinopathy detection
    Cao, Xinrong
    Lin, Jie
    Gao, Xiaozhi
    Li, Zuoyong
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2024, 23 (04) : 225 - 235
  • [6] Lesion-attention pyramid network for diabetic retinopathy grading
    Li, Xiang
    Jiang, Yuchen
    Zhang, Jiusi
    Li, Minglei
    Luo, Hao
    Yin, Shen
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 126
  • [7] Detection and classification of retinal lesions for grading of diabetic retinopathy
    Akram, M. Usman
    Khalid, Shehzad
    Tariq, Anam
    Khan, Shoab A.
    Azam, Farooque
    COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 45 : 161 - 171
  • [8] SkinDistilViT: Lightweight Vision Transformer for Skin Lesion Classification
    Lungu-Stan, Vlad-Constantin
    Cercel, Dumitru-Clementin
    Pop, Florin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT I, 2023, 14254 : 268 - 280
  • [9] Multi-Task Learning for Diabetic Retinopathy Grading and Lesion Segmentation
    Foo, Alex
    Hsu, Wynne
    Lee, Mong Li
    Lim, Gilbert
    Wong, Tien Yin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13267 - 13272
  • [10] SUNET: A LESION REGULARIZED MODEL FOR SIMULTANEOUS DIABETIC RETINOPATHY AND DIABETIC MACULAR EDEMA GRADING
    Tu, Zhi
    Gao, Shenghua
    Zhou, Kang
    Chen, Xianing
    Fu, Huazhu
    Gu, Zaiwang
    Cheng, Jun
    Yu, Zehao
    Liu, Jiang
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 1378 - 1382