Enhancing glaucoma detection through multi-modal integration of retinal images and clinical biomarkers

被引:0
|
作者
Sivakumar, Rishikesh [1 ]
Penkova, Anita [2 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
[2] Univ Southern Calif, Dept Aerosp & Mech Engn, Los Angeles, CA 90089 USA
关键词
Glaucoma detection; Vision Transformers; Convolutional Neural Networks; Machine Learning; Clinical Biomarkers; AUTOMATED EXTRACTION; FRAMEWORK;
D O I
10.1016/j.engappai.2025.110010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Glaucoma, a major cause of irreversible blindness globally, often progresses without early symptoms, making prompt and precise detection vital. This paper introduces a multi-modal glaucoma detection system that combines advanced deep learning architectures to analyze retinal images and clinical biomarkers. We developed three hybrid models: the first blends Vision Transformers (ViT) with Convolutional Neural Networks (CNN), specifically Residual Networks (ResNet), for comprehensive feature extraction; the second uses ObjectWindow-Location Vision Transformer (OWL-ViT) with Residual Networks for enhanced global contextual insights; and the third employs a Hierarchical Vision Transformer using Shifted Windows (Swin Transformer) with Residual Networks, which demonstrated the best performance. The strengths of these models, broad contextual capture by ViT, localized detail extraction by CNNs, and refined granularity by Swin Transformer, thereby improving both feature representation and computational efficiency, make them well-suited for clinical use. The best-optimized system, featuring the Swin Transformer hybrid model, achieved an F1-score of 0.993 for glaucoma and 0.995 for non-glaucoma, with an overall accuracy of 99.4% on a dataset of 2874 new cases, correctly classifying 2857 of them, thus confirming its efficacy in enhancing early-stage glaucoma detection and significantly advancing over existing methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Special Issue on Multi-modal Integration and Development
    Rao, A. Ravishankar
    Choe, Yoonsuck
    Chakravarthy, Srinivasa
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2016, 8 (04) : 312 - 312
  • [32] Vae-Clip: Unveiling Deception through Cross-Modal Models and Multi-Feature Integration in Multi-Modal Fake News Detection
    Zhou, Yufeng
    Pang, Aiping
    Yu, Guang
    ELECTRONICS, 2024, 13 (15)
  • [33] A control architecture for multi-modal sensory integration
    Goncalves, LMG
    Grupen, RA
    Oliveira, AAF
    SIBGRAPI '98 - INTERNATIONAL SYMPOSIUM ON COMPUTER GRAPHICS, IMAGE PROCESSING, AND VISION, PROCEEDINGS, 1998, : 418 - 425
  • [34] Multi-modal Information Integration for Document Retrieval
    Hassan, Ehtesham
    Chaudhury, Santanu
    Gopal, M.
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1200 - 1204
  • [35] Integration of Multi-modal Features for Android Malware Detection Using Linear SVM
    Ban, Tao
    Takahashi, Takeshi
    Guo, Shanqing
    Inoue, Daisuke
    Nakao, Koji
    2016 11TH ASIA JOINT CONFERENCE ON INFORMATION SECURITY (ASIAJCIS), 2016, : 141 - 146
  • [36] Multi-modal Feature Integration for Secure Authentication
    Kang, Hang-Bong
    Ju, Myung-Ho
    INTELLIGENT COMPUTING, PART I: INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, ICIC 2006, PART I, 2006, 4113 : 1191 - 1200
  • [37] Multi-modal browsing of images in Web documents
    Chen, F
    Gargi, U
    Niles, L
    Schütze, H
    DOCUMENT RECOGNITION AND RETRIEVAL VI, 1999, 3651 : 122 - 133
  • [38] Multi-modal pedestrian detection with misalignment based on modal-wise regression and multi-modal IoU
    Wanchaitanawong, Napat
    Tanaka, Masayuki
    Shibata, Takashi
    Okutomi, Masatoshi
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [39] Glaucoma detection from retinal images
    Devi, Gayathri T. M.
    Sudha, S.
    Suraj, P.
    2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2015, : 423 - 428
  • [40] Automatic Detection of Glaucoma in Retinal Images
    Xiong, Li
    Li, Huiqi
    Zheng, Yan
    PROCEEDINGS OF THE 2014 9TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2014, : 1016 - +