Discriminative, generative artificial intelligence, and foundation models in retina imaging

被引:1
|
作者
Ruamviboonsuk, Paisan [1 ]
Arjkongharn, Niracha [1 ]
Vongsa, Nattaporn [1 ]
Pakaymaskul, Pawin [1 ]
Kaothanthong, Natsuda [2 ]
机构
[1] Rangsit Univ, Coll Med, Dept Ophthalmol, Bangkok, Thailand
[2] Thammasat Univ, Sirindhorn Int Inst Technol, Bangkok, Thailand
关键词
Discriminative artificial intelligence; foundation models; generative artificial intelligence; retinal imaging; vision transformer; OPTICAL COHERENCE TOMOGRAPHY; HEAD-TO-HEAD; DIABETIC-RETINOPATHY; AUTOMATED DETECTION; VALIDATION; DISEASE; DEGENERATION; PREDICTION;
D O I
10.4103/tjo.TJO-D-24-00064
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
Recent advances of artificial intelligence (AI) in retinal imaging found its application in two major categories: discriminative and generative AI. For discriminative tasks, conventional convolutional neural networks (CNNs) are still major AI techniques. Vision transformers (ViT), inspired by the transformer architecture in natural language processing, has emerged as useful techniques for discriminating retinal images. ViT can attain excellent results when pretrained at sufficient scale and transferred to specific tasks with fewer images, compared to conventional CNN. Many studies found better performance of ViT, compared to CNN, for common tasks such as diabetic retinopathy screening on color fundus photographs (CFP) and segmentation of retinal fluid on optical coherence tomography (OCT) images. Generative Adversarial Network (GAN) is the main AI technique in generative AI in retinal imaging. Novel images generated by GAN can be applied for training AI models in imbalanced or inadequate datasets. Foundation models are also recent advances in retinal imaging. They are pretrained with huge datasets, such as millions of CFP and OCT images and fine-tuned for downstream tasks with much smaller datasets. A foundation model, RETFound, which was self-supervised and found to discriminate many eye and systemic diseases better than supervised models. Large language models are foundation models that may be applied for text-related tasks, like reports of retinal angiography. Whereas AI technology moves forward fast, real-world use of AI models moves slowly, making the gap between development and deployment even wider. Strong evidence showing AI models can prevent visual loss may be required to close this gap.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Analysis Generative artificial intelligence
    Stokel-Walker, Chris
    NEW SCIENTIST, 2024, 247 (3492) : 18 - 18
  • [32] On Chatbots and Generative Artificial Intelligence
    Oermann, Eric Karl
    Kondziolka, Douglas
    NEUROSURGERY, 2023, 92 (04) : 665 - 666
  • [33] Generative Artificial Intelligence: Fundamentals
    Corchado, Juan M.
    Lopez, F. Sebastian
    Nunez, V. Juan M.
    Garcia, S. Raul
    Chamoso, Pablo
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2023, 12 (01):
  • [34] Generative artificial intelligence in ophthalmology
    Waisberg, Ethan
    Ong, Joshua
    Kamran, Sharif Amit
    Masalkhi, Mouayad
    Paladugu, Phani
    Zaman, Nasif
    Lee, Andrew G.
    Tavakkoli, Alireza
    SURVEY OF OPHTHALMOLOGY, 2025, 70 (01) : 1 - 11
  • [35] Generative Artificial Intelligence and ChatGPT
    Byrne, Matthew D.
    JOURNAL OF PERIANESTHESIA NURSING, 2023, 38 (03) : 519 - 522
  • [36] Generative artificial intelligence in oncology
    Ganjavi, Conner
    Melamed, Sam
    Biedermann, Brett
    Eppler, Michael B.
    Rodler, Severin
    Layne, Ethan
    Cei, Francesco
    Gill, Inderbir
    Cacciamani, Giovanni E.
    CURRENT OPINION IN UROLOGY, 2025, 35 (03) : 205 - 213
  • [37] Generative artificial intelligence and surgeons
    Lai, Paul B. S.
    SURGICAL PRACTICE, 2023, 27 (03) : 128 - 130
  • [38] A Primer on Generative Artificial Intelligence
    Kalota, Faisal
    EDUCATION SCIENCES, 2024, 14 (02):
  • [39] Generative artificial intelligence substantially enhances the accuracy of embryo selection models
    Cao, P.
    Derhaag, J.
    Coonen, E.
    Brunner, H.
    Acharya, G.
    Salumets, A.
    Esteki, M. Zamani
    HUMAN REPRODUCTION, 2024, 39 : I144 - I144
  • [40] LEGAL IMPLICATIONS OF WEB SCRAPING IN THE TRAINING OF GENERATIVE ARTIFICIAL INTELLIGENCE MODELS
    Chaparro, Juan Manuel Pacheco
    Ramirez, Laura Barrero
    REVISTA LA PROPIEDAD INMATERIAL, 2024, (38): : 167 - 189