Discriminative, generative artificial intelligence, and foundation models in retina imaging

Cited by: 1
Authors
Ruamviboonsuk, Paisan [1]
Arjkongharn, Niracha [1]
Vongsa, Nattaporn [1]
Pakaymaskul, Pawin [1]
Kaothanthong, Natsuda [2]
Affiliations
[1] Rangsit Univ, Coll Med, Dept Ophthalmol, Bangkok, Thailand
[2] Thammasat Univ, Sirindhorn Int Inst Technol, Bangkok, Thailand
Keywords
Discriminative artificial intelligence; foundation models; generative artificial intelligence; retinal imaging; vision transformer; OPTICAL COHERENCE TOMOGRAPHY; HEAD-TO-HEAD; DIABETIC-RETINOPATHY; AUTOMATED DETECTION; VALIDATION; DISEASE; DEGENERATION; PREDICTION;
DOI
10.4103/tjo.TJO-D-24-00064
Chinese Library Classification
R77 [Ophthalmology]
Discipline classification code
100212
Abstract
Recent advances in artificial intelligence (AI) for retinal imaging have found application in two major categories: discriminative and generative AI. For discriminative tasks, conventional convolutional neural networks (CNNs) remain the major AI technique. Vision transformers (ViTs), inspired by the transformer architecture in natural language processing, have emerged as a useful technique for discriminating retinal images. Compared with conventional CNNs, ViTs can attain excellent results when pretrained at sufficient scale and transferred to specific tasks with fewer images. Many studies have found that ViTs outperform CNNs on common tasks such as diabetic retinopathy screening on color fundus photographs (CFP) and segmentation of retinal fluid on optical coherence tomography (OCT) images. The generative adversarial network (GAN) is the main technique for generative AI in retinal imaging. Novel images generated by GANs can be used to train AI models when datasets are imbalanced or inadequate. Foundation models are another recent advance in retinal imaging. They are pretrained on huge datasets, such as millions of CFP and OCT images, and fine-tuned for downstream tasks with much smaller datasets. One foundation model, RETFound, was trained with self-supervision and was found to discriminate many eye and systemic diseases better than supervised models. Large language models are foundation models that may be applied to text-related tasks, such as reports of retinal angiography. Whereas AI technology is advancing fast, real-world use of AI models moves slowly, widening the gap between development and deployment. Strong evidence that AI models can prevent visual loss may be required to close this gap.
Pages: 13