Discriminative, generative artificial intelligence, and foundation models in retina imaging

Authors
Ruamviboonsuk, Paisan [1]
Arjkongharn, Niracha [1]
Vongsa, Nattaporn [1]
Pakaymaskul, Pawin [1]
Kaothanthong, Natsuda [2]
Affiliations
[1] Rangsit Univ, Coll Med, Dept Ophthalmol, Bangkok, Thailand
[2] Thammasat Univ, Sirindhorn Int Inst Technol, Bangkok, Thailand
Keywords
Discriminative artificial intelligence; foundation models; generative artificial intelligence; retinal imaging; vision transformer; OPTICAL COHERENCE TOMOGRAPHY; HEAD-TO-HEAD; DIABETIC-RETINOPATHY; AUTOMATED DETECTION; VALIDATION; DISEASE; DEGENERATION; PREDICTION
DOI
10.4103/tjo.TJO-D-24-00064
Chinese Library Classification (CLC) number
R77 [Ophthalmology]
Discipline classification code
100212
Abstract
Recent advances in artificial intelligence (AI) for retinal imaging have found application in two major categories: discriminative and generative AI. For discriminative tasks, conventional convolutional neural networks (CNNs) remain the major AI technique. Vision transformers (ViTs), inspired by the transformer architecture in natural language processing, have emerged as a useful technique for discriminating retinal images. ViTs can attain excellent results when pretrained at sufficient scale and transferred to specific tasks with fewer images, compared with conventional CNNs. Many studies have found that ViTs outperform CNNs on common tasks such as diabetic retinopathy screening on color fundus photographs (CFP) and segmentation of retinal fluid on optical coherence tomography (OCT) images. The generative adversarial network (GAN) is the main technique for generative AI in retinal imaging. Novel images generated by GANs can be used to train AI models when datasets are imbalanced or inadequate. Foundation models are another recent advance in retinal imaging. They are pretrained on huge datasets, such as millions of CFP and OCT images, and fine-tuned for downstream tasks with much smaller datasets. One foundation model, RETFound, was trained with self-supervision and was found to discriminate many eye and systemic diseases better than supervised models. Large language models are foundation models that may be applied to text-related tasks, such as reports of retinal angiography. Although AI technology is advancing quickly, real-world use of AI models is progressing slowly, which widens the gap between development and deployment. Strong evidence that AI models can prevent visual loss may be required to close this gap.
Pages: 13