Discriminative, generative artificial intelligence, and foundation models in retina imaging

Cited by: 1
Authors
Ruamviboonsuk, Paisan [1]
Arjkongharn, Niracha [1]
Vongsa, Nattaporn [1]
Pakaymaskul, Pawin [1]
Kaothanthong, Natsuda [2]
Affiliations
[1] Rangsit Univ, Coll Med, Dept Ophthalmol, Bangkok, Thailand
[2] Thammasat Univ, Sirindhorn Int Inst Technol, Bangkok, Thailand
Keywords
Discriminative artificial intelligence; foundation models; generative artificial intelligence; retinal imaging; vision transformer; optical coherence tomography; head-to-head; diabetic retinopathy; automated detection; validation; disease; degeneration; prediction
DOI
10.4103/tjo.TJO-D-24-00064
Chinese Library Classification number
R77 [Ophthalmology]
Discipline classification code
100212
Abstract
Recent advances in artificial intelligence (AI) for retinal imaging fall into two major categories: discriminative and generative AI. For discriminative tasks, conventional convolutional neural networks (CNNs) remain the major AI technique. Vision transformers (ViTs), inspired by the transformer architecture in natural language processing, have emerged as useful techniques for discriminating retinal images. ViTs can attain excellent results when pretrained at sufficient scale and transferred to specific tasks with fewer images, compared with conventional CNNs. Many studies have found that ViTs outperform CNNs on common tasks such as diabetic retinopathy screening on color fundus photographs (CFP) and segmentation of retinal fluid on optical coherence tomography (OCT) images. The generative adversarial network (GAN) is the main technique in generative AI for retinal imaging. Novel images generated by GANs can be used to train AI models when datasets are imbalanced or inadequate. Foundation models are another recent advance in retinal imaging. They are pretrained on huge datasets, such as millions of CFP and OCT images, and fine-tuned for downstream tasks with much smaller datasets. One foundation model, RETFound, was trained with self-supervision and was found to discriminate many eye and systemic diseases better than supervised models. Large language models are foundation models that may be applied to text-related tasks, such as reports of retinal angiography. Whereas AI technology moves forward quickly, real-world use of AI models moves slowly, widening the gap between development and deployment. Strong evidence that AI models can prevent visual loss may be required to close this gap.
Pages: 13