Evaluating the Accuracy of Artificial Intelligence (AI)-Generated Illustrations for Laser-Assisted In Situ Keratomileusis (LASIK), Photorefractive Keratectomy (PRK), and Small Incision Lenticule Extraction (SMILE)

被引:1
|
作者
Petroff, Dallas J. [1 ]
Nasir, Ayesha A. [2 ]
Moin, Kayvon A. [3 ,4 ]
Loveless, Bosten A. [3 ,5 ]
Moshirfar, Omeed A. [6 ]
Hoopes, Phillip C. [3 ]
Moshirfar, Majid [3 ,7 ,8 ]
机构
[1] Idaho Coll Osteopath Med, Ophthalmol, Meridian, ID USA
[2] Univ Louisville, Ophthalmol, Louisville, KY USA
[3] Hoopes Vis, Hoopes Vis Res Ctr, Ophthalmol, Draper, UT 84020 USA
[4] Amer Univ Caribbean, Med, Cupecoy, Sint Maarten
[5] Rocky Vista Univ, Ophthalmol, Coll Osteopath Med, Ivins, UT USA
[6] Washington Univ St Louis, Sam Fox Sch Design & Visual Arts, St Louis, MO USA
[7] Univ Utah, John A Moran Eye Ctr, Ophthalmol, Sch Med, Salt Lake City, UT 84102 USA
[8] Utah Lions Eye Bank, Eye Banking & Corneal Transplantat, Murray, UT 84107 USA
关键词
klex; astigmatism; generative ai model; eye; cornea; myopia; corneal refractive surgery; medical illustration; artificial intelligence;
D O I
10.7759/cureus.67747
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Purpose: To utilize artificial intelligence (AI) platforms to generate medical illustrations for refractive surgeries, aiding patients in visualizing and comprehending procedures like laser-assisted in situ keratomileusis (LASIK), photorefractive keratectomy (PRK), and small incision lenticule extraction (SMILE). This study displays the current performance of two OpenAI programs in terms of their accuracy in common corneal refractive procedures. Methods: We selected AI image generators based on their popularity, choosing Decoder-Only Autoregressive Language and Image Synthesis 3 (DALL-E 3) for its leading position and Medical Illustration Master (MiM) for its high engagement. We developed six non-AI-generated prompts targeting specific outcomes related to LASIK, PRK, and SMILE procedures to assess medical accuracy. We generated images using these prompts (18 total images per AI platform) and used the final images produced after the sixth prompt for this study (three final images per AI platform). Human-created procedural images were also gathered for comparison. Four experts independently graded the images, and their scores were averaged. Each image was evaluated with our grading system on "Legibility," "Detail & Clarity," "Anatomical Realism & Accuracy," "Procedural Step Accuracy," and "Lack of Fictitious Anatomy," with scores ranging from 0 to 3 per category allowing 15 points total. A score of 15 points signifies excellent performance, indicating a highly accurate medical illustration. Conversely, a low score suggests a poor-quality illustration. Additionally, we submitted the same AI-generated images back into Chat Generative Pre-Trained Transformer-4o (ChatGPT-4o) along with our grading system. This allowed ChatGPT-4o to use and evaluate both AI-generated and human-created images (HCIs). Results: In individual category scoring, HCIs significantly outperformed AI images in legibility, anatomical realism, procedural step accuracy, and lack of fictitious anatomy. There were no significant differences between DALL-E 3 and MiM in these categories (p>0.05). In procedure-specific comparisons, HCIs consistently scored higher than AI-generated images for LASIK, PRK, and SMILE. For LASIK, HCIs scored 14 +/- 0.82 (93.3%), while DALL-E 3 scored 4.5 +/- 0.58 (30%) and MiM scored 4.5 +/- 1.91 (30%) (p<0.001). For PRK, HCIs scored 14.5 +/- 0.58 (96.7%), compared to DALL-E 3's 5.25 +/- 1.26 (35%) and MiM's 7 +/- 3.56 (46.7%) (p<0.001). For SMILE, HCIs scored 14.5 +/- 0.68 (96.7%), while DALL-E 3 scored 5 +/- 0.82 (33.3%) and MiM scored 6 +/- 2.71 (40%) (p<0.001). HCIs significantly outperformed AI-generated images from DALL-E 3 and MiM in overall accuracy for medical illustrations, achieving scores of 14.33 +/- 0.23 (95.6%), 4.93 +/- 0.69 (32.8%), and 5.83 +/- 0.23 (38.9%) respectively (p<0.001). ChatGPT-4o evaluations were consistent with human evaluations for HCIs (3 +/- 0, 2.87 +/- 0.23; p=0.121) but rated AI images higher than human evaluators (2 +/- 0 vs 1.07 +/- 0.73; p<0.001). Conclusion: This study highlights the inaccuracy of AI-generated images in illustrating corneal refractive procedures such as LASIK, PRK, and SMILE. Although the OpenAI platform can create images recognizable as eyes, they lack educational value. AI excels in quickly generating creative, vibrant images, but accurate medical illustration remains a significant challenge. While AI performs well with text-based actions, its capability to produce precise medical images needs substantial improvement.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Comparison of tear proteomic and neuromediator profiles changes between small incision lenticule extraction (SMILE) and femtosecond laser-assisted in-situ keratomileusis (LASIK)
    Liu, Yu-Chi
    Yam, Gary Hin-Fai
    Lin, Molly Tzu-Yu
    Teo, Ericia
    Koh, Siew-Kwan
    Deng, Lu
    Zhou, Lei
    Tong, Louis
    Mehta, Jodhbir S.
    JOURNAL OF ADVANCED RESEARCH, 2021, 29 : 67 - 81
  • [12] Three-year outcomes of small incision lenticule extraction (SMILE) and femtosecond laser-assisted laser in situ keratomileusis (FS-LASIK) for myopia and myopic astigmatism
    Han, Tian
    Xu, Ye
    Han, Xiao
    Zeng, Li
    Shang, Jianmin
    Chen, Xun
    Zhou, Xingtao
    BRITISH JOURNAL OF OPHTHALMOLOGY, 2019, 103 (04) : 565 - 568
  • [13] Small Incision Lenticule Extraction (SMILE) versus Femtosecond Laser-Assisted In Situ Keratomileusis (FS-LASIK) for Myopia: A Systematic Review and Meta-Analysis
    Shen, Zeren
    Shi, Keda
    Yu, Yinhui
    Yu, Xiaoning
    Lin, Yuchen
    Yao, Ke
    PLOS ONE, 2016, 11 (07):
  • [14] Early Corneal Nerve Damage and Recovery Following Small Incision Lenticule Extraction (SMILE) and Laser In Situ Keratomileusis (LASIK)
    Mohamed-Noriega, Karim
    Riau, Andri K.
    Lwin, Nyein C.
    Chaurasia, Shyam S.
    Tan, Donald T.
    Mehta, Jodhbir S.
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2014, 55 (03) : 1823 - 1834
  • [15] Dry Eye after Small Incision Lenticule Extraction (SMILE) versus Femtosecond Laser-Assisted in Situ Keratomileusis (FS-LASIK) for Myopia: A Meta-Analysis
    Shen, Zeren
    Zhu, Yanan
    Song, Xiaohui
    Yan, Jie
    Yao, Ke
    PLOS ONE, 2016, 11 (12):
  • [16] Neuropathic Ocular Surface Changes, Corneal Nerve Imaging and neuromediator Profiles in Small Incision Lenticule Extraction (SMILE) and Femtosecond Laser-Assisted in-situ Keratomileusis (LASIK)
    Liu, Yu-Chi
    Lin, Molly Tzu-Yu
    Lee, Isabelle Xin Yu
    Mehta, Jodhbir S.
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2022, 63 (07)
  • [17] Comparison of Corneal Sensation Between Small Incision Lenticule Extraction (SMILE) and Femtosecond Laser-Assisted LASIK for Myopia
    Li, Meiyan
    Zhou, Zimei
    Shen, Yang
    Knorz, Michael C.
    Gong, Lan
    Zhou, Xingtao
    JOURNAL OF REFRACTIVE SURGERY, 2014, 30 (02) : 94 - 100
  • [18] Comparison of Corneal Epithelial Remodeling After Femtosecond Laser-Assisted LASIK and Small Incision Lenticule Extraction (SMILE)
    Ryu, Ik-Hee
    Kim, Bong Jun
    Lee, Jong-Hyuck
    Kim, Sun Woong
    JOURNAL OF REFRACTIVE SURGERY, 2017, 33 (04) : 250 - +
  • [19] Comparison of central corneal thickness treated with small incision lenticule extraction, femtosecond laser-assisted in situ keratomileusis, or laser-assisted subepithelial keratomileusis for myopia
    Ge Tian
    Tong Chen
    Xin Liu
    Yue Lin
    Na Li
    Hua Gao
    Mingna Liu
    Lasers in Medical Science, 38
  • [20] Comparison of central corneal thickness treated with small incision lenticule extraction, femtosecond laser-assisted in situ keratomileusis, or laser-assisted subepithelial keratomileusis for myopia
    Tian, Ge
    Chen, Tong
    Liu, Xin
    Lin, Yue
    Li, Na
    Gao, Hua
    Liu, Mingna
    LASERS IN MEDICAL SCIENCE, 2023, 38 (01)