Diagnostic Accuracy of a Custom Large Language Model on Rare Pediatric Disease Case Reports

被引:2
|
作者
Young, Cameron C. [1 ,2 ]
Enichen, Ellie [1 ,2 ]
Rivera, Christian [1 ,2 ]
Auger, Corinne A. [1 ,2 ]
Grant, Nathan [1 ,2 ]
Rao, Arya [1 ,2 ]
Succi, Marc D. [2 ,3 ]
机构
[1] Harvard Med Sch, Boston, MA USA
[2] Mass Gen Brigham, Innovat Operat Res Ctr, Medically Engn Solut Healthcare Incubator, Boston, MA 02199 USA
[3] Massachusetts Gen Hosp, Dept Radiol, Boston, MA 02114 USA
关键词
artificial intelligence; diagnostic support; genetics; large language models; pediatric rare disease;
D O I
10.1002/ajmg.a.63878
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Accurately diagnosing rare pediatric diseases frequently represent a clinical challenge due to their complex and unusual clinical presentations. Here, we explore the capabilities of three large language models (LLMs), GPT-4, Gemini Pro, and a custom-built LLM (GPT-4 integrated with the Human Phenotype Ontology [GPT-4 HPO]), by evaluating their diagnostic performance on 61 rare pediatric disease case reports. The performance of the LLMs were assessed for accuracy in identifying specific diagnoses, listing the correct diagnosis among a differential list, and broad disease categories. In addition, GPT-4 HPO was tested on 100 general pediatrics case reports previously assessed on other LLMs to further validate its performance. The results indicated that GPT-4 was able to predict the correct diagnosis with a diagnostic accuracy of 13.1%, whereas both GPT-4 HPO and Gemini Pro had diagnostic accuracies of 8.2%. Further, GPT-4 HPO showed an improved performance compared with the other two LLMs in identifying the correct diagnosis among its differential list and the broad disease category. Although these findings underscore the potential of LLMs for diagnostic support, particularly when enhanced with domain-specific ontologies, they also stress the need for further improvement prior to integration into clinical practice.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Diagnostic Accuracy of a Large Language Model in Pediatric Case Studies
    Barile, Joseph
    Margolis, Alex
    Cason, Grace
    Kim, Rachel
    Kalash, Saia
    Tchaconas, Alexis
    Milanaik, Ruth
    JAMA PEDIATRICS, 2024, 178 (03) : 313 - 315
  • [2] Diagnostic accuracy of pediatric atypical appendicitis Three case reports
    Wang, Zhi-hua
    Ye, Jing
    Wang, Yu-shui
    Liu, Yan
    MEDICINE, 2019, 98 (13)
  • [3] Accuracy of a Proprietary Large Language Model in Labeling Obstetric Incident Reports
    Johnson, Jeanene
    Brown, Conner
    Lee, Grace
    Morse, Keith
    JOINT COMMISSION JOURNAL ON QUALITY AND PATIENT SAFETY, 2024, 50 (12): : 877 - 881
  • [4] Diagnostic accuracy of large language models in psychiatry
    Gargari, Omid Kohandel
    Fatehi, Farhad
    Mohammadi, Ida
    Firouzabadi, Shahryar Rajai
    Shafiee, Arman
    Habibi, Gholamreza
    ASIAN JOURNAL OF PSYCHIATRY, 2024, 100
  • [5] Diagnostic accuracy of a large language model in rheumatology: comparison of physician and ChatGPT-4
    Martin Krusche
    Johnna Callhoff
    Johannes Knitza
    Nikolas Ruffer
    Rheumatology International, 2024, 44 : 303 - 306
  • [6] Diagnostic accuracy of a large language model in rheumatology: comparison of physician and ChatGPT-4
    Krusche, Martin
    Callhoff, Johnna
    Knitza, Johannes
    Ruffer, Nikolas
    RHEUMATOLOGY INTERNATIONAL, 2024, 44 (02) : 303 - 306
  • [7] Semi-automated generation of custom clinical genomic reports for rare disease
    Matalonga, L.
    Tonda, R.
    Piscia, D.
    Laurie, S.
    Whalley, J.
    Thompson, R.
    Lochmuller, H.
    Gut, I.
    Beltran, S.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 : 639 - 639
  • [8] Diagnostic Accuracy of Tests in Pediatric Gastroesophageal Reflux Disease
    van der Pol, Rachel J.
    Smits, Marije J.
    Venmans, Leonie
    Boluyt, Nicole
    Benninga, Marc A.
    Tabbers, Merit M.
    JOURNAL OF PEDIATRICS, 2013, 162 (05): : 983 - U141
  • [9] Can large language models assist with pediatric dosing accuracy?
    Levin, Chedva
    Orkaby, Brurya
    Kerner, Erika
    Saban, Mor
    PEDIATRIC RESEARCH, 2025,
  • [10] Anatomic extent of disease: A critical variable in reports of diagnostic accuracy
    Black, WC
    RADIOLOGY, 2000, 217 (02) : 319 - 320