Diagnostic Accuracy of a Custom Large Language Model on Rare Pediatric Disease Case Reports

被引:2
|
作者
Young, Cameron C. [1 ,2 ]
Enichen, Ellie [1 ,2 ]
Rivera, Christian [1 ,2 ]
Auger, Corinne A. [1 ,2 ]
Grant, Nathan [1 ,2 ]
Rao, Arya [1 ,2 ]
Succi, Marc D. [2 ,3 ]
机构
[1] Harvard Med Sch, Boston, MA USA
[2] Mass Gen Brigham, Innovat Operat Res Ctr, Medically Engn Solut Healthcare Incubator, Boston, MA 02199 USA
[3] Massachusetts Gen Hosp, Dept Radiol, Boston, MA 02114 USA
关键词
artificial intelligence; diagnostic support; genetics; large language models; pediatric rare disease;
D O I
10.1002/ajmg.a.63878
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Accurately diagnosing rare pediatric diseases frequently represent a clinical challenge due to their complex and unusual clinical presentations. Here, we explore the capabilities of three large language models (LLMs), GPT-4, Gemini Pro, and a custom-built LLM (GPT-4 integrated with the Human Phenotype Ontology [GPT-4 HPO]), by evaluating their diagnostic performance on 61 rare pediatric disease case reports. The performance of the LLMs were assessed for accuracy in identifying specific diagnoses, listing the correct diagnosis among a differential list, and broad disease categories. In addition, GPT-4 HPO was tested on 100 general pediatrics case reports previously assessed on other LLMs to further validate its performance. The results indicated that GPT-4 was able to predict the correct diagnosis with a diagnostic accuracy of 13.1%, whereas both GPT-4 HPO and Gemini Pro had diagnostic accuracies of 8.2%. Further, GPT-4 HPO showed an improved performance compared with the other two LLMs in identifying the correct diagnosis among its differential list and the broad disease category. Although these findings underscore the potential of LLMs for diagnostic support, particularly when enhanced with domain-specific ontologies, they also stress the need for further improvement prior to integration into clinical practice.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Disseminated Neuroendocrine Carcinoma in a Pediatric Patient: A Rare Case and Diagnostic Challenge
    Post, Ginell R.
    Lewis, Jocelyn A.
    Hudspeth, Michelle P.
    Caplan, Michael J.
    Lazarchick, John
    JOURNAL OF PEDIATRIC HEMATOLOGY ONCOLOGY, 2012, 34 (03) : 200 - 203
  • [32] Neurosarcoidosis, a rare disease of the CNS. Two case reports
    Nikolakaki, E.
    Kalamafkianaki, K.
    Kouroumalos, N.
    Kontogiorgaki, M.
    Andrinos, D.
    Georgakakis, G.
    JOURNAL OF NEUROLOGY, 2008, 255 : 137 - 137
  • [33] Tuberculosis of prostate and epidydimis: two case reports of a rare disease
    Stamatiou, K.
    Efstratiadi, E.
    Tzamarias, S.
    Zavradinos, D.
    Tsavari, A.
    Christopoulos, G.
    SCIENTIFIC CHRONICLES, 2019, 24 (01) : 121 - 126
  • [34] Panatrophy of Gowers is a rare disease: case reports and review of the literature
    Paliwal, Vijay Kumar
    Bhargawa, Puneet
    Gupta, Rahul
    Saran, Jitendra
    Mathur, Deepak K.
    INTERNATIONAL JOURNAL OF DERMATOLOGY, 2015, 54 (06) : 656 - 661
  • [35] Leiomyomatosis Peritonealis Disseminata - Four Case Reports of a Rare Disease
    Goppel, K.
    Becker, K.
    Schmalfeldt, B.
    Kiechle, M.
    Seifert-Klauss, V.
    GEBURTSHILFE UND FRAUENHEILKUNDE, 2009, 69 (10) : 945 - 951
  • [36] TWO CASE REPORTS ON DARIER DISEASE: RARE DISORDER OF KERATINIZATION
    Paul, Arup
    JOURNAL OF EVOLUTION OF MEDICAL AND DENTAL SCIENCES-JEMDS, 2014, 3 (14): : 3661 - 3664
  • [37] Pediatric Cushing's disease: Case reports and retrospective review
    Pomahacova, Renata
    Paterova, Petra
    Nykodymova, Eva
    Sykora, Josef
    Krsek, Michal
    BIOMEDICAL PAPERS-OLOMOUC, 2024, 168 (01): : 85 - 91
  • [38] Medical large language model for diagnostic reasoning across specialties
    Wang, Guangyu
    Liu, Xiaohong
    NATURE MEDICINE, 2025, : 743 - 744
  • [39] The Accuracy and Potential Impact of a Diagnostic Decision Support System in Rare Disease Cases
    Ronicke, Simon
    Hirsch, Martin C.
    Tuerk, Ewelina
    Larionov, Katharina
    Tientcheu, Daphne
    Wagner, Annette D.
    ARTHRITIS & RHEUMATOLOGY, 2018, 70
  • [40] Evaluating the Accuracy of Responses by Large Language Models for Information on Disease Epidemiology
    Zhu, Kexin
    Zhang, Jiajie
    Klishin, Anton
    Esser, Mario
    Blumentals, William A.
    Juhaeri, Juhaeri
    Jouquelet-Royer, Corinne
    Sinnott, Sarah-Jo
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2025, 34 (02)