Can large language models assist with pediatric dosing accuracy?

被引:0
|
作者
Levin, Chedva [1 ,2 ]
Orkaby, Brurya [1 ,3 ]
Kerner, Erika [4 ]
Saban, Mor [4 ]
机构
[1] Jerusalem Coll Technol, Lev Acad Ctr, Fac Sch Life & Hlth Sci, Nursing Dept, Jerusalem, Israel
[2] Chaim Sheba Med Ctr, Dept Vasc Surg, Tel Aviv, Israel
[3] Shaare Zedek Med Ctr, Dept Hemodialysis children, Jerusalem, Israel
[4] Tel Aviv Univ, Fac Med & Hlth Sci, Sch Hlth Profess, Dept Nursing Sci, Tel Aviv, Israel
关键词
MEDICATION ERRORS; IMPACT;
D O I
10.1038/s41390-025-03980-8
中图分类号
R72 [儿科学];
学科分类号
100202 ;
摘要
BACKGROUND AND OBJECTIVE: Medication errors in pediatric care remain a significant healthcare challenge despite technological advancements, necessitating innovative approaches. This study aims to evaluate Large Language Models' (LLMs) potential in reducing pediatric medication dosage calculation errors compared to experienced nurses. METHODS: This cross-sectional study (June-August 2024) involved 101 nurses from pediatric and neonatal departments and three LLMs (ChatGPT-4o, Claude-3.0, Llama 3 8B). Participants completed a nine-question survey on pediatric medication calculations. Primary outcomes were accuracy and response time. Secondary measures included seniority and group membership on accuracy. RESULTS: Significant differences (P < 0.001) were observed between nurses and LLMs. Nurses averaged 93.14 +/- 9.39 accuracy. Claude-3.0 and ChatGPT-4o achieved 100 accuracy, while Llama 3 8B was 66 accurate. LLMs were faster (15.7-75.12 seconds) than nurses (1621.2 +/- 8379.3 s).The Generalized Linear Model analysis revealed task performance was significantly influenced by duration (Wald chi(2) = 27,881.261, p < 0.001) and interaction between relative seniority and group membership (Wald chi(2) = 3,938.250, p < 0.001), with participants achieving a mean total grade of 91.03 (SD = 13.87). CONCLUSIONS: Claude-3.0 and ChatGPT-4o demonstrated perfect accuracy and rapid calculation capabilities, showing promise in reducing pediatric medication dosage errors. Further research is needed to explore their integration into practice.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Can Large Language Models Better Predict Software Vulnerability?
    Katsadouros, Evangelos
    Patrikakis, Charalampos Z.
    Hurlburt, George
    IT PROFESSIONAL, 2023, 25 (03) : 4 - 8
  • [42] Can Large Language Models be Anomaly Detectors for Time Series?
    Alnegheimish, Sarah
    Nguyen, Linh
    Berti-Equille, Laure
    Veeramachaneni, Kalyan
    2024 IEEE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, DSAA 2024, 2024, : 218 - 227
  • [43] Accuracy, Hallucinations, and Misgeneralizations of Large Language Models in Reviewing Cardiology Literature
    Fang, Spencer
    Pillai, Joshua
    Mahin, Baharullah
    Kim, Sarah
    CIRCULATION, 2024, 150
  • [44] The Accuracy of Domain Specific and Descriptive Analysis Generated by Large Language Models
    Otieno, Denish Omondi
    Abri, Faranak
    Siami-Namini, Sima
    Namin, Akbar Siami
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 1739 - 1746
  • [45] Large Language Models as Evaluators in Education: Verification of Feedback Consistency and Accuracy
    Seo, Hyein
    Hwang, Taewook
    Jung, Jeesu
    Kang, Hyeonseok
    Namgoong, Hyuk
    Lee, Yohan
    Jung, Sangkeun
    APPLIED SCIENCES-BASEL, 2025, 15 (02):
  • [46] Evaluating the Accuracy of Responses by Large Language Models for Information on Disease Epidemiology
    Zhu, Kexin
    Zhang, Jiajie
    Klishin, Anton
    Esser, Mario
    Blumentals, William A.
    Juhaeri, Juhaeri
    Jouquelet-Royer, Corinne
    Sinnott, Sarah-Jo
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2025, 34 (02)
  • [47] Accuracy of Large Language Models for Infective Endocarditis Prophylaxis in Dental Procedures
    Rewthamrongsris, Paak
    Burapacheep, Jirayu
    Trachoo, Vorapat
    Porntaveetus, Thantrira
    INTERNATIONAL DENTAL JOURNAL, 2025, 75 (01) : 206 - 212
  • [48] Leveraging Large Language Models to Improve Triage Accuracy in Emergency Departments
    Nashwan, Abdulqadir J.
    Abujaber, Ahmad A.
    JOURNAL OF EMERGENCY NURSING, 2023, 49 (05) : 651 - 653
  • [49] Large Language Models for Predicting Empathic Accuracy Between a Designer and a User
    Fabunmi, Oluwatoba
    Halgamuge, Saman
    Beck, Daniel
    Holtta-Otto, Katja
    JOURNAL OF MECHANICAL DESIGN, 2025, 147 (04)
  • [50] A simplified retriever to improve accuracy of phenotype normalizations by large language models
    Hier, Daniel B.
    Do, Thanh Son
    Obafemi-Ajayi, Tayo
    FRONTIERS IN DIGITAL HEALTH, 2025, 7