Can large language models assist with pediatric dosing accuracy?

被引:0
|
作者
Levin, Chedva [1 ,2 ]
Orkaby, Brurya [1 ,3 ]
Kerner, Erika [4 ]
Saban, Mor [4 ]
机构
[1] Jerusalem Coll Technol, Lev Acad Ctr, Fac Sch Life & Hlth Sci, Nursing Dept, Jerusalem, Israel
[2] Chaim Sheba Med Ctr, Dept Vasc Surg, Tel Aviv, Israel
[3] Shaare Zedek Med Ctr, Dept Hemodialysis children, Jerusalem, Israel
[4] Tel Aviv Univ, Fac Med & Hlth Sci, Sch Hlth Profess, Dept Nursing Sci, Tel Aviv, Israel
关键词
MEDICATION ERRORS; IMPACT;
D O I
10.1038/s41390-025-03980-8
中图分类号
R72 [儿科学];
学科分类号
100202 ;
摘要
BACKGROUND AND OBJECTIVE: Medication errors in pediatric care remain a significant healthcare challenge despite technological advancements, necessitating innovative approaches. This study aims to evaluate Large Language Models' (LLMs) potential in reducing pediatric medication dosage calculation errors compared to experienced nurses. METHODS: This cross-sectional study (June-August 2024) involved 101 nurses from pediatric and neonatal departments and three LLMs (ChatGPT-4o, Claude-3.0, Llama 3 8B). Participants completed a nine-question survey on pediatric medication calculations. Primary outcomes were accuracy and response time. Secondary measures included seniority and group membership on accuracy. RESULTS: Significant differences (P < 0.001) were observed between nurses and LLMs. Nurses averaged 93.14 +/- 9.39 accuracy. Claude-3.0 and ChatGPT-4o achieved 100 accuracy, while Llama 3 8B was 66 accurate. LLMs were faster (15.7-75.12 seconds) than nurses (1621.2 +/- 8379.3 s).The Generalized Linear Model analysis revealed task performance was significantly influenced by duration (Wald chi(2) = 27,881.261, p < 0.001) and interaction between relative seniority and group membership (Wald chi(2) = 3,938.250, p < 0.001), with participants achieving a mean total grade of 91.03 (SD = 13.87). CONCLUSIONS: Claude-3.0 and ChatGPT-4o demonstrated perfect accuracy and rapid calculation capabilities, showing promise in reducing pediatric medication dosage errors. Further research is needed to explore their integration into practice.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Can Large Language Models Assist in Hazard Analysis?
    Diemert, Simon
    Weber, Jens H.
    COMPUTER SAFETY, RELIABILITY, AND SECURITY, SAFECOMP 2023 WORKSHOPS, 2023, 14182 : 410 - 422
  • [2] How can large language models assist with a FRAM analysis?
    Sujan, M.
    Slater, D.
    Crumpton, E.
    SAFETY SCIENCE, 2025, 181
  • [3] LLM-Mod: Can Large Language Models Assist Content Moderation?
    Kolla, Mahi
    Salunkhe, Siddharth
    Chandrasekharan, Eshwar
    Saha, Koustuv
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [4] Can large language models fully automate or partially assist paper selection in systematic reviews?
    Chen, Haichao
    Jiang, Zehua
    Liu, Xinyu
    Xue, Can Can
    Yew, Samantha Min Er
    Sheng, Bin
    Zheng, Ying-Feng
    Wang, Xiaofei
    Wu, You
    Sivaprasad, Sobha
    Wong, Tien Yin
    Chaudhary, Varun
    Tham, Yih Chung
    BRITISH JOURNAL OF OPHTHALMOLOGY, 2025,
  • [5] Large language models in cryptocurrency securities cases: can a GPT model meaningfully assist lawyers?
    Trozze, Arianna
    Davies, Toby
    Kleinberg, Bennett
    ARTIFICIAL INTELLIGENCE AND LAW, 2024,
  • [6] Coffee grinders assist pediatric dosing
    Burkhart, CG
    CUTIS, 2000, 65 (05): : 276 - 276
  • [7] Performance and Accuracy Research of the Large Language Models
    Gaitan, Nicoleta Cristina
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 62 - 69
  • [8] EVALUATING LARGE LANGUAGE MODELS ON THEIR ACCURACY AND COMPLETENESS
    Edalat, Camellia
    Kirupaharan, Nila
    Dalvin, Lauren A.
    Mishra, Kapil
    Marshall, Rayna
    Xu, Hannah
    Francis, Jasmine H.
    Berkenstock, Meghan
    RETINA-THE JOURNAL OF RETINAL AND VITREOUS DISEASES, 2025, 45 (01): : 128 - 132
  • [9] Diagnostic accuracy of large language models in psychiatry
    Gargari, Omid Kohandel
    Fatehi, Farhad
    Mohammadi, Ida
    Firouzabadi, Shahryar Rajai
    Shafiee, Arman
    Habibi, Gholamreza
    ASIAN JOURNAL OF PSYCHIATRY, 2024, 100
  • [10] Evaluating large language models in pediatric nephrology
    Filler, Guido
    Niel, Olivier
    PEDIATRIC NEPHROLOGY, 2025,