A Large Language Model Screening Tool to Target Patients for Best Practice Alerts: Development and Validation

Cited by: 7
Authors
Savage, Thomas [1 ,3 ]
Wang, John [2 ]
Shieh, Lisa [1 ]
Affiliations
[1] Stanford Univ, Div Hosp Med, Dept Med, Sch Med, Palo Alto, CA USA
[2] Stanford Univ, Dept Med, Division Gastroenterol & Hepatol, Palo Alto, CA USA
[3] Stanford Univ, Dept Med, Div Hosp Med, 300 Pasteur Dr, Palo Alto, CA 94304 USA
Keywords
large language models; language models; language model; EHR; health record; health records; quality improvement; Artificial Intelligence; Natural Language Processing
DOI
10.2196/49886
Chinese Library Classification number
R-058
Abstract
Background: Best Practice Alerts (BPAs) are alert messages to physicians in the electronic health record that are used to encourage appropriate use of health care resources. While these alerts are helpful in both improving care and reducing costs, BPAs are often broadly applied nonselectively across entire patient populations. The development of large language models (LLMs) provides an opportunity to selectively identify patients for BPAs.
Objective: In this paper, we present an example case where an LLM screening tool is used to select patients appropriate for a BPA encouraging the prescription of deep vein thrombosis (DVT) anticoagulation prophylaxis. The artificial intelligence (AI) screening tool was developed to identify patients experiencing acute bleeding and exclude them from receiving a DVT prophylaxis BPA.
Methods: Our AI screening tool used a BioMed-RoBERTa (Robustly Optimized Bidirectional Encoder Representations from Transformers Pretraining Approach; AllenAI) model to perform classification of physician notes, identifying patients without active bleeding and thus appropriate for a thromboembolism prophylaxis BPA. The BioMed-RoBERTa model was fine-tuned using 500 history and physical notes of patients from the MIMIC-III (Medical Information Mart for Intensive Care) database who were not prescribed anticoagulation. A development set of 300 MIMIC patient notes was used to determine the model's hyperparameters, and a separate test set of 300 patient notes was used to evaluate the screening tool.
Results: Our MIMIC-III test set population of 300 patients included 72 patients with bleeding (ie, were not appropriate for a DVT prophylaxis BPA) and 228 without bleeding who were appropriate for a DVT prophylaxis BPA. The AI screening tool achieved impressive accuracy with a precision-recall area under the curve of 0.82 (95% CI 0.75-0.89) and a receiver operator curve area under the curve of 0.89 (95% CI 0.84-0.94). The screening tool reduced the number of patients who would trigger an alert by 20% (240 instead of 300 alerts) and increased alert applicability by 14.8% (218 [90.8%] positive alerts from 240 total alerts instead of 228 [76%] positive alerts from 300 total alerts), compared to nonselectively sending alerts for all patients.
Conclusions: These results show a proof of concept on how language models can be used as a screening tool for BPAs. We provide an example AI screening tool that uses a HIPAA (Health Insurance Portability and Accountability Act)-compliant BioMed-RoBERTa model deployed with minimal computing power. Larger models (eg, Generative Pre-trained Transformers-3, Generative Pre-trained Transformers-4, and Pathways Language Model) will exhibit superior performance but require data use agreements to be HIPAA compliant. We anticipate LLMs to revolutionize quality improvement in hospital medicine.
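The alert figures quoted in the Results can be re-derived from the counts the abstract reports. The following is a minimal Python sketch of that arithmetic, not the authors' code. The variable names and the `pct` helper are illustrative, and the last three counts are derived here on the assumption that the 240 screened alerts are a subset of the 300 test patients; the abstract does not report them directly.

```python
# Counts taken from the Results section of the abstract.
total_patients = 300                   # test set size; also alerts without screening
non_bleeding = 228                     # appropriate for a DVT prophylaxis BPA
bleeding = 72                          # not appropriate for the BPA
alerts_with_screen = 240               # alerts sent after screening
appropriate_alerts_with_screen = 218   # screened alerts sent to non-bleeding patients


def pct(numerator, denominator):
    """Return numerator/denominator as a percentage rounded to one decimal."""
    return round(100 * numerator / denominator, 1)


# Share of alerts avoided by screening.
alert_reduction = pct(total_patients - alerts_with_screen, total_patients)

# Alert applicability: share of alerts that reach an appropriate patient.
applicability_without = pct(non_bleeding, total_patients)
applicability_with = pct(appropriate_alerts_with_screen, alerts_with_screen)

# The reported 14.8% gain is a difference in percentage points.
applicability_gain = round(applicability_with - applicability_without, 1)

# Derived counts (assumption: screened alerts are a subset of the test set).
alerts_to_bleeding = alerts_with_screen - appropriate_alerts_with_screen
missed_non_bleeding = non_bleeding - appropriate_alerts_with_screen
bleeding_excluded = bleeding - alerts_to_bleeding

print(alert_reduction)        # 20.0
print(applicability_without)  # 76.0
print(applicability_with)     # 90.8
print(applicability_gain)     # 14.8
print(alerts_to_bleeding, missed_non_bleeding, bleeding_excluded)  # 22 10 50
```

Read this way, the 14.8% increase in applicability is a gain of 14.8 percentage points (76% to 90.8%), not a relative increase.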
Pages: 6