A Large Language Model Screening Tool to Target Patients for Best Practice Alerts: Development and Validation

Cited by: 7
Authors
Savage, Thomas [1, 3]
Wang, John [2]
Shieh, Lisa [1]
Affiliations
[1] Stanford Univ, Div Hosp Med, Dept Med, Sch Med, Palo Alto, CA USA
[2] Stanford Univ, Dept Med, Division Gastroenterol & Hepatol, Palo Alto, CA USA
[3] Stanford Univ, Dept Med, Div Hosp Med, 300 Pasteur Dr, Palo Alto, CA 94304 USA
Keywords
large language models; language models; language model; EHR; health record; health records; quality improvement; Artificial Intelligence; Natural Language Processing
DOI
10.2196/49886
Chinese Library Classification number
R-058
Abstract
Background: Best Practice Alerts (BPAs) are alert messages to physicians in the electronic health record that are used to encourage appropriate use of health care resources. While these alerts are helpful in both improving care and reducing costs, BPAs are often broadly applied nonselectively across entire patient populations. The development of large language models (LLMs) provides an opportunity to selectively identify patients for BPAs.
Objective: In this paper, we present an example case where an LLM screening tool is used to select patients appropriate for a BPA encouraging the prescription of deep vein thrombosis (DVT) anticoagulation prophylaxis. The artificial intelligence (AI) screening tool was developed to identify patients experiencing acute bleeding and exclude them from receiving a DVT prophylaxis BPA.
Methods: Our AI screening tool used a BioMed-RoBERTa (Robustly Optimized Bidirectional Encoder Representations from Transformers Pretraining Approach; AllenAI) model to perform classification of physician notes, identifying patients without active bleeding and thus appropriate for a thromboembolism prophylaxis BPA. The BioMed-RoBERTa model was fine-tuned using 500 history and physical notes of patients from the MIMIC-III (Medical Information Mart for Intensive Care) database who were not prescribed anticoagulation. A development set of 300 MIMIC patient notes was used to determine the model's hyperparameters, and a separate test set of 300 patient notes was used to evaluate the screening tool.
Results: Our MIMIC-III test set population of 300 patients included 72 patients with bleeding (ie, were not appropriate for a DVT prophylaxis BPA) and 228 without bleeding who were appropriate for a DVT prophylaxis BPA. The AI screening tool achieved impressive accuracy with a precision-recall area under the curve of 0.82 (95% CI 0.75-0.89) and a receiver operator curve area under the curve of 0.89 (95% CI 0.84-0.94). The screening tool reduced the number of patients who would trigger an alert by 20% (240 instead of 300 alerts) and increased alert applicability by 14.8% (218 [90.8%] positive alerts from 240 total alerts instead of 228 [76%] positive alerts from 300 total alerts), compared to nonselectively sending alerts for all patients.
Conclusions: These results show a proof of concept on how language models can be used as a screening tool for BPAs. We provide an example AI screening tool that uses a HIPAA (Health Insurance Portability and Accountability Act)-compliant BioMed-RoBERTa model deployed with minimal computing power. Larger models (eg, Generative Pre-trained Transformers-3, Generative Pre-trained Transformers-4, and Pathways Language Model) will exhibit superior performance but require data use agreements to be HIPAA compliant. We anticipate LLMs to revolutionize quality improvement in hospital medicine.
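The alert figures quoted in the Results can be checked with a few lines of arithmetic. The sketch below is not from the paper: the class and property names are hypothetical, and the sensitivity and specificity values are derived here from the four reported counts (300 patients, 228 appropriate, 240 alerts, 218 appropriate alerts) rather than reported by the authors. It also shows that the 14.8% gain is a difference in percentage points (90.8% versus 76%).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlertCounts:
    """Counts taken from the abstract; field names are illustrative only."""
    total_patients: int        # patients in the test set
    appropriate_patients: int  # patients without bleeding (BPA appropriate)
    alerts_sent: int           # alerts fired after screening
    appropriate_alerts: int    # alerts that reached appropriate patients

    @property
    def alert_reduction(self) -> float:
        # Fraction of alerts avoided relative to alerting every patient.
        return (self.total_patients - self.alerts_sent) / self.total_patients

    @property
    def baseline_applicability(self) -> float:
        # Applicability if every patient triggered an alert.
        return self.appropriate_patients / self.total_patients

    @property
    def screened_applicability(self) -> float:
        # Applicability of the alerts that still fire after screening.
        return self.appropriate_alerts / self.alerts_sent

    @property
    def sensitivity(self) -> float:
        # Derived, not reported: appropriate patients who still get an alert.
        return self.appropriate_alerts / self.appropriate_patients

    @property
    def specificity(self) -> float:
        # Derived, not reported: bleeding patients whose alert is suppressed.
        bleeding = self.total_patients - self.appropriate_patients
        alerted_bleeding = self.alerts_sent - self.appropriate_alerts
        return (bleeding - alerted_bleeding) / bleeding


counts = AlertCounts(total_patients=300, appropriate_patients=228,
                     alerts_sent=240, appropriate_alerts=218)
gain_points = 100 * (counts.screened_applicability - counts.baseline_applicability)

print(f"alert reduction:        {counts.alert_reduction:.1%}")
print(f"baseline applicability: {counts.baseline_applicability:.1%}")
print(f"screened applicability: {counts.screened_applicability:.1%}")
print(f"applicability gain:     {gain_points:.1f} percentage points")
print(f"derived sensitivity:    {counts.sensitivity:.1%}")
print(f"derived specificity:    {counts.specificity:.1%}")
```

Under these counts, 22 of the 240 alerts would still reach patients with bleeding and 10 appropriate patients would receive no alert; both numbers follow from the reported totals and depend on the operating threshold the authors chose.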
Pages: 6