A Large Language Model Screening Tool to Target Patients for Best Practice Alerts: Development and Validation

Cited by: 7

Authors:

Savage, Thomas [1,3]

Wang, John [2]

Shieh, Lisa [1]

Affiliations:

[1] Stanford Univ, Div Hosp Med, Dept Med, Sch Med, Palo Alto, CA USA

[2] Stanford Univ, Dept Med, Division Gastroenterol & Hepatol, Palo Alto, CA USA

[3] Stanford Univ, Dept Med, Div Hosp Med, 300 Pasteur Dr, Palo Alto, CA 94304 USA

Source:

JMIR MEDICAL INFORMATICS | 2023 / Vol. 11

Keywords:

large language models; language models; language model; EHR; health record; health records; quality improvement; Artificial Intelligence; Natural Language Processing;

DOI:

10.2196/49886

Chinese Library Classification (CLC) number:

R-058

Subject classification code:

Abstract:

Background: Best Practice Alerts (BPAs) are alert messages to physicians in the electronic health record that are used to encourage appropriate use of health care resources. While these alerts are helpful in both improving care and reducing costs, BPAs are often broadly applied nonselectively across entire patient populations. The development of large language models (LLMs) provides an opportunity to selectively identify patients for BPAs.

Objective: In this paper, we present an example case where an LLM screening tool is used to select patients appropriate for a BPA encouraging the prescription of deep vein thrombosis (DVT) anticoagulation prophylaxis. The artificial intelligence (AI) screening tool was developed to identify patients experiencing acute bleeding and exclude them from receiving a DVT prophylaxis BPA.

Methods: Our AI screening tool used a BioMed-RoBERTa (Robustly Optimized Bidirectional Encoder Representations from Transformers Pretraining Approach; AllenAI) model to perform classification of physician notes, identifying patients without active bleeding and thus appropriate for a thromboembolism prophylaxis BPA. The BioMed-RoBERTa model was fine-tuned using 500 history and physical notes of patients from the MIMIC-III (Medical Information Mart for Intensive Care) database who were not prescribed anticoagulation. A development set of 300 MIMIC patient notes was used to determine the model's hyperparameters, and a separate test set of 300 patient notes was used to evaluate the screening tool.

Results: Our MIMIC-III test set population of 300 patients included 72 patients with bleeding (ie, were not appropriate for a DVT prophylaxis BPA) and 228 without bleeding who were appropriate for a DVT prophylaxis BPA. The AI screening tool achieved impressive accuracy with a precision-recall area under the curve of 0.82 (95% CI 0.75-0.89) and a receiver operator curve area under the curve of 0.89 (95% CI 0.84-0.94). The screening tool reduced the number of patients who would trigger an alert by 20% (240 instead of 300 alerts) and increased alert applicability by 14.8% (218 [90.8%] positive alerts from 240 total alerts instead of 228 [76%] positive alerts from 300 total alerts), compared to nonselectively sending alerts for all patients.

Conclusions: These results show a proof of concept on how language models can be used as a screening tool for BPAs. We provide an example AI screening tool that uses a HIPAA (Health Insurance Portability and Accountability Act)-compliant BioMed-RoBERTa model deployed with minimal computing power. Larger models (eg, Generative Pre-trained Transformers-3, Generative Pre-trained Transformers-4, and Pathways Language Model) will exhibit superior performance but require data use agreements to be HIPAA compliant. We anticipate LLMs to revolutionize quality improvement in hospital medicine.
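
The alert figures reported in the Results can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not the authors' code: the `AlertSummary` class and its field names are hypothetical, and only the four counts (300, 228, 240, 218) come from the abstract. It suggests that the reported 14.8% increase in applicability is the difference between the two applicability rates, that is, 14.8 percentage points.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlertSummary:
    """Hypothetical container for alert counts under one alerting strategy."""
    total_alerts: int        # alerts that would fire
    appropriate_alerts: int  # alerts sent to patients without bleeding

    @property
    def applicability(self) -> float:
        # Share of alerts that reach a patient appropriate for the BPA.
        return self.appropriate_alerts / self.total_alerts


# Counts taken from the abstract's Results section.
baseline = AlertSummary(total_alerts=300, appropriate_alerts=228)  # alert everyone
screened = AlertSummary(total_alerts=240, appropriate_alerts=218)  # after screening

# Relative reduction in the number of alerts fired.
alert_reduction = 1 - screened.total_alerts / baseline.total_alerts

# Absolute change in applicability (a difference of two proportions).
applicability_gain = screened.applicability - baseline.applicability

print(f"Alert reduction:        {alert_reduction:.1%}")
print(f"Baseline applicability: {baseline.applicability:.1%}")
print(f"Screened applicability: {screened.applicability:.1%}")
print(f"Applicability gain:     {applicability_gain * 100:.1f} percentage points")
```

Under these counts, the screening step suppresses 60 alerts, and 10 of the 228 patients without bleeding no longer receive one (228 minus 218); these two derived numbers follow from the abstract's counts but are not stated in it.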


Pages: 6

