Development and external validation of multimodal postoperative acute kidney injury risk machine learning models

被引:2
|
作者
Karway, George K. [1 ]
Koyner, Jay L. [2 ]
Caskey, John [1 ]
Spicer, Alexandra B. [1 ]
Carey, Kyle A. [2 ]
Gilbert, Emily R. [3 ]
Dligach, Dmitriy [4 ]
Mayampurath, Anoop [1 ,5 ]
Afshar, Majid [1 ,5 ]
Churpek, Matthew M. [1 ,5 ,6 ]
机构
[1] Univ Wisconsin, Dept Med, 600 Highland Ave, Madison, WI 53792 USA
[2] Univ Chicago, Dept Med, Sect Nephrol, Chicago, IL 60637 USA
[3] Loyola Univ Chicago, Dept Med, Chicago, IL 60153 USA
[4] Loyola Univ Chicago, Dept Comp Sci, Chicago, IL 60626 USA
[5] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53726 USA
[6] Univ Wisconsin, Dept Biostat & Med Informat, 600 Highland Ave, Madison, WI 53792 USA
关键词
multimodal models; artificial intelligence; intensive care unit; machine learning; acute kidney injury; natural language processing; ELECTRONIC MEDICAL-RECORDS; DE-IDENTIFICATION; PREDICTION; MORTALITY; INFORMATION; TEXT; DISCRIMINATION; CALIBRATION; DIAGNOSIS; OUTCOMES;
D O I
10.1093/jamiaopen/ooad109
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives To develop and externally validate machine learning models using structured and unstructured electronic health record data to predict postoperative acute kidney injury (AKI) across inpatient settings.Materials and Methods Data for adult postoperative admissions to the Loyola University Medical Center (2009-2017) were used for model development and admissions to the University of Wisconsin-Madison (2009-2020) were used for validation. Structured features included demographics, vital signs, laboratory results, and nurse-documented scores. Unstructured text from clinical notes were converted into concept unique identifiers (CUIs) using the clinical Text Analysis and Knowledge Extraction System. The primary outcome was the development of Kidney Disease Improvement Global Outcomes stage 2 AKI within 7 days after leaving the operating room. We derived unimodal extreme gradient boosting machines (XGBoost) and elastic net logistic regression (GLMNET) models using structured-only data and multimodal models combining structured data with CUI features. Model comparison was performed using the receiver operating characteristic curve (AUROC), with Delong's test for statistical differences.Results The study cohort included 138 389 adult patient admissions (mean [SD] age 58 [16] years; 11 506 [8%] African-American; and 70 826 [51%] female) across the 2 sites. Of those, 2959 (2.1%) developed stage 2 AKI or higher. Across all data types, XGBoost outperformed GLMNET (mean AUROC 0.81 [95% confidence interval (CI), 0.80-0.82] vs 0.78 [95% CI, 0.77-0.79]). The multimodal XGBoost model incorporating CUIs parameterized as term frequency-inverse document frequency (TF-IDF) showed the highest discrimination performance (AUROC 0.82 [95% CI, 0.81-0.83]) over unimodal models (AUROC 0.79 [95% CI, 0.78-0.80]).Discussion A multimodality approach with structured data and TF-IDF weighting of CUIs increased model performance over structured data-only models.Conclusion These findings highlight the predictive power of CUIs when merged with structured data for clinical prediction models, which may improve the detection of postoperative AKI. Acute kidney injury (AKI) after an operation, called postoperative AKI, is common in hospitalized patients and associated with increased morbidity and mortality. Early detection of high-risk patients could facilitate timely treatment and improve outcomes. Although a few studies have developed machine learning (ML) models to identify patients with postoperative AKI, these are primarily limited to structured data (eg, laboratory values) and ignore predictors from clinical notes. Further, models built from clinical notes are often not externally validated because doing so risks leaking protected health information.Given these limitations in the field, we developed and externally validated ML models to predict postoperative AKI using structured data and information from clinical notes. To preserve patient privacy, we used concept unique identifiers (CUIs), which are de-identified medical terms from clinical notes. We compared unimodal models with structured data to multimodal models with CUIs plus structured data, as well as different approaches to modeling the CUI data. We found that multimodal models significantly improved model performance compared to unimodal models. We also found that normalizing CUI data based on term frequency had the highest performance. In conclusion, using CUIs to account for information in clinical notes adds significant value for predicting postoperative AKI.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Machine Learning to Predict Acute Kidney Injury
    Wilson, F. Perry
    AMERICAN JOURNAL OF KIDNEY DISEASES, 2020, 75 (06) : 965 - 967
  • [22] Development and validation of a risk nomogram for postoperative acute kidney injury in older patients undergoing liver resection: a pilot study
    Yu, Yao
    Zhang, Changsheng
    Zhang, Faqiang
    Liu, Chang
    Li, Hao
    Lou, Jingsheng
    Xu, Zhipeng
    Liu, Yanhong
    Cao, Jiangbei
    Mi, Weidong
    BMC ANESTHESIOLOGY, 2022, 22 (01)
  • [23] Development and validation of a risk nomogram for postoperative acute kidney injury in older patients undergoing liver resection: a pilot study
    Yao Yu
    Changsheng Zhang
    Faqiang Zhang
    Chang Liu
    Hao Li
    Jingsheng Lou
    Zhipeng Xu
    Yanhong Liu
    Jiangbei Cao
    Weidong Mi
    BMC Anesthesiology, 22
  • [24] Development and validation of machine learning models for the prediction of blunt cerebrovascular injury in children
    Farzaneh, Cyrus A.
    Schomberg, John
    Sullivan, Brittany G.
    Guner, Yigit S.
    Nance, Michael L.
    Gibbs, David
    Yu, Peter T.
    JOURNAL OF PEDIATRIC SURGERY, 2022, 57 (04) : 732 - 738
  • [25] Prediction Models and Their External Validation Studies for Mortality of Patients with Acute Kidney Injury: A Systematic Review
    Ohnuma, Tetsu
    Uchino, Shigehiko
    PLOS ONE, 2017, 12 (01):
  • [26] Deep Learning to Predict Postoperative Acute Kidney Injury
    Rakocz, Nadav
    Hill, Brian L.
    Sankararaman, Sriram
    Halperin, Eran
    Hofer, Ira
    ANESTHESIA AND ANALGESIA, 2019, 128
  • [27] Development and Validation of Machine Learning Models for Real-Time Mortality Prediction in Critically Ill Patients With Sepsis-Associated Acute Kidney Injury
    Luo, Xiao-Qin
    Yan, Ping
    Duan, Shao-Bin
    Kang, Yi-Xin
    Deng, Ying-Hao
    Liu, Qian
    Wu, Ting
    Wu, Xi
    FRONTIERS IN MEDICINE, 2022, 9
  • [28] External validation of the Madrid Acute Kidney Injury Prediction Score
    Del Carpio, Jacqueline
    Paz Marco, Maria
    Luisa Martin, Maria
    Craver, Lourdes
    Jatem, Elias
    Gonzalez, Jorge
    Chang, Pamela
    Ibarz, Mercedes
    Pico, Silvia
    Falcon, Gloria
    Canales, Marina
    Huertas, Elisard
    Romero, Inaki
    Nieto, Nacho
    Segarra, Alfons
    CLINICAL KIDNEY JOURNAL, 2021, 14 (11) : 2377 - 2382
  • [29] DEVELOPMENT AND VALIDATION OF AN ACUTE KIDNEY INJURY RISK PREDICTION SCORE FOR USE IN THE COMMUNITY
    Bell, Samira
    Hounkpatin, Hilda
    De Souza, Nicosha
    Roderick, Paul
    Fraser, Simon
    Witham, Miles
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2017, 32 : 167 - +
  • [30] Prediction of the development of acute kidney injury following cardiac surgery by machine learning
    Po-Yu Tseng
    Yi-Ting Chen
    Chuen-Heng Wang
    Kuan-Ming Chiu
    Yu-Sen Peng
    Shih-Ping Hsu
    Kang-Lung Chen
    Chih-Yu Yang
    Oscar Kuang-Sheng Lee
    Critical Care, 24