Development and external validation of multimodal postoperative acute kidney injury risk machine learning models

被引:2
|
作者
Karway, George K. [1 ]
Koyner, Jay L. [2 ]
Caskey, John [1 ]
Spicer, Alexandra B. [1 ]
Carey, Kyle A. [2 ]
Gilbert, Emily R. [3 ]
Dligach, Dmitriy [4 ]
Mayampurath, Anoop [1 ,5 ]
Afshar, Majid [1 ,5 ]
Churpek, Matthew M. [1 ,5 ,6 ]
机构
[1] Univ Wisconsin, Dept Med, 600 Highland Ave, Madison, WI 53792 USA
[2] Univ Chicago, Dept Med, Sect Nephrol, Chicago, IL 60637 USA
[3] Loyola Univ Chicago, Dept Med, Chicago, IL 60153 USA
[4] Loyola Univ Chicago, Dept Comp Sci, Chicago, IL 60626 USA
[5] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53726 USA
[6] Univ Wisconsin, Dept Biostat & Med Informat, 600 Highland Ave, Madison, WI 53792 USA
关键词
multimodal models; artificial intelligence; intensive care unit; machine learning; acute kidney injury; natural language processing; ELECTRONIC MEDICAL-RECORDS; DE-IDENTIFICATION; PREDICTION; MORTALITY; INFORMATION; TEXT; DISCRIMINATION; CALIBRATION; DIAGNOSIS; OUTCOMES;
D O I
10.1093/jamiaopen/ooad109
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives To develop and externally validate machine learning models using structured and unstructured electronic health record data to predict postoperative acute kidney injury (AKI) across inpatient settings.Materials and Methods Data for adult postoperative admissions to the Loyola University Medical Center (2009-2017) were used for model development and admissions to the University of Wisconsin-Madison (2009-2020) were used for validation. Structured features included demographics, vital signs, laboratory results, and nurse-documented scores. Unstructured text from clinical notes were converted into concept unique identifiers (CUIs) using the clinical Text Analysis and Knowledge Extraction System. The primary outcome was the development of Kidney Disease Improvement Global Outcomes stage 2 AKI within 7 days after leaving the operating room. We derived unimodal extreme gradient boosting machines (XGBoost) and elastic net logistic regression (GLMNET) models using structured-only data and multimodal models combining structured data with CUI features. Model comparison was performed using the receiver operating characteristic curve (AUROC), with Delong's test for statistical differences.Results The study cohort included 138 389 adult patient admissions (mean [SD] age 58 [16] years; 11 506 [8%] African-American; and 70 826 [51%] female) across the 2 sites. Of those, 2959 (2.1%) developed stage 2 AKI or higher. Across all data types, XGBoost outperformed GLMNET (mean AUROC 0.81 [95% confidence interval (CI), 0.80-0.82] vs 0.78 [95% CI, 0.77-0.79]). The multimodal XGBoost model incorporating CUIs parameterized as term frequency-inverse document frequency (TF-IDF) showed the highest discrimination performance (AUROC 0.82 [95% CI, 0.81-0.83]) over unimodal models (AUROC 0.79 [95% CI, 0.78-0.80]).Discussion A multimodality approach with structured data and TF-IDF weighting of CUIs increased model performance over structured data-only models.Conclusion These findings highlight the predictive power of CUIs when merged with structured data for clinical prediction models, which may improve the detection of postoperative AKI. Acute kidney injury (AKI) after an operation, called postoperative AKI, is common in hospitalized patients and associated with increased morbidity and mortality. Early detection of high-risk patients could facilitate timely treatment and improve outcomes. Although a few studies have developed machine learning (ML) models to identify patients with postoperative AKI, these are primarily limited to structured data (eg, laboratory values) and ignore predictors from clinical notes. Further, models built from clinical notes are often not externally validated because doing so risks leaking protected health information.Given these limitations in the field, we developed and externally validated ML models to predict postoperative AKI using structured data and information from clinical notes. To preserve patient privacy, we used concept unique identifiers (CUIs), which are de-identified medical terms from clinical notes. We compared unimodal models with structured data to multimodal models with CUIs plus structured data, as well as different approaches to modeling the CUI data. We found that multimodal models significantly improved model performance compared to unimodal models. We also found that normalizing CUI data based on term frequency had the highest performance. In conclusion, using CUIs to account for information in clinical notes adds significant value for predicting postoperative AKI.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Internal and External Validation of a Machine Learning Risk Score for Acute Kidney Injury
    Churpek, Matthew M.
    Carey, Kyle A.
    Edelson, Dana P.
    Singh, Tripti
    Astor, Brad C.
    Gilbert, Emily R.
    Winslow, Christopher
    Shah, Nirav
    Afshar, Majid
    Koyner, Jay L.
    JAMA NETWORK OPEN, 2020, 3 (08) : E2012892
  • [2] Development, External Validation, and Visualization of Machine Learning Models for Predicting Occurrence of Acute Kidney Injury after Cardiac Surgery
    Shao, Jiakang
    Liu, Feng
    Ji, Shuaifei
    Song, Chao
    Ma, Yan
    Shen, Ming
    Sun, Yuntian
    Zhu, Siming
    Guo, Yilong
    Liu, Bing
    Wu, Yuanbin
    Qin, Handai
    Lai, Shengwei
    Fan, Yunlong
    REVIEWS IN CARDIOVASCULAR MEDICINE, 2023, 24 (08)
  • [3] Development and external validation of an acute kidney injury risk score for use in the general population
    Bell, Samira
    James, Matthew T.
    Farmer, Chris K. T.
    Tan, Zhi
    de Souza, Nicosha
    Witham, Miles D.
    CLINICAL KIDNEY JOURNAL, 2020, 13 (03) : 402 - 412
  • [4] Development and validation of a machine-learning model for predicting the risk of death in sepsis patients with acute kidney injury
    Dong, Lei
    Liu, Pei
    Qi, Zhili
    Lin, Jin
    Duan, Meili
    HELIYON, 2024, 10 (09)
  • [5] Internal and external validation of machine learning-assisted prediction models for mechanical ventilation-associated severe acute kidney injury
    Huang, Sai
    Teng, Yue
    Du, Jiajun
    Zhou, Xuan
    Duan, Feng
    Feng, Cong
    AUSTRALIAN CRITICAL CARE, 2023, 36 (04) : 604 - 612
  • [6] Advances in machine learning prediction models for acute kidney injury
    Zeng, Zhihe
    Du, Qing
    Zhang, Bin
    Xiao, Zhaoyang
    PANMINERVA MEDICA, 2024,
  • [7] Development and validation of interpretable machine learning models for postoperative pneumonia prediction
    Xiang, Bingbing
    Liu, Yiran
    Jiao, Shulan
    Zhang, Wensheng
    Wang, Shun
    Yi, Mingliang
    FRONTIERS IN PUBLIC HEALTH, 2024, 12
  • [8] Internal and External Validation of Machine Learning Models for Predicting Acute Kidney Injury Following Non-Cardiac Surgery Using Open Datasets
    Lee, Sang-Wook
    Jang, Jaewon
    Seo, Woo-Young
    Lee, Donghee
    Kim, Sung-Hoon
    JOURNAL OF PERSONALIZED MEDICINE, 2024, 14 (06):
  • [9] Machine Learning Models of Acute Kidney Injury Prediction in Acute Pancreatitis Patients
    Qu, Cheng
    Gao, Lin
    Yu, Xian-qiang
    Wei, Mei
    Fang, Guo-quan
    He, Jianing
    Cao, Long-xiang
    Ke, Lu
    Tong, Zhi-hui
    Li, Wei-qin
    GASTROENTEROLOGY RESEARCH AND PRACTICE, 2020, 2020
  • [10] Development and Validation of a Machine Learning Predictive Model for Cardiac Surgery-Associated Acute Kidney Injury
    Li, Qian
    Lv, Hong
    Chen, Yuye
    Shen, Jingjia
    Shi, Jia
    Zhou, Chenghui
    JOURNAL OF CLINICAL MEDICINE, 2023, 12 (03)