Development and external validation of multimodal postoperative acute kidney injury risk machine learning models

被引:2
|
作者
Karway, George K. [1 ]
Koyner, Jay L. [2 ]
Caskey, John [1 ]
Spicer, Alexandra B. [1 ]
Carey, Kyle A. [2 ]
Gilbert, Emily R. [3 ]
Dligach, Dmitriy [4 ]
Mayampurath, Anoop [1 ,5 ]
Afshar, Majid [1 ,5 ]
Churpek, Matthew M. [1 ,5 ,6 ]
机构
[1] Univ Wisconsin, Dept Med, 600 Highland Ave, Madison, WI 53792 USA
[2] Univ Chicago, Dept Med, Sect Nephrol, Chicago, IL 60637 USA
[3] Loyola Univ Chicago, Dept Med, Chicago, IL 60153 USA
[4] Loyola Univ Chicago, Dept Comp Sci, Chicago, IL 60626 USA
[5] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53726 USA
[6] Univ Wisconsin, Dept Biostat & Med Informat, 600 Highland Ave, Madison, WI 53792 USA
关键词
multimodal models; artificial intelligence; intensive care unit; machine learning; acute kidney injury; natural language processing; ELECTRONIC MEDICAL-RECORDS; DE-IDENTIFICATION; PREDICTION; MORTALITY; INFORMATION; TEXT; DISCRIMINATION; CALIBRATION; DIAGNOSIS; OUTCOMES;
D O I
10.1093/jamiaopen/ooad109
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives To develop and externally validate machine learning models using structured and unstructured electronic health record data to predict postoperative acute kidney injury (AKI) across inpatient settings.Materials and Methods Data for adult postoperative admissions to the Loyola University Medical Center (2009-2017) were used for model development and admissions to the University of Wisconsin-Madison (2009-2020) were used for validation. Structured features included demographics, vital signs, laboratory results, and nurse-documented scores. Unstructured text from clinical notes were converted into concept unique identifiers (CUIs) using the clinical Text Analysis and Knowledge Extraction System. The primary outcome was the development of Kidney Disease Improvement Global Outcomes stage 2 AKI within 7 days after leaving the operating room. We derived unimodal extreme gradient boosting machines (XGBoost) and elastic net logistic regression (GLMNET) models using structured-only data and multimodal models combining structured data with CUI features. Model comparison was performed using the receiver operating characteristic curve (AUROC), with Delong's test for statistical differences.Results The study cohort included 138 389 adult patient admissions (mean [SD] age 58 [16] years; 11 506 [8%] African-American; and 70 826 [51%] female) across the 2 sites. Of those, 2959 (2.1%) developed stage 2 AKI or higher. Across all data types, XGBoost outperformed GLMNET (mean AUROC 0.81 [95% confidence interval (CI), 0.80-0.82] vs 0.78 [95% CI, 0.77-0.79]). The multimodal XGBoost model incorporating CUIs parameterized as term frequency-inverse document frequency (TF-IDF) showed the highest discrimination performance (AUROC 0.82 [95% CI, 0.81-0.83]) over unimodal models (AUROC 0.79 [95% CI, 0.78-0.80]).Discussion A multimodality approach with structured data and TF-IDF weighting of CUIs increased model performance over structured data-only models.Conclusion These findings highlight the predictive power of CUIs when merged with structured data for clinical prediction models, which may improve the detection of postoperative AKI. Acute kidney injury (AKI) after an operation, called postoperative AKI, is common in hospitalized patients and associated with increased morbidity and mortality. Early detection of high-risk patients could facilitate timely treatment and improve outcomes. Although a few studies have developed machine learning (ML) models to identify patients with postoperative AKI, these are primarily limited to structured data (eg, laboratory values) and ignore predictors from clinical notes. Further, models built from clinical notes are often not externally validated because doing so risks leaking protected health information.Given these limitations in the field, we developed and externally validated ML models to predict postoperative AKI using structured data and information from clinical notes. To preserve patient privacy, we used concept unique identifiers (CUIs), which are de-identified medical terms from clinical notes. We compared unimodal models with structured data to multimodal models with CUIs plus structured data, as well as different approaches to modeling the CUI data. We found that multimodal models significantly improved model performance compared to unimodal models. We also found that normalizing CUI data based on term frequency had the highest performance. In conclusion, using CUIs to account for information in clinical notes adds significant value for predicting postoperative AKI.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Prediction of the development of acute kidney injury following cardiac surgery by machine learning
    Tseng, Po-Yu
    Chen, Yi-Ting
    Wang, Chuen-Heng
    Chiu, Kuan-Ming
    Peng, Yu-Sen
    Hsu, Shih-Ping
    Chen, Kang-Lung
    Yang, Chih-Yu
    Lee, Oscar Kuang-Sheng
    CRITICAL CARE, 2020, 24 (01)
  • [32] Application of Machine Learning Algorithms to Predict Acute Kidney Injury in Elderly Orthopedic Postoperative Patients
    Chen, Qiuchong
    Zhang, Yixue
    Zhang, Mengjun
    Li, Ziying
    Liu, Jindong
    CLINICAL INTERVENTIONS IN AGING, 2022, 17 : 317 - 330
  • [33] Machine learning in the prediction of cardiac surgery associated acute kidney injury with early postoperative biomarkers
    Fan, Rui
    Qin, Wei
    Zhang, Hao
    Guan, Lichun
    Wang, Wuwei
    Li, Jian
    Chen, Wen
    Huang, Fuhua
    Zhang, Hang
    Chen, Xin
    FRONTIERS IN SURGERY, 2023, 10
  • [34] External validation of risk stratification strategy in the use of renal ultrasonography in the evaluation of acute kidney injury
    Ip, Ivan K.
    Silveira, Patricia C.
    Alper, Emily C.
    Benson, Carol B.
    Khorasani, Ramin
    JOURNAL OF HOSPITAL MEDICINE, 2016, 11 (11) : 763 - 767
  • [35] Machine Learning Model for Risk Prediction of Community-Acquired Acute Kidney Injury Hospitalization From Electronic Health Records: Development and Validation Study
    Hsu, Chien-Ning
    Liu, Chien-Liang
    Tain, You-Lin
    Kuo, Chin-Yu
    Lin, Yun-Chun
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (08)
  • [36] Validation of Risk Prediction Models to Inform Clinical Decisions After Acute Kidney Injury
    Sawhney, Simon
    Tan, Zhi
    Black, Corri
    Marks, Angharad
    Mclernon, David J.
    Ronksley, Paul
    James, Matthew T.
    AMERICAN JOURNAL OF KIDNEY DISEASES, 2021, 78 (01) : 28 - 37
  • [37] Machine learning models for predicting acute kidney injury: a systematic review and critical appraisal
    Vagliano, Iacopo
    Chesnaye, Nicholas C.
    Leopold, Jan Hendrik
    Jager, Kitty J.
    Abu-Hanna, Ameen
    Schut, Martijn C.
    CLINICAL KIDNEY JOURNAL, 2022, 15 (12) : 2266 - 2280
  • [38] Predicting the risk of postoperative acute kidney injury: development and assessment of a novel predictive nomogram
    Wu, Yukun
    Chen, Junxing
    Luo, Cheng
    Chen, Lingwu
    Huang, Bin
    JOURNAL OF INTERNATIONAL MEDICAL RESEARCH, 2021, 49 (08)
  • [39] Machine Learning for Predicting the Development of Postoperative Acute Kidney Injury After Coronary Artery Bypass Grafting Without Extracorporeal Circulation
    Zheng, Sai
    Li, Yugui
    Luo, Cheng
    Chen, Fang
    Ling, Guoxing
    Zheng, Baoshi
    CARDIOVASCULAR INNOVATIONS AND APPLICATIONS, 2022, 7 (01)
  • [40] Development and internal validation of machine learning-based models and external validation of existing risk scores for outcome prediction in patients with ischaemic stroke
    Axford, Daniel
    Sohel, Ferdous
    Abedi, Vida
    Zhu, Ye
    Zand, Ramin
    Barkoudah, Ebrahim
    Krupica, Troy
    Iheasirim, Kingsley
    Sharma, Umesh M.
    Dugani, Sagar B.
    Takahashi, Paul Y.
    Bhagra, Sumit
    Murad, Mohammad H.
    Saposnik, Gustavo
    Yousufuddin, Mohammed
    EUROPEAN HEART JOURNAL - DIGITAL HEALTH, 2024, 5 (02): : 109 - 122