Identification and Prediction of Clinical Phenotypes in Hospitalized Patients With COVID-19: Machine Learning From Medical Records

Authors
Velez, Tom [1]
Wang, Tony [2]
Garibaldi, Brian [3]
Singman, Eric [4, 5, 7]
Koutroulis, Ioannis [6]
Affiliations
[1] Comp Technol Associates, Cardiff By The Sea, CA USA
[2] Imedacs, Ann Arbor, MI USA
[3] Johns Hopkins Univ, Sch Med, Div Pulm & Crit Care Med, Biocontainment Unit, Baltimore, MD USA
[4] Univ Maryland, Sch Med, Dept Ophthalmol & Visual Sci, Baltimore, MD USA
[5] Univ Maryland, Sch Med, Dept Neurol, Baltimore, MD USA
[6] Childrens Natl Hosp, Div Emergency Med, Washington, DC USA
[7] Univ Maryland, Sch Med, Dept Ophthalmol & Visual Sci, 419 Redwood St,Suite 470, Baltimore, MD 21209 USA
Keywords
big data; COVID; respiratory distress; critical care; early warning; electronic medical record; machine learning; clinical phenotypes; pathogenesis; infection; immune response; treatment; biomarkers; training; sepsis; mortality; utility; phenotype; support tool; RESPIRATORY-DISTRESS-SYNDROME; LATENT CLASS ANALYSIS; UNITED-STATES; SUBPHENOTYPES; VALIDATION; STABILITY; MORTALITY; CLUSTERS; MODEL; MANAGEMENT;
DOI
10.2196/46807
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Abstract
Background: There is significant heterogeneity in disease progression among hospitalized patients with COVID-19. The pathogenesis of SARS-CoV-2 infection is attributed to a complex interplay between virus and host immune response that in some patients unpredictably and rapidly leads to "hyperinflammation" associated with increased risk of mortality. The early identification of patients at risk of progression to hyperinflammation may help inform timely therapeutic decisions and lead to improved outcomes.

Objective: The primary objective of this study was to use machine learning to reproducibly identify specific risk-stratifying clinical phenotypes across hospitalized patients with COVID-19 and compare treatment response characteristics and outcomes. A secondary objective was to derive a predictive phenotype classification model using routinely available early encounter data that may be useful in informing optimal COVID-19 bedside clinical management.

Methods: This was a retrospective analysis of electronic health record data of adult patients (N=4379) who were admitted to a Johns Hopkins Health System hospital for COVID-19 treatment from 2020 to 2021. Phenotypes were identified by clustering 38 routine clinical observations recorded during inpatient care. To examine the reproducibility and validity of the derived phenotypes, patient data were randomly divided into 2 cohorts, and clustering analysis was performed independently for each cohort. A predictive phenotype classifier using the gradient-boosting machine method was derived using routine clinical observations recorded during the first 6 hours following admission.

Results: A total of 2 phenotypes (designated as phenotype 1 and phenotype 2) were identified in patients admitted for COVID-19 in both the training and validation cohorts with similar distributions of features, correlations with biomarkers, treatments, comorbidities, and outcomes. In both the training and validation cohorts, phenotype-2 patients were older; had elevated markers of inflammation; and were at an increased risk of requiring intensive care unit-level care, developing sepsis, and mortality compared with phenotype-1 patients. The gradient-boosting machine phenotype prediction model yielded an area under the curve of 0.89 and a positive predictive value of 0.83.

Conclusions: Using machine learning clustering, we identified and internally validated 2 clinical COVID-19 phenotypes with distinct treatment or response characteristics consistent with similar 2-phenotype models derived from other hospitalized populations with COVID-19, supporting the reliability and generalizability of these findings. COVID-19 phenotypes can be accurately identified using machine learning models based on readily available early encounter clinical data. A phenotype prediction model based on early encounter data may be clinically useful for timely bedside risk stratification and treatment personalization.
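
The split-cohort reproducibility check described in the Methods (divide patients at random into 2 cohorts, cluster each independently, then compare the results) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not name the clustering algorithm, so plain k-means with k=2 is swapped in; the data are synthetic, with 2 hypothetical standardized features rather than the 38 clinical observations; and the function names are invented here. It is not the authors' pipeline.

```python
import random


def kmeans_two_clusters(rows, max_iters=50):
    """Lloyd's k-means with k=2; returns centroids sorted by first feature."""
    # Deterministic start (lexicographically smallest and largest rows)
    # so that this sketch gives the same result on every run.
    centroids = [list(min(rows)), list(max(rows))]
    for _ in range(max_iters):
        groups = [[], []]
        for row in rows:
            # Squared Euclidean distance from this row to each centroid.
            dists = [sum((a - b) ** 2 for a, b in zip(row, c)) for c in centroids]
            groups[0 if dists[0] <= dists[1] else 1].append(row)
        # Recompute each centroid as the mean of its assigned rows.
        updated = [
            [sum(col) / len(group) for col in zip(*group)] if group else centroid
            for group, centroid in zip(groups, centroids)
        ]
        if updated == centroids:
            break
        centroids = updated
    return sorted(centroids)


def make_synthetic_patients(n_per_group, rng):
    """Two made-up groups over two hypothetical standardized features."""
    low = [[rng.gauss(0.0, 0.5), rng.gauss(0.0, 0.5)] for _ in range(n_per_group)]
    high = [[rng.gauss(3.0, 0.5), rng.gauss(3.0, 0.5)] for _ in range(n_per_group)]
    return low + high


rng = random.Random(42)
patients = make_synthetic_patients(200, rng)

# Randomly divide the patients into 2 cohorts of equal size.
rng.shuffle(patients)
half = len(patients) // 2
cohort_a, cohort_b = patients[:half], patients[half:]

# Cluster each cohort independently, as in the split-cohort design.
centroids_a = kmeans_two_clusters(cohort_a)
centroids_b = kmeans_two_clusters(cohort_b)

# Largest per-feature gap between matched centroids of the two cohorts.
max_gap = max(
    abs(x - y)
    for ca, cb in zip(centroids_a, centroids_b)
    for x, y in zip(ca, cb)
)
print("cohort A centroids:", centroids_a)
print("cohort B centroids:", centroids_b)
print("largest centroid gap:", max_gap)
```

A small gap between matched centroids suggests that both cohorts recover the same 2 groups. On real records, a comparison of feature distributions and outcomes per cluster, as the abstract reports, would be more informative than centroids alone.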
Pages: 30