Development of An Individualized Risk Prediction Model for COVID-19 Using Electronic Health Record Data

被引:7
|
作者
Mamidi, Tarun Karthik Kumar [1 ,2 ]
Tran-Nguyen, Thi K. [3 ]
Melvin, Ryan L. [4 ]
Worthey, Elizabeth A. [1 ,2 ,3 ]
机构
[1] Univ Alabama Birmingham, Sch Med, Dept Pediat, Ctr Computat Genom & Data Sci, Birmingham, AL 35294 USA
[2] Univ Alabama Birmingham, Sch Med, Dept Pathol, Ctr Computat Genom & Data Sci, Birmingham, AL 35294 USA
[3] Univ Alabama Birmingham, Hugh Kaul Precis Med Inst, Birmingham, AL 35294 USA
[4] Univ Alabama Birmingham, Dept Anesthesiol & Perioperat Med, Birmingham, AL USA
来源
FRONTIERS IN BIG DATA | 2021年 / 4卷
基金
美国国家科学基金会;
关键词
COVID-19; electronic health record; risk prediction; ICD-10; credit scorecard model; MORTALITY; DISEASE; SELECTION; SEVERITY; OUTCOMES; TOOL;
D O I
10.3389/fdata.2021.675882
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Developing an accurate and interpretable model to predict an individual's risk for Coronavirus Disease 2019 (COVID-19) is a critical step to efficiently triage testing and other scarce preventative resources. To aid in this effort, we have developed an interpretable risk calculator that utilized de-identified electronic health records (EHR) from the University of Alabama at Birmingham Informatics for Integrating Biology and the Bedside (UAB-i2b2) COVID-19 repository under the U-BRITE framework. The generated risk scores are analogous to commonly used credit scores where higher scores indicate higher risks for COVID-19 infection. By design, these risk scores can easily be calculated in spreadsheets or even with pen and paper. To predict risk, we implemented a Credit Scorecard modeling approach on longitudinal EHR data from 7,262 patients enrolled in the UAB Health System who were evaluated and/or tested for COVID-19 between January and June 2020. In this cohort, 912 patients were positive for COVID-19. Our workflow considered the timing of symptoms and medical conditions and tested the effects by applying different variable selection techniques such as LASSO and Elastic-Net. Within the two weeks before a COVID-19 diagnosis, the most predictive features were respiratory symptoms such as cough, abnormalities of breathing, pain in the throat and chest as well as other chronic conditions including nicotine dependence and major depressive disorder. When extending the timeframe to include all medical conditions across all time, our models also uncovered several chronic conditions impacting the respiratory, cardiovascular, central nervous and urinary organ systems. The whole pipeline of data processing, risk modeling and web-based risk calculator can be applied to any EHR data following the OMOP common data format. The results can be employed to generate questionnaires to estimate COVID-19 risk for screening in building entries or to optimize hospital resources.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] An ordinal severity scale for COVID-19 retrospective studies using Electronic Health Record data
    Khodaverdi, Maryam
    Price, Bradley S.
    Porterfield, J. Zachary
    Bunnell, H. Timothy
    Vest, Michael T.
    Anzalone, Alfred Jerrod
    Harper, Jeremy
    Kimble, Wes D.
    Moradi, Hamidreza
    Hendricks, Brian
    Santangelo, Susan L.
    Hodder, Sally L.
    [J]. JAMIA OPEN, 2022, 5 (03)
  • [2] Development and validation of a model for individualized prediction of hospitalization risk in 4,536 patients with COVID-19
    Jehi, Lara
    Ji, Xinge
    Milinovich, Alex
    Erzurum, Serpil
    Merlino, Amy
    Gordon, Steve
    Young, James B.
    Kattan, Michael W.
    [J]. PLOS ONE, 2020, 15 (08):
  • [3] Longitudinal validation of an electronic health record delirium prediction model applied at admission in COVID-19 patients
    Castro, Victor M.
    Hart, Kamber L.
    Sacks, Chana A.
    Murphy, Shawn N.
    Perlis, Roy H.
    McCoy, Thomas H., Jr.
    [J]. GENERAL HOSPITAL PSYCHIATRY, 2022, 74 : 9 - 17
  • [4] Development and validation of a dynamic inpatient risk prediction model for clinically significant hypokalemia using electronic health record data
    Li, Yan
    Staley, Benjamin
    Henriksen, Carl
    Xu, Dandan
    Lipori, Gloria
    Winterstein, Almut G.
    [J]. AMERICAN JOURNAL OF HEALTH-SYSTEM PHARMACY, 2019, 76 (05) : 301 - 311
  • [5] Development and validation of an asthma exacerbation prediction model using electronic health record (EHR) data
    Martin, Alfred
    Bauer, Victoria
    Datta, Avisek
    Masi, Christopher
    Mosnaim, Giselle
    Solomonides, Anthony
    Rao, Goutham
    [J]. JOURNAL OF ASTHMA, 2020, 57 (12) : 1339 - 1346
  • [6] Visualization of Covid-19 pandemic influence on healthcare routines in dermatology using electronic health record data
    Wolf, J. Ryan
    Zhang, L.
    Xie, Y.
    Pentland, A.
    Pentland, B. T.
    [J]. JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2023, 143 (05) : S119 - S119
  • [7] Predicting critical state after COVID-19 diagnosis: model development using a large US electronic health record dataset
    Rinderknecht, Mike D.
    Klopfenstein, Yannick
    [J]. NPJ DIGITAL MEDICINE, 2021, 4 (01)
  • [8] Electronic health record data for assessing risk of hospitalization for COVID-19: Methodological considerations applied to multiple sclerosis
    Dillon, Paul
    Siadimas, Athanasios
    Roumpanis, Spyros
    Fajardo, Otto
    Fitovski, Kocho
    Jessop, Nikki
    Whitley, Louise
    Rouzic, Erwan Muros -Le
    [J]. MULTIPLE SCLEROSIS AND RELATED DISORDERS, 2023, 71
  • [9] Predicting critical state after COVID-19 diagnosis: model development using a large US electronic health record dataset
    Mike D. Rinderknecht
    Yannick Klopfenstein
    [J]. npj Digital Medicine, 4
  • [10] Surveillance for probable COVID-19 using structured data in the electronic medical record
    Burke, Patrick C.
    Shirley, Rachel Benish
    Faiman, Matthew
    Boose, Eric W.
    Jones, Robert W., Jr.
    Merlino, Amy
    Gordon, Steven M.
    Fraser, Thomas G.
    [J]. INFECTION CONTROL AND HOSPITAL EPIDEMIOLOGY, 2021, 42 (06): : 781 - 783