Development of An Individualized Risk Prediction Model for COVID-19 Using Electronic Health Record Data

被引:7
|
作者
Mamidi, Tarun Karthik Kumar [1 ,2 ]
Tran-Nguyen, Thi K. [3 ]
Melvin, Ryan L. [4 ]
Worthey, Elizabeth A. [1 ,2 ,3 ]
机构
[1] Univ Alabama Birmingham, Sch Med, Dept Pediat, Ctr Computat Genom & Data Sci, Birmingham, AL 35294 USA
[2] Univ Alabama Birmingham, Sch Med, Dept Pathol, Ctr Computat Genom & Data Sci, Birmingham, AL 35294 USA
[3] Univ Alabama Birmingham, Hugh Kaul Precis Med Inst, Birmingham, AL 35294 USA
[4] Univ Alabama Birmingham, Dept Anesthesiol & Perioperat Med, Birmingham, AL USA
来源
FRONTIERS IN BIG DATA | 2021年 / 4卷
基金
美国国家科学基金会;
关键词
COVID-19; electronic health record; risk prediction; ICD-10; credit scorecard model; MORTALITY; DISEASE; SELECTION; SEVERITY; OUTCOMES; TOOL;
D O I
10.3389/fdata.2021.675882
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Developing an accurate and interpretable model to predict an individual's risk for Coronavirus Disease 2019 (COVID-19) is a critical step to efficiently triage testing and other scarce preventative resources. To aid in this effort, we have developed an interpretable risk calculator that utilized de-identified electronic health records (EHR) from the University of Alabama at Birmingham Informatics for Integrating Biology and the Bedside (UAB-i2b2) COVID-19 repository under the U-BRITE framework. The generated risk scores are analogous to commonly used credit scores where higher scores indicate higher risks for COVID-19 infection. By design, these risk scores can easily be calculated in spreadsheets or even with pen and paper. To predict risk, we implemented a Credit Scorecard modeling approach on longitudinal EHR data from 7,262 patients enrolled in the UAB Health System who were evaluated and/or tested for COVID-19 between January and June 2020. In this cohort, 912 patients were positive for COVID-19. Our workflow considered the timing of symptoms and medical conditions and tested the effects by applying different variable selection techniques such as LASSO and Elastic-Net. Within the two weeks before a COVID-19 diagnosis, the most predictive features were respiratory symptoms such as cough, abnormalities of breathing, pain in the throat and chest as well as other chronic conditions including nicotine dependence and major depressive disorder. When extending the timeframe to include all medical conditions across all time, our models also uncovered several chronic conditions impacting the respiratory, cardiovascular, central nervous and urinary organ systems. The whole pipeline of data processing, risk modeling and web-based risk calculator can be applied to any EHR data following the OMOP common data format. The results can be employed to generate questionnaires to estimate COVID-19 risk for screening in building entries or to optimize hospital resources.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Individualized prediction of COVID-19 adverse outcomes with MLHO
    Hossein Estiri
    Zachary H. Strasser
    Shawn N. Murphy
    [J]. Scientific Reports, 11
  • [32] Individualized prediction of COVID-19 adverse outcomes with MLHO
    Estiri, Hossein
    Strasser, Zachary H.
    Murphy, Shawn N.
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [33] EXAMINING THE ASSOCIATION OF AIR QUALITY ON COVID-19 PATIENT OUTCOMES USING AREA LEVEL DATA FROM THE ELECTRONIC HEALTH RECORD
    Boncyk, Christina S.
    McCarthy, Karen Y.
    Butler, Pamela
    Flores-Conner, Nicholas
    Freundlich, Robert E.
    [J]. ANESTHESIA AND ANALGESIA, 2023, 136 : 51 - 51
  • [34] An ensemble prediction model for COVID-19 mortality risk
    Li, Jie
    Li, Xin
    Hutchinson, John
    Asad, Mohammad
    Liu, Yinghui
    Wang, Yadong
    Wang, Edwin
    [J]. BIOLOGY METHODS & PROTOCOLS, 2022, 7 (01):
  • [35] Individualized melanoma risk prediction using machine learning with electronic health records
    Wan, G.
    Nguyen, N.
    Yan, B.
    Khattab, S.
    Estiri, H.
    Semenov, Y.
    [J]. JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2024, 144 (08) : S35 - S35
  • [36] Insights from electronic health record data to improve mental health service delivery during the COVID-19 pandemic
    Patel, R.
    Irving, J.
    Brinn, A.
    Broadbent, M.
    Shetty, H.
    Pritchard, M.
    Downs, J.
    Stewart, R.
    Harland, R.
    [J]. EUROPEAN PSYCHIATRY, 2021, 64 : S38 - S39
  • [37] Risk of New Retinal Vascular Occlusion After Messenger RNA COVID-19 Vaccination Within Aggregated Electronic Health Record Data
    Hsu, Alan Y.
    Lin, Chun-Ju
    Wei, James Cheng-Chung
    [J]. JAMA OPHTHALMOLOGY, 2023, 141 (12) : 1175 - 1176
  • [38] COVID-19: Short term prediction model using daily incidence data
    Zhao, Hongwei
    Merchant, Naveed N.
    McNulty, Alyssa
    Radcliff, Tiffany A.
    Cote, Murray J.
    Fischer, Rebecca S. B.
    Sang, Huiyan
    Ory, Marcia G.
    [J]. PLOS ONE, 2021, 16 (04):
  • [39] There are no shortcuts in the development and validation of a COVID-19 prediction model
    Collins, Gary S.
    Ma, Jie
    Dhiman, Paula
    [J]. TRANSBOUNDARY AND EMERGING DISEASES, 2021, 68 (02) : 210 - 211
  • [40] Using Electronic Health Record Data to Develop and Validate a Prediction Model for Adverse Outcomes in the Wards
    Churpek, Matthew M.
    Yuen, Trevor C.
    Park, Seo Young
    Gibbons, Robert
    Edelson, Dana P.
    [J]. CRITICAL CARE MEDICINE, 2014, 42 (04) : 841 - 848