Development of An Individualized Risk Prediction Model for COVID-19 Using Electronic Health Record Data

被引:7
|
作者
Mamidi, Tarun Karthik Kumar [1 ,2 ]
Tran-Nguyen, Thi K. [3 ]
Melvin, Ryan L. [4 ]
Worthey, Elizabeth A. [1 ,2 ,3 ]
机构
[1] Univ Alabama Birmingham, Sch Med, Dept Pediat, Ctr Computat Genom & Data Sci, Birmingham, AL 35294 USA
[2] Univ Alabama Birmingham, Sch Med, Dept Pathol, Ctr Computat Genom & Data Sci, Birmingham, AL 35294 USA
[3] Univ Alabama Birmingham, Hugh Kaul Precis Med Inst, Birmingham, AL 35294 USA
[4] Univ Alabama Birmingham, Dept Anesthesiol & Perioperat Med, Birmingham, AL USA
来源
FRONTIERS IN BIG DATA | 2021年 / 4卷
基金
美国国家科学基金会;
关键词
COVID-19; electronic health record; risk prediction; ICD-10; credit scorecard model; MORTALITY; DISEASE; SELECTION; SEVERITY; OUTCOMES; TOOL;
D O I
10.3389/fdata.2021.675882
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Developing an accurate and interpretable model to predict an individual's risk for Coronavirus Disease 2019 (COVID-19) is a critical step to efficiently triage testing and other scarce preventative resources. To aid in this effort, we have developed an interpretable risk calculator that utilized de-identified electronic health records (EHR) from the University of Alabama at Birmingham Informatics for Integrating Biology and the Bedside (UAB-i2b2) COVID-19 repository under the U-BRITE framework. The generated risk scores are analogous to commonly used credit scores where higher scores indicate higher risks for COVID-19 infection. By design, these risk scores can easily be calculated in spreadsheets or even with pen and paper. To predict risk, we implemented a Credit Scorecard modeling approach on longitudinal EHR data from 7,262 patients enrolled in the UAB Health System who were evaluated and/or tested for COVID-19 between January and June 2020. In this cohort, 912 patients were positive for COVID-19. Our workflow considered the timing of symptoms and medical conditions and tested the effects by applying different variable selection techniques such as LASSO and Elastic-Net. Within the two weeks before a COVID-19 diagnosis, the most predictive features were respiratory symptoms such as cough, abnormalities of breathing, pain in the throat and chest as well as other chronic conditions including nicotine dependence and major depressive disorder. When extending the timeframe to include all medical conditions across all time, our models also uncovered several chronic conditions impacting the respiratory, cardiovascular, central nervous and urinary organ systems. The whole pipeline of data processing, risk modeling and web-based risk calculator can be applied to any EHR data following the OMOP common data format. The results can be employed to generate questionnaires to estimate COVID-19 risk for screening in building entries or to optimize hospital resources.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] COVID-19 Prediction Classifier Model Using Hybrid Algorithms in Data Mining
    Nikooghadam, Morteza
    Ghazikhani, Adel
    Saeedi, Mohammad
    [J]. INTERNATIONAL JOURNAL OF PEDIATRICS-MASHHAD, 2021, 9 (01): : 12723 - 12737
  • [42] Electronic health record derived-impact of COVID-19 on myasthenia gravis
    Roy, Bhaskar
    Kovvuru, Sukanthi
    Nalleballe, Krishna
    Onteddu, Sanjeeva Reddy
    Nowak, Richard J.
    [J]. JOURNAL OF THE NEUROLOGICAL SCIENCES, 2021, 423
  • [43] Development and validation of a simple risk scoring system for a COVID-19 diagnostic prediction model
    Guclu, Ozge Aydin
    Ursavas, Ahmet
    Ocakoglu, Gokhan
    Demirdogen, Ezgi
    Ozturk, Nilufer Aylin Acet
    Topcu, Dilara Omer
    Terzi, Orkun Eray
    Onal, Ugur
    Dilektasli, Asli Gorek
    Saglik, Imran
    Coskun, Funda
    Ediger, Dane
    Uzaslan, Esra
    Akalin, Halis
    Karadag, Mehmet
    [J]. TUBERKULOZ VE TORAKS-TUBERCULOSIS AND THORAX, 2023, 71 (04): : 325 - 334
  • [44] Development and validation of a clinical prediction model to estimate the risk of critical patients with COVID-19
    Chen, Wenyu
    Yao, Ming
    Hu, Lin
    Zhang, Ye
    Zhou, Qinghe
    Ren, Hongwei
    Sun, Yanbao
    Zhang, Ming
    Xu, Yufen
    [J]. JOURNAL OF MEDICAL VIROLOGY, 2022, 94 (03) : 1104 - 1114
  • [45] Development and Validation of a Web- Based Severe COVID-19 Risk Prediction Model
    Woo, Sang H.
    Rios-Diaz, Arturo J.
    Kubey, Alan A.
    Cheney-Peters, Dianna R.
    Ackermann, Lily L.
    Chalikonda, Divya M.
    Venkataraman, Chantel M.
    Riley, Joshua M.
    Baram, Michael
    [J]. AMERICAN JOURNAL OF THE MEDICAL SCIENCES, 2021, 362 (04): : 355 - 362
  • [46] Developing a COVID-19 WHO Clinical Progression Scale inpatient database from electronic health record data
    Ramaswamy, Priya
    Gong, Jen J.
    Saleh, Sameh N.
    McDonald, Samuel A.
    Blumberg, Seth
    Medford, Richard J.
    Liu, Xinran
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (07) : 1279 - 1285
  • [47] Effects of Antidepressants on COVID-19 Outcomes: Retrospective Study on Large-Scale Electronic Health Record Data
    Rahman, Mahmudur
    Mahi, Atqiya Munawara
    Melamed, Rachel
    Alam, Mohammad Arif Ul
    [J]. INTERACTIVE JOURNAL OF MEDICAL RESEARCH, 2023, 12
  • [48] Real-time prediction of COVID-19 related mortality using electronic health records
    Schwab, Patrick
    Mehrjou, Arash
    Parbhoo, Sonali
    Celi, Leo Anthony
    Hetzel, Jurgen
    Hofer, Markus
    Scholkopf, Bernhard
    Bauer, Stefan
    [J]. NATURE COMMUNICATIONS, 2021, 12 (01)
  • [49] Real-time prediction of COVID-19 related mortality using electronic health records
    Patrick Schwab
    Arash Mehrjou
    Sonali Parbhoo
    Leo Anthony Celi
    Jürgen Hetzel
    Markus Hofer
    Bernhard Schölkopf
    Stefan Bauer
    [J]. Nature Communications, 12
  • [50] Prediction of Recurrent Atherosclerotic Cardiovascular Disease Risk Using Machine Learning and Electronic Health Record Data
    Sarraju, Ashish
    Ward, Andrew
    Chung, Sukyung
    Li, Jiang
    Scheinker, David
    Rodriguez, Fatima
    [J]. CIRCULATION, 2020, 142