Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model

被引:36
|
作者
Watson, Gregory L. [1 ]
Xiong, Di [1 ]
Zhang, Lu [1 ]
Zoller, Joseph A. [1 ]
Shamshoian, John [1 ]
Sundin, Phillip [1 ]
Bufford, Teresa [1 ]
Rimoin, Anne W. [2 ]
Suchard, Marc A. [1 ,3 ,4 ]
Ramirez, Christina M. [1 ]
机构
[1] Univ Calif Los Angeles, Fielding Sch Publ Hlth, Dept Biostat, Los Angeles, CA 90024 USA
[2] Univ Calif Los Angeles, Fielding Sch Publ Hlth, Dept Epidemiol, Los Angeles, CA USA
[3] Univ Calif Los Angeles, David Geffen Sch Med, Dept Computat Med, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, David Geffen Sch Med, Dept Human Genet, Los Angeles, CA 90095 USA
基金
美国国家卫生研究院;
关键词
DISEASE; 2019; COVID-19; CORONAVIRUS COVID-19; PREDICTION; OUTBREAK; CHINA; TRANSMISSION; STRATEGIES; QUARANTINE; EPIDEMICS; SPREAD;
D O I
10.1371/journal.pcbi.1008837
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Author summary COVID-19 models can be roughly classified as mathematical models that simulate disease within a population, including epidemiological compartmental models, or statistical curve-fitting models that fit a function to observed data and extrapolate forward into the future. Bridging this divide, we combine the strengths of curve-fitting statistical models and the structure of epidemiological models, by embedding a Bayesian velocity model and a machine learning algorithm (random forest) into the framework of a compartmental model. Fusing these models together exploits the particular strengths of each to glean as much information as possible from the currently available data. We identify the velocity of log cumulative cases as an excellent target for modeling and extrapolating COVID-19 case trajectories. We empirically evaluate the predictive performance of the model and provide predicted trajectories with credible intervals for cumulative confirmed case count, active confirmed infections and COVID-19 deaths for each of the 50 U.S. states. Combining sophisticated data analytic methods with proven epidemiological models offers an empirically grounded strategy for making realistic predictions and quantifying their uncertainty. These predictions indicate substantial variation in the COVID-19 trajectories of U.S. states. Predictions of COVID-19 case growth and mortality are critical to the decisions of political leaders, businesses, and individuals grappling with the pandemic. This predictive task is challenging due to the novelty of the virus, limited data, and dynamic political and societal responses. We embed a Bayesian time series model and a random forest algorithm within an epidemiological compartmental model for empirically grounded COVID-19 predictions. The Bayesian case model fits a location-specific curve to the velocity (first derivative) of the log transformed cumulative case count, borrowing strength across geographic locations and incorporating prior information to obtain a posterior distribution for case trajectories. The compartmental model uses this distribution and predicts deaths using a random forest algorithm trained on COVID-19 data and population-level characteristics, yielding daily projections and interval estimates for cases and deaths in U.S. states. We evaluated the model by training it on progressively longer periods of the pandemic and computing its predictive accuracy over 21-day forecasts. The substantial variation in predicted trajectories and associated uncertainty between states is illustrated by comparing three unique locations: New York, Colorado, and West Virginia. The sophistication and accuracy of this COVID-19 model offer reliable predictions and uncertainty estimates for the current trajectory of the pandemic in the U.S. and provide a platform for future predictions as shifting political and societal responses alter its course.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] CoBiD-net: a tailored deep learning ensemble model for time series forecasting of covid-19
    Sourabh Shastri
    Kuljeet Singh
    Monu Deswal
    Sachin Kumar
    Vibhakar Mansotra
    [J]. Spatial Information Research, 2022, 30 : 9 - 22
  • [32] CoBiD-net: a tailored deep learning ensemble model for time series forecasting of covid-19
    Shastri, Sourabh
    Singh, Kuljeet
    Deswal, Monu
    Kumar, Sachin
    Mansotra, Vibhakar
    [J]. SPATIAL INFORMATION RESEARCH, 2022, 30 (01) : 9 - 22
  • [33] Machine Learning Model for Computational Tracking and Forecasting the COVID-19 Dynamic Propagation
    dos Santos Gomes, Daiana Caroline
    de Oliveira Serra, Ginalber Luiz
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (03) : 615 - 622
  • [34] DEVELOPMENT OF ENSEMBLE MACHINE LEARNING MODEL TO IMPROVE COVID-19 OUTBREAK FORECASTING
    Alrehaili, Meaad
    Assiri, Fatmah
    [J]. JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2022, 8 (02): : 159 - 169
  • [35] Forecasting the effects of vaccination on the COVID-19 pandemic in Malaysia using SEIRV compartmental models
    Lim, Mei Cheng
    Singh, Sarbhan
    Lai, Chee Herng
    Gill, Balvinder Singh
    Kamarudin, Mohd Kamarulariffin
    Zamri, Ahmed Syahmi Syafiq Md
    Tan, Cia Vei
    Zulkifli, Asrul Anuar
    Nadzri, Mohamad Nadzmi Md
    Ghazali, Nur'ain Mohd
    Ghazali, Sumarni Mohd
    Iderus, Nuur Hafizah Md
    Ahmad, Nur Ar Rabiah Binti
    Suppiah, Jeyanthi
    Tee, Kok Keng
    Aris, Tahir
    Ahmad, Lonny Chen Rong Qi
    [J]. EPIDEMIOLOGY AND HEALTH, 2023, 45 : 1 - 9
  • [36] A fractional-order compartmental model for the spread of the COVID-19 pandemic
    Biala, T. A.
    Khaliq, A. Q. M.
    [J]. COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2021, 98
  • [37] COVID-19: Forecasting confirmed cases and deaths with a simple time series model
    Petropoulos, Fotios
    Makridakis, Spyros
    Stylianou, Neophytos
    [J]. INTERNATIONAL JOURNAL OF FORECASTING, 2022, 38 (02) : 439 - 452
  • [38] Gecko: A time-series model for COVID-19 hospital admission forecasting
    Panaggio, Mark J.
    Rainwater-Lovett, Kaitlin
    Nicholas, Paul J.
    Fang, Mike
    Bang, Hyunseung
    Freeman, Jeffrey
    Peterson, Elisha
    Imbriale, Samuel
    [J]. EPIDEMICS, 2022, 39
  • [39] Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model
    Elijah A. Adeoye
    Yelena Rozenfeld
    Jennifer Beam
    Karen Boudreau
    Emily J. Cox
    James M. Scanlan
    [J]. Medical & Biological Engineering & Computing, 2022, 60 : 2039 - 2049
  • [40] Who was at risk for COVID-19 late in the US pandemic? Insights from a population health machine learning model
    Adeoye, Elijah A.
    Rozenfeld, Yelena
    Beam, Jennifer
    Boudreau, Karen
    Cox, Emily J.
    Scanlan, James M.
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (07) : 2039 - 2049