Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model

被引:36
|
作者
Watson, Gregory L. [1 ]
Xiong, Di [1 ]
Zhang, Lu [1 ]
Zoller, Joseph A. [1 ]
Shamshoian, John [1 ]
Sundin, Phillip [1 ]
Bufford, Teresa [1 ]
Rimoin, Anne W. [2 ]
Suchard, Marc A. [1 ,3 ,4 ]
Ramirez, Christina M. [1 ]
机构
[1] Univ Calif Los Angeles, Fielding Sch Publ Hlth, Dept Biostat, Los Angeles, CA 90024 USA
[2] Univ Calif Los Angeles, Fielding Sch Publ Hlth, Dept Epidemiol, Los Angeles, CA USA
[3] Univ Calif Los Angeles, David Geffen Sch Med, Dept Computat Med, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, David Geffen Sch Med, Dept Human Genet, Los Angeles, CA 90095 USA
基金
美国国家卫生研究院;
关键词
DISEASE; 2019; COVID-19; CORONAVIRUS COVID-19; PREDICTION; OUTBREAK; CHINA; TRANSMISSION; STRATEGIES; QUARANTINE; EPIDEMICS; SPREAD;
D O I
10.1371/journal.pcbi.1008837
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Author summary COVID-19 models can be roughly classified as mathematical models that simulate disease within a population, including epidemiological compartmental models, or statistical curve-fitting models that fit a function to observed data and extrapolate forward into the future. Bridging this divide, we combine the strengths of curve-fitting statistical models and the structure of epidemiological models, by embedding a Bayesian velocity model and a machine learning algorithm (random forest) into the framework of a compartmental model. Fusing these models together exploits the particular strengths of each to glean as much information as possible from the currently available data. We identify the velocity of log cumulative cases as an excellent target for modeling and extrapolating COVID-19 case trajectories. We empirically evaluate the predictive performance of the model and provide predicted trajectories with credible intervals for cumulative confirmed case count, active confirmed infections and COVID-19 deaths for each of the 50 U.S. states. Combining sophisticated data analytic methods with proven epidemiological models offers an empirically grounded strategy for making realistic predictions and quantifying their uncertainty. These predictions indicate substantial variation in the COVID-19 trajectories of U.S. states. Predictions of COVID-19 case growth and mortality are critical to the decisions of political leaders, businesses, and individuals grappling with the pandemic. This predictive task is challenging due to the novelty of the virus, limited data, and dynamic political and societal responses. We embed a Bayesian time series model and a random forest algorithm within an epidemiological compartmental model for empirically grounded COVID-19 predictions. The Bayesian case model fits a location-specific curve to the velocity (first derivative) of the log transformed cumulative case count, borrowing strength across geographic locations and incorporating prior information to obtain a posterior distribution for case trajectories. The compartmental model uses this distribution and predicts deaths using a random forest algorithm trained on COVID-19 data and population-level characteristics, yielding daily projections and interval estimates for cases and deaths in U.S. states. We evaluated the model by training it on progressively longer periods of the pandemic and computing its predictive accuracy over 21-day forecasts. The substantial variation in predicted trajectories and associated uncertainty between states is illustrated by comparing three unique locations: New York, Colorado, and West Virginia. The sophistication and accuracy of this COVID-19 model offer reliable predictions and uncertainty estimates for the current trajectory of the pandemic in the U.S. and provide a platform for future predictions as shifting political and societal responses alter its course.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A machine learning forecasting model for COVID-19 pandemic in India
    Sujath, R.
    Chatterjee, Jyotir Moy
    Hassanien, Aboul Ella
    [J]. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2020, 34 (07) : 959 - 972
  • [2] A machine learning forecasting model for COVID-19 pandemic in India
    R. Sujath
    Jyotir Moy Chatterjee
    Aboul Ella Hassanien
    [J]. Stochastic Environmental Research and Risk Assessment, 2020, 34 : 959 - 972
  • [3] Correction to: A machine learning forecasting model for COVID-19 pandemic in India
    R. Sujatha
    Jyotir Moy Chatterjee
    Aboul Ella Hassanien
    [J]. Stochastic Environmental Research and Risk Assessment, 2020, 34 : 1681 - 1681
  • [4] Machine Learning and Deep Learning Based Time Series Prediction and Forecasting of Ten Nations’ COVID-19 Pandemic
    Kumar Y.
    Koul A.
    Kaur S.
    Hu Y.-C.
    [J]. SN Computer Science, 4 (1)
  • [5] Time Series Forecasting of US COVID-19 Transmission
    Ding, Yongmei
    Huang, Rui
    Shao, Ningyi
    [J]. ALTERNATIVE THERAPIES IN HEALTH AND MEDICINE, 2021, 27 : 4 - 11
  • [6] Application of machine learning time series analysis for prediction COVID-19 pandemic
    Chaurasia V.
    Pal S.
    [J]. Research on Biomedical Engineering, 2022, 38 (01) : 35 - 47
  • [7] The FBProphet forecasting model to evaluate the spread of COVID-19 pandemic: A machine learning approach
    Babu, Md. Ashraful
    Ahmmed, Md. Mortuza
    Abu Helal, Md.
    Hoque, M. A.
    [J]. JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2022, 25 (07) : 2073 - 2082
  • [8] A Machine Learning Univariate Time series Model for Forecasting COVID-19 Confirmed Cases: A Pilot Study in Botswana
    Mphale, Ofaletse
    Okike, Ezekiel U.
    Rafifing, Neo
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (01): : 225 - 233
  • [9] Epidemiological Forecasting with Model Reduction of Compartmental Models. Application to the COVID-19 Pandemic
    Bakhta, Athmane
    Boiveau, Thomas
    Maday, Yvon
    Mula, Olga
    [J]. BIOLOGY-BASEL, 2021, 10 (01): : 1 - 42
  • [10] COVID-19 Pandemic Prediction and Forecasting Using Machine Learning Classifiers
    Sultana, Jabeen
    Singha, Anjani Kumar
    Siddiqui, Shams Tabrez
    Nagalaxmi, Guthikonda
    Sriram, Anil Kumar
    Pathak, Nitish
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 32 (02): : 1007 - 1024