Application of one-, three-, and seven-day forecasts during early onset on the COVID-19 epidemic dataset using moving average, autoregressive, autoregressive moving average, autoregressive integrated moving average, and naive forecasting methods

被引:7
|
作者
Lynch, Christopher J. [1 ]
Gore, Ross [1 ]
机构
[1] Old Dominion Univ, Virginia Modeling Anal & Simulat Ctr VMASC, Norfolk, VA 23529 USA
来源
DATA IN BRIEF | 2021年 / 35卷
关键词
Coronavirus COVID-19; Infectious diseases; Epidemic modeling; ARIMA(p; d; q); model; ARMA model; Holt-winters exponential smoothing model; Statistical analysis; Short-range time series forecasting; TIME-SERIES;
D O I
10.1016/j.dib.2021.106759
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The coronavirus disease 2019 (COVID-19) spread rapidly across the world since its appearance in December 2019. This data set creates one-, three-, and seven-day forecasts of the COVID-19 pandemic's cumulative case counts at the county, health district, and state geographic levels for the state of Virginia. Forecasts are created over the first 46 days of reported COVID-19 cases using the cumulative case count data provided by The New York Times as of April 22, 2020. From this historical data, one-, three-, seven, and all-days prior to the forecast start date are used to generate the forecasts. Forecasts are created using: (1) a Naive approach; (2) Holt-Winters exponential smoothing (HW); (3) growth rate (Growth); (4) moving average (MA); (5) autoregressive (AR); (6) autoregressive moving average (ARMA); and (7) autoregressive integrated moving average (ARIMA). Median Absolute Error (MdAE) and Median Absolute Percentage Error (MdAPE) metrics are created with each forecast to evaluate the forecast with respect to existing historical data. These error metrics are aggregated to provide a means for assessing which combination of forecast method, forecast length, and lookback length are best fits, based on lowest aggregated error at each geographic level. The data set is comprised of an R-Project file, four R source code files, all 1,329,404 generated short-range forecasts, MdAE and MdAPE error metric data for each forecast, copies of the input files, and the generated comparison tables. All code and data files are provided to provide transparency and facilitate replicability and reproducibility. This package opens directly in RStudio through the R Project file. The R Project file removes the need to set path locations for the folders contained within the data set to simplify setup requirements. This data set provides two avenues for reproducing results: 1) Use the provided code to generate the forecasts from scratch and then run the analyses; or 2) Load the saved forecast data and run the analyses on the stored data. Code annotations provide the instructions needed to accomplish both routes. This data can be used to generate the same set of forecasts and error metrics for any US state by altering the state parameter within the source code. Users can also generate health district forecasts for any other state, by providing a file which maps each county within a state to its respective health-district. The source code can be connected to the most up-to-date version of The New York Times COVID-19 dataset allows for the generation of forecasts up to the most recently reported data to facilitate near real-time forecasting. (C) 2021 The Authors. Published by Elsevier Inc.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] FORECASTING COVID-19 USING AUTOREGRESSIVE INTEGRATED MOVING AVERAGE MODEL
    Deepa, B.
    Jeenmarseline, K. S.
    [J]. INTERNATIONAL JOURNAL OF LIFE SCIENCE AND PHARMA RESEARCH, 2022, 12 : 108 - 114
  • [2] Visibility Forecasting Using Autoregressive Integrated Moving Average (ARIMA) Models
    Salman, Afan Galih
    Kanigoro, Bayu
    [J]. 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 252 - 259
  • [3] Load Forecasting using Autoregressive Integrated Moving Average and Artificial Neural Network
    Velasco, Lemuel Clark P.
    Polestico, Daisy Lou L.
    Macasieb, Gary Paolo O.
    Reyes, Michael Bryan V.
    Vasquez, Felicisimo B., Jr.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (07) : 23 - 29
  • [4] Global Forecasting Confirmed and Fatal Cases of COVID-19 Outbreak Using Autoregressive Integrated Moving Average Model
    Dansana, Debabrata
    Kumar, Raghvendra
    Das Adhikari, Janmejoy
    Mohapatra, Mans
    Sharma, Rohit
    Priyadarshini, Ishaani
    Le, Dac-Nhuong
    [J]. FRONTIERS IN PUBLIC HEALTH, 2020, 8
  • [5] Forecasting Indian infant mortality rate: An application of autoregressive integrated moving average model
    Mishra, Amit K.
    Sahanaa, Chandar
    Manikandan, Mani
    [J]. JOURNAL OF FAMILY AND COMMUNITY MEDICINE, 2019, 26 (02): : 123 - 126
  • [6] Forecasting the influx of crime cases using seasonal autoregressive integrated moving average model
    Redoblo, Cristine, V
    Redoblo, Jose Leo G.
    Salmingo, Rene A.
    Padilla, Charwin M.
    Arroyo, Jan Carlo T.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2023, 10 (08): : 158 - 165
  • [7] Hotspots Forecasting Using Autoregressive Integrated Moving Average (ARIMA) for Detecting Forest Fires
    Slavia, Athaya Putri
    Sutoyo, Edi
    Witarsyah, Deden
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND INTELLIGENCE SYSTEM (IOTAIS), 2019, : 92 - 97
  • [8] Predicting the failure of railway point machines by using Autoregressive Integrated Moving Average and Autoregressive-Kalman methods
    Abbasnejad, Sahand
    Mirabadi, Ahmad
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART F-JOURNAL OF RAIL AND RAPID TRANSIT, 2018, 232 (06) : 1790 - 1799
  • [9] The Validity of Autoregressive Integrated Moving Average Approach to Forecast the Spread of COVID-19 Pandemic in Africa
    Legese Feyisa, Habtamu
    Tilahun Tefera, Frezer
    [J]. DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2022, 2022
  • [10] Comparative analysis of Gated Recurrent Units (GRU), long Short-Term memory (LSTM) cells, autoregressive Integrated moving average (ARIMA), seasonal autoregressive Integrated moving average (SARIMA) for forecasting COVID-19 trends
    ArunKumar, K. E.
    Kalaga, Dinesh, V
    Kumar, Ch. Mohan Sai M.
    Kawaji, Masahiro
    Brenza, Timothy M.
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2022, 61 (10) : 7585 - 7603