Application of one-, three-, and seven-day forecasts during early onset on the COVID-19 epidemic dataset using moving average, autoregressive, autoregressive moving average, autoregressive integrated moving average, and naive forecasting methods

被引:7
|
作者
Lynch, Christopher J. [1 ]
Gore, Ross [1 ]
机构
[1] Old Dominion Univ, Virginia Modeling Anal & Simulat Ctr VMASC, Norfolk, VA 23529 USA
来源
DATA IN BRIEF | 2021年 / 35卷
关键词
Coronavirus COVID-19; Infectious diseases; Epidemic modeling; ARIMA(p; d; q); model; ARMA model; Holt-winters exponential smoothing model; Statistical analysis; Short-range time series forecasting; TIME-SERIES;
D O I
10.1016/j.dib.2021.106759
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The coronavirus disease 2019 (COVID-19) spread rapidly across the world since its appearance in December 2019. This data set creates one-, three-, and seven-day forecasts of the COVID-19 pandemic's cumulative case counts at the county, health district, and state geographic levels for the state of Virginia. Forecasts are created over the first 46 days of reported COVID-19 cases using the cumulative case count data provided by The New York Times as of April 22, 2020. From this historical data, one-, three-, seven, and all-days prior to the forecast start date are used to generate the forecasts. Forecasts are created using: (1) a Naive approach; (2) Holt-Winters exponential smoothing (HW); (3) growth rate (Growth); (4) moving average (MA); (5) autoregressive (AR); (6) autoregressive moving average (ARMA); and (7) autoregressive integrated moving average (ARIMA). Median Absolute Error (MdAE) and Median Absolute Percentage Error (MdAPE) metrics are created with each forecast to evaluate the forecast with respect to existing historical data. These error metrics are aggregated to provide a means for assessing which combination of forecast method, forecast length, and lookback length are best fits, based on lowest aggregated error at each geographic level. The data set is comprised of an R-Project file, four R source code files, all 1,329,404 generated short-range forecasts, MdAE and MdAPE error metric data for each forecast, copies of the input files, and the generated comparison tables. All code and data files are provided to provide transparency and facilitate replicability and reproducibility. This package opens directly in RStudio through the R Project file. The R Project file removes the need to set path locations for the folders contained within the data set to simplify setup requirements. This data set provides two avenues for reproducing results: 1) Use the provided code to generate the forecasts from scratch and then run the analyses; or 2) Load the saved forecast data and run the analyses on the stored data. Code annotations provide the instructions needed to accomplish both routes. This data can be used to generate the same set of forecasts and error metrics for any US state by altering the state parameter within the source code. Users can also generate health district forecasts for any other state, by providing a file which maps each county within a state to its respective health-district. The source code can be connected to the most up-to-date version of The New York Times COVID-19 dataset allows for the generation of forecasts up to the most recently reported data to facilitate near real-time forecasting. (C) 2021 The Authors. Published by Elsevier Inc.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Forecasting and prediction of scorpion sting cases in Biskra province, Algeria, using a seasonal autoregressive integrated moving average model
    Selmane, Schehrazad
    L'Hadj, Mohamed
    [J]. EPIDEMIOLOGY AND HEALTH, 2016, 38 : e2016044
  • [42] Short-Term Stochastic Load Forecasting Using Autoregressive Integrated Moving Average Models and Hidden Markov Model
    Hermias, Jeffrel P.
    Teknomo, Kardi
    Monje, Jose Claro N.
    [J]. 2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICICT), 2017, : 131 - 137
  • [43] Electricity Sales Forecasting Using Hybrid Autoregressive Integrated Moving Average and Soft Computing Approaches in the Absence of Explanatory Variables
    Shao, Yuehjen E.
    Tsai, Yi-Shan
    [J]. ENERGIES, 2018, 11 (07):
  • [44] Forecasting and predicting intussusception in children younger than 48 months in Suzhou using a seasonal autoregressive integrated moving average model
    Guo, Wan-liang
    Geng, Jia
    Zhan, Yang
    Tan, Ya-lan
    Hu, Zhang-chun
    Pan, Peng
    Sheng, Mao
    Wang, Jian
    Huang, Shun-gen
    [J]. BMJ OPEN, 2019, 9 (01):
  • [45] Mid-Term Load Forecasting for Iran Power System Using Seasonal Autoregressive Integrated Moving Average Model (SARIMA)
    Dehghanzadeh, Ahmad
    Kazemimofrad, Haura
    Moghimzadeh, Mehdi
    Mashhadi, Mostafa Rajabi
    [J]. 26TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2018), 2018, : 1240 - 1245
  • [46] Forecasting New Tuberculosis Cases in Malaysia: A Time-Series Study Using the Autoregressive Integrated Moving Average (ARIMA) Model
    Rashid, Mohd Ariff Ab
    Zaki, Rafdzah Ahmad
    Mahiyuddin, Wan Rozita Wan
    Yahya, Abqariyah
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (09)
  • [47] Forecasting occurrence of palm weevil Rhynchophorus palmarum L. (Coleoptera, Curculionidae) using Autoregressive Integrated Moving Average modeling
    Pacheco-Sanchez, Eduardo L.
    Guamani-Quimis, Lenin A.
    da Rosa, Cinara Ewerling
    Portalanza, Diego
    Mieles, Alejandro E.
    Garces-Fiallos, Felipe R.
    [J]. SCIENTIA AGROPECUARIA, 2023, 14 (02) : 171 - 178
  • [48] Time series analysis and forecasting of cholera disease using discrete wavelet transform and seasonal autoregressive integrated moving average model
    Amshi, Ahmad Hauwa
    Prasad, Rajesh
    [J]. SCIENTIFIC AFRICAN, 2023, 20
  • [49] Optimization of pumping schedule based on water demand forecasting using a combined model of autoregressive integrated moving average and exponential smoothing
    Kang, Hyeong-Seok
    Kim, Hyunook
    Lee, Jaekyeong
    Lee, Ingyu
    Kwak, Byoung-Youn
    Im, Hyungjoon
    [J]. WATER SCIENCE AND TECHNOLOGY-WATER SUPPLY, 2015, 15 (01): : 188 - 195
  • [50] Forecasting of Milk Production in Northern Thailand Using Seasonal Autoregressive Integrated Moving Average, Error Trend Seasonality, and Hybrid Models
    Punyapornwithaya, Veerasak
    Jampachaisri, Katechan
    Klaharn, Kunnanut
    Sansamur, Chalutwan
    [J]. FRONTIERS IN VETERINARY SCIENCE, 2021, 8