Solving Some Problems of Predictive Analytics for Time Series Data

被引:0
|
作者
Botygin, Igor [1 ]
Sherstneva, Anna [1 ]
Sherstnev, Vladislav [1 ]
机构
[1] Tomsk Polytechn Univ, Tomsk, Russia
关键词
Regression analysis; Additive regression model; Autoregressive integrated moving average model;
D O I
10.1007/978-3-031-09070-7_32
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Approaches for constructing an additive regression model and a seasonally integrated autoregressive-moving average model were considered. Experiments were aimed at investigating and developing an algorithm for forecasting meteorological fields. In particular, a time series forecast of monthly precipitation totals for the next two years was built. In the software experiments for building forecast models, arrays containing data from eight-year observations of atmospheric precipitation were used. That is, sums of precipitation for the period between standard synoptic dates with an interval of three hours were used. The year 2016 was selected as the forecast year. The original data series was processed - emissions that were not attributed to seasonal variations were removed. Additionally, the allowable maximum amount of precipitation between dates for an observation station with synoptic index 29430 recommended by the All-Russian Research Institute for Hydrometeorological Information - World Data Centre (VNIIGMI-WDC) was controlled. The Python programming language was used as a tool to generate forecasts for the time series data. An additive regression model was created using the Prophet library from Facebook. A seasonally integrated autoregressive-moving average model was created using the StatsModels library. A DF-test (DickeyFuller basic test), which checks for the presence of a single "unit root", was used to test for stationarity. The testing and writing of the program code took place in the interactive Jupyter Notebook environment. The environment is a graphical web shell for Python and extends the idea of the console approach to interactive computing. Model forecast accuracy was assessed by calculating absolute and mean absolute prediction errors.
引用
收藏
页码:382 / 391
页数:10
相关论文
共 50 条
  • [21] Clustering and Classification for Time Series Data in Visual Analytics: A Survey
    Ali, Mohammed
    Alqahtani, Ali
    Jones, Mark W.
    Xie, Xianghua
    IEEE ACCESS, 2019, 7 : 181314 - 181338
  • [22] Visual Analytics of Multivariate Intensive Care Time Series Data
    Brich, N.
    Schulz, C.
    Peter, J.
    Klingert, W.
    Schenk, M.
    Weiskopf, D.
    Krone, M.
    COMPUTER GRAPHICS FORUM, 2022, 41 (06) : 273 - 286
  • [23] Multivariate time series dataset for space weather data analytics
    Angryk, Rafal A.
    Martens, Petrus C.
    Aydin, Berkay
    Kempton, Dustin
    Mahajan, Sushant S.
    Basodi, Sunitha
    Ahmadzadeh, Azim
    Cai, Xumin
    Filali Boubrahimi, Soukaina
    Hamdi, Shah Muhammad
    Schuh, Michael A.
    Georgoulis, Manolis K.
    SCIENTIFIC DATA, 2020, 7 (01)
  • [24] Analysis and Implementation of a Parallel Computing Cluster for Solving Computational Problems in Data Analytics
    Rao, Chavala Mutyala
    Shyamala, K.
    PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020), 2020,
  • [25] Towards Integrated Data Analytics: Time Series Forecasting in DBMS
    Fischer, Ulrike
    Dannecker, Lars
    Siksnys, Laurynas
    Rosenthal, Frank
    Boehm, Matthias
    Lehner, Wolfgang
    Datenbank-Spektrum, 2013, 13 (01) : 45 - 53
  • [26] PREDICTIVE ANALYTICS ON POLITICAL DATA
    Raj, Adarsh Samuel J.
    Fernando, Lenin J.
    Raj, Sunil Y.
    2017 2ND WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT), 2017, : 93 - 96
  • [27] Time Series Data Mining: A Case Study With Big Data Analytics Approach
    Wang, Fang
    Li, Menggang
    Mei, Yiduo
    Li, Wenrui
    IEEE ACCESS, 2020, 8 : 14322 - 14328
  • [28] Solving incrementally the fitting and detection problems in fMRI time series
    Roche, A
    Pinel, P
    Dehaene, S
    Poline, JB
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2004, PT 2, PROCEEDINGS, 2004, 3217 : 719 - 726
  • [29] BAYESIAN SOLUTIONS TO SOME PROBLEMS IN POOLING CROSS-SECTION AND TIME SERIES DATA
    MADDALA, GS
    ECONOMETRICA, 1971, 39 (04) : 125 - &
  • [30] Real-Time Predictive Analytics Using Degradation Image Data
    Fang, Xiaolei
    Paynabar, Kamran
    Gebraeel, Nagi
    2018 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS), 2018,