A Comparative Study of Performance Estimation Methods for Time Series Forecasting

被引:23
|
作者
Cerqueira, Vitor [1 ,2 ]
Torgo, Luis [1 ,2 ]
Smailovic, Jasmina [3 ]
Mozetic, Igor [3 ]
机构
[1] LIAAD INESCTEC, Porto, Portugal
[2] Univ Porto, Porto, Portugal
[3] Jozef Stefan Inst, Jamova 39, Ljubljana 1000, Slovenia
基金
欧盟地平线“2020”;
关键词
performance estimation; model selection; cross validation; time series; CROSS-VALIDATION; MODEL; SELECTION;
D O I
10.1109/DSAA.2017.7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Performance estimation denotes a task of estimating the loss that a predictive model will incur on unseen data. These procedures are part of the pipeline in every machine learning task and are used for assessing the overall generalisation ability of models. In this paper we address the application of these methods to time series forecasting tasks. For independent and identically distributed data the most common approach is cross-validation. However, the dependency among observations in time series raises some caveats about the most appropriate way to estimate performance in these datasets and currently there is no settled way to do so. We compare different variants of cross-validation and different variants of out-of-sample approaches using two case studies: One with 53 real-world time series and another with three synthetic time series. Results show noticeable differences in the performance estimation methods in the two scenarios. In particular, empirical experiments suggest that cross-validation approaches can be applied to stationary synthetic time series. However, in real-world scenarios the most accurate estimates are produced by the out-of-sample methods, which preserve the temporal order of observations.
引用
收藏
页码:529 / 538
页数:10
相关论文
共 50 条
  • [41] A Comparative Analysis of Univariate Time Series Methods for Estimating and Forecasting Daily Spam in United States
    Zhang, Jie
    Lee, Gene Moo
    Wang, Jingguo
    AMCIS 2016 PROCEEDINGS, 2016,
  • [42] Forecasting meteorological time series using soft computing methods: an empirical study
    Bautu, Elena
    Barbulescu, Alina
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (04): : 1297 - 1306
  • [43] Time Series Complexities and Their Relationship to Forecasting Performance
    Ponce-Flores, Mirna
    Frausto-Solis, Juan
    Santamaria-Bonfil, Guillermo
    Perez-Ortega, Joaquin
    Gonzalez-Barbosa, Juan J.
    ENTROPY, 2020, 22 (01) : 89
  • [44] RECURSIVE ESTIMATION AND FORECASTING OF NONSTATIONARY TIME-SERIES
    NG, CN
    YOUNG, PC
    JOURNAL OF FORECASTING, 1990, 9 (02) : 173 - 204
  • [45] Modeling ANNs Performance on Time Series Forecasting
    Suarez, Ranyart R.
    Graff, Mario
    2013 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC), 2013,
  • [46] The Performance of LSTM and BiLSTM in Forecasting Time Series
    Siami-Namini, Sima
    Tavakoli, Neda
    Namin, Akbar Siami
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 3285 - 3292
  • [47] Performance of periodic time series models in forecasting
    Herwartz H.
    Empirical Economics, 1999, 24 (2) : 271 - 301
  • [48] A triangulation estimation and forecasting framework for agricultural time series
    Chou, Fu-, I
    Ho, Wen-Hsien
    Chen, Yenming J.
    Tsai, Jinn-Tsong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (04) : 7893 - 7899
  • [49] A Comparative Study on WEMA and H-WEMA Forecasting Methods in Time Series Analysis (Case Study: JKSE Composite Index Data)
    Hansun, Seng
    2016 6TH INTERNATIONAL ANNUAL ENGINEERING SEMINAR (INAES), 2016, : 6 - 10
  • [50] Parameter estimation methods for gene circuit modeling from time-series mRNA data: a comparative study
    Fan, Ming
    Kuwahara, Hiroyuki
    Wang, Xiaolei
    Wang, Suojin
    Gao, Xin
    BRIEFINGS IN BIOINFORMATICS, 2015, 16 (06) : 987 - 999