Evaluating time series forecasting models: an empirical study on performance estimation methods

Cited: 107
Authors
Cerqueira, Vitor [1 ]
Torgo, Luis [1 ,2 ,3 ]
Mozetic, Igor [4 ]
Affiliations
[1] LIAAD INESC TEC, Porto, Portugal
[2] Univ Porto, Porto, Portugal
[3] Dalhousie Univ, Halifax, NS, Canada
[4] Jozef Stefan Inst, Ljubljana, Slovenia
Keywords
Performance estimation; Model selection; Cross-validation; Time series; Forecasting
DOI
10.1007/s10994-020-05910-7
CLC Classification
TP18 (Theory of artificial intelligence)
Discipline Codes
081104; 0812; 0835; 1405
Abstract
Performance estimation aims to estimate the loss that a predictive model will incur on unseen data. This process is a fundamental stage in any machine learning project. In this paper we study the application of these methods to time series forecasting tasks. For independent and identically distributed data the most common approach is cross-validation. However, the dependency among observations in time series raises some caveats about the most appropriate way to estimate performance in this type of data. Currently, there is no consensus on which approach is best. We contribute to the literature by presenting an extensive empirical study which compares different performance estimation methods for time series forecasting tasks. These methods include variants of cross-validation, out-of-sample (holdout), and prequential approaches. Two case studies are analysed: one with 174 real-world time series and another with three synthetic time series. Results show noticeable differences among the performance estimation methods in the two scenarios. In particular, the empirical experiments suggest that blocked cross-validation can be applied to stationary time series. However, when the time series are non-stationary, the most accurate estimates are produced by out-of-sample methods, particularly the holdout approach repeated over multiple testing periods.
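The two families of estimation procedures that the abstract contrasts can be sketched as index-splitting schemes. The following is a minimal illustration under simplifying assumptions, not the authors' implementation; all function names and parameters are hypothetical. Blocked cross-validation partitions the series into contiguous, unshuffled blocks, while repeated holdout draws several cut-off points and always tests on the window immediately after the training window.

```python
import random

def blocked_cv_splits(n, k):
    """Blocked cross-validation: split indices 0..n-1 into k contiguous,
    non-overlapping blocks. Each block serves once as the test set and the
    remaining observations form the training set. Unlike standard K-fold
    CV, observations are never shuffled, so temporal order within each
    block is preserved."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    splits, start = [], 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n))
        splits.append((train, test))
        start += size
    return splits

def repeated_holdout_splits(n, n_reps, train_frac=0.6, test_frac=0.2, seed=0):
    """Repeated out-of-sample holdout: draw several random cut-off points
    and, at each one, train on the window ending at the cut and test on
    the window starting right after it, so the test set always follows
    the training set in time."""
    rng = random.Random(seed)
    train_len, test_len = int(n * train_frac), int(n * test_frac)
    splits = []
    for _ in range(n_reps):
        cut = rng.randint(train_len, n - test_len)
        splits.append((list(range(cut - train_len, cut)),
                       list(range(cut, cut + test_len))))
    return splits
```

For example, `blocked_cv_splits(10, 5)` yields five folds whose test blocks are `[0, 1]`, `[2, 3]`, and so on, while `repeated_holdout_splits(100, 10)` yields ten train/test pairs, each respecting temporal order. The key design difference mirrors the paper's finding: blocked CV reuses every observation for testing (efficient on stationary series), whereas repeated holdout only ever evaluates on data that comes after the training window, which is safer when the series is non-stationary.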
Pages: 1997-2028 (32 pages)