Evaluating time series forecasting models: an empirical study on performance estimation methods

被引：107

作者：

Cerqueira, Vitor ^{[1
]}

Torgo, Luis ^{[1
,2
,3
]}

Mozetic, Igor ^{[4
]}

机构：

[1] LIAAD INESC TEC, Porto, Portugal

[2] Univ Porto, Porto, Portugal

[3] Dalhousie Univ, Halifax, NS, Canada

[4] Jozef Stefan Inst, Ljubljana, Slovenia

来源：

MACHINE LEARNING | 2020年 / 109卷 / 11期

关键词：

Performance estimation; Model selection; Cross validation; Time series; Forecasting; CROSS-VALIDATION; SELECTION;

D O I：

10.1007/s10994-020-05910-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Performance estimation aims at estimating the loss that a predictive model will incur on unseen data. This process is a fundamental stage in any machine learning project. In this paper we study the application of these methods to time series forecasting tasks. For independent and identically distributed data the most common approach is cross-validation. However, the dependency among observations in time series raises some caveats about the most appropriate way to estimate performance in this type of data. Currently, there is no consensual approach. We contribute to the literature by presenting an extensive empirical study which compares different performance estimation methods for time series forecasting tasks. These methods include variants of cross-validation, out-of-sample (holdout), and prequential approaches. Two case studies are analysed: One with 174 real-world time series and another with three synthetic time series. Results show noticeable differences in the performance estimation methods in the two scenarios. In particular, empirical experiments suggest that blocked cross-validation can be applied to stationary time series. However, when the time series are non-stationary, the most accurate estimates are produced by out-of-sample methods, particularly the holdout approach repeated in multiple testing periods.

引用

页码：1997 / 2028

页数：32

共 50 条

[21] Forecasting performance of time series models on electricity spot markets
Guertler, Marc
Paulsen, Thomas
[J]. INTERNATIONAL JOURNAL OF ENERGY SECTOR MANAGEMENT, 2018, 12 (04) : 617 - 640
[22] Time series models for forecasting wastewater treatment plant performance
Berthouex, PM
Box, GE
[J]. WATER RESEARCH, 1996, 30 (08) : 1865 - 1875
[23] Time Series Models for Performance Evaluation of Network Traffic Forecasting
Kim, S.
[J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2007, 20 (02) : 219 - 227
[24] Comparison of forecasting performance of nonlinear models of hydrological time series
Komornik, Jozef
Komornikova, Magda
Mesiar, Radko
Szokeova, Danusa
Szolgay, Jan
[J]. PHYSICS AND CHEMISTRY OF THE EARTH, 2006, 31 (18) : 1127 - 1145
[25] Selection of time series forecasting models based on performance information
dos Santos, PM
Ludermir, TB
Prudêncio, RBC
[J]. HIS'04: FOURTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, : 366 - 371
[26] A Monte Carlo study of the forecasting performance of empirical SETAR models
Clements, MP
Smith, J
[J]. JOURNAL OF APPLIED ECONOMETRICS, 1999, 14 (02) : 123 - 141
[27] Time series forecasting for dynamic quality of web services: An empirical study
Syu, Yang
Kuo, Jong-Yih
Fanjiang, Yong-Yi
[J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 134 : 279 - 303
[28] A Large-Scale Empirical Study of Aligned Time Series Forecasting
Pilyugina, Polina
Medvedeva, Svetlana
Mosievich, Kirill
Trofimov, Ilya
Kostromina, Alina
Simakov, Dmitry
Burnaev, Evgeny
[J]. IEEE ACCESS, 2024, 12 : 131100 - 131121
[29] Comparative Study on Univariate Forecasting Methods for Meteorological Time Series
Thi-Thu-Hong Phan
Caillault, Emilie Poisson
Bigand, Andre
[J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2380 - 2384
[30] A COMPARISON OF THE FORECASTING PERFORMANCE OF WEFA AND ARIMA TIME-SERIES METHODS
DHRYMES, PJ
PERISTIANI, SC
[J]. INTERNATIONAL JOURNAL OF FORECASTING, 1988, 4 (01) : 81 - 101

← 1 2 3 4 5 →