A New Timing Error Cost Function for Binary Time Series Prediction

被引:11
|
作者
Rivest, Francois [1 ,2 ,3 ]
Kohar, Richard [1 ]
机构
[1] Royal Mil Coll Canada, Dept Math & Comp Sci, Kingston, ON K7K 4B4, Canada
[2] Queens Univ, Ctr Neurosci Studies, Kingston, ON K7L 3N6, Canada
[3] Queens Univ, Sch Comp, Kingston, ON K7L 3N6, Canada
关键词
Timing; Time series analysis; Cost function; Forecasting; Prediction algorithms; Animals; Task analysis; Dynamic time warping (DTW); recurrent neural network; squared timing error (STE); time series forecasting; MODEL;
D O I
10.1109/TNNLS.2019.2900046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to make predictions is central to the artificial intelligence problem. While machine learning algorithms have difficulty in learning to predict events with hundreds of time-step dependencies, animals can learn event timing within tens of trials across a broad spectrum of time scales. This suggests strongly a need for new perspectives on the forecasting problem. This paper focuses on binary time series that can be predicted within some temporal precision. We demonstrate that the sum of squared errors (SSE) calculated at every time step is not appropriate for this problem. Next, we look at the advantages and shortcomings of using a dynamic time warping (DTW) cost function. Then, we propose the squared timing error (STE) that uses DTW on the event space and applies SSE on the timing error instead of at each time step. We evaluate all three cost functions on different types of timing errors, such as phase shift, warping, and missing events, on synthetic and real-world binary time series (heartbeats, finance, and music). The results show that STE provides more information about timing error, is differentiable, and can be computed online efficiently. Finally, we devise a gradient descent algorithm for STE on a simplified recurrent neural network. We then compare the performance of the STE-based algorithm to SSE- and logit-based gradient descent algorithms on the same network architecture. The results in real-world binary time series show that the STE algorithm generally outperforms all the other cost functions considered.
引用
收藏
页码:174 / 185
页数:12
相关论文
共 50 条