Time Series Prediction via Similarity Search: Exploring Invariances, Distance Measures and Ensemble Functions

被引:5
|
作者
Parmezan, Antonio R. S. [1 ]
Souza, Vinicius M. A. [2 ]
Batista, Gustavo E. A. P. A. [3 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, BR-13566590 Sao Carlos, Brazil
[2] Pontificia Univ Catolica Parana, Grad Program Informat, BR-80215901 Curitiba, Parana, Brazil
[3] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
关键词
Forecasting; multi-step-ahead prediction; pattern sequence similarity; univariate analysis; ARTIFICIAL NEURAL-NETWORKS; MACHINE LEARNING-MODELS; DIFFERENTIAL EVOLUTION; ALGORITHM;
D O I
10.1109/ACCESS.2022.3192849
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid advance of scientific research in data mining has led to the adaptation of conventional pattern extraction methods to the context of time series analysis. The forecasting (or prediction) task has been supported mainly by regression algorithms based on artificial neural networks, support vector machines, and k-Nearest Neighbors (kNN). However, some studies provided empirical evidence that similarity-based methods, i.e. variations of kNN, constitute a promising approach compared with more complex predictive models from both machine learning and statistics. Although the scientific community has made great strides in increasing the visibility of these easy-to-fit and impressively accurate algorithms, previous work has failed to recognize the right invariances needed for this task. We propose a novel extension of kNN, namely kNN - Time Series Prediction with Invariances (kNN-TSPI), that differs from the literature by combining techniques to obtain amplitude and offset invariance, complexity invariance, and treatment of trivial matches. Our predictor enables more meaningful matches between reference queries and data subsequences. From a comprehensive evaluation with real-world datasets, we demonstrate that kNN-TSPI is a competitive algorithm against two conventional similarity-based approaches and, most importantly, against 11 popular predictors. To assist future research and provide a better understanding of similarity-based method behaviors, we also explore different settings of kNN-TSPI regarding invariances to distortions in time series, distance measures, complexity-invariant distances, and ensemble functions. Results show that kNN-TSPI stands out for its robustness and stability both concerning the parameter k and the accuracy of the projection horizon trends.
引用
收藏
页码:78022 / 78043
页数:22
相关论文
共 50 条
  • [1] Elastic similarity and distance measures for multivariate time series
    Shifaz, Ahmed
    Pelletier, Charlotte
    Petitjean, Francois
    Webb, Geoffrey I.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (06) : 2665 - 2698
  • [2] Elastic similarity and distance measures for multivariate time series
    Ahmed Shifaz
    Charlotte Pelletier
    François Petitjean
    Geoffrey I. Webb
    Knowledge and Information Systems, 2023, 65 : 2665 - 2698
  • [3] Isomorphism Distance in Multidimensional Time Series and Similarity Search
    Guo Wensheng
    Ji Lianen
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 : 209 - 217
  • [4] Histogram Distance for Similarity Search in Large Time Series Database
    Ouyang, Yicun
    Zhang, Feng
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2010, 2010, 6283 : 170 - 177
  • [5] A Study of the Use of Complexity Measures in the Similarity Search Process Adopted by kNN Algorithm for Time Series Prediction
    Sabino Parmezan, Antonio Rafael
    Batista, Gustavo E. A. P. A.
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 45 - 51
  • [6] Time series similarity measures and time series indexing
    Gunopulos, D
    Das, G
    SIGMOD RECORD, 2001, 30 (02) : 624 - 624
  • [7] Supervised Temporal Link Prediction Using Time Series of Similarity Measures
    Ozcan, Alper
    Oguducu, Sule Gunduz
    2017 NINTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2017), 2017, : 519 - 521
  • [8] Speeding Up Similarity Search on a Large Time Series Dataset under Time Warping Distance
    Ruengronghirunya, Pongsakorn
    Niennattrakul, Vit
    Ratanamahatana, Chotirat Ann
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 981 - 988
  • [9] Accelerating time series similarity search under Move-Split-Merge distance via dissimilarity space embedding
    Zhang, Haowen
    Li, Juan
    Feng, Jinwang
    Yao, Qing
    Dong, Yabo
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [10] Performance Improvement via Bagging in Ensemble Prediction of Chaotic Time Series Using Similarity of Attractors and LOOCV Predictable Horizon
    Toidani, Mitsuki
    Matsuo, Kazuya
    Kurogi, Shuichi
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 590 - 598