Time Series Prediction via Similarity Search: Exploring Invariances, Distance Measures and Ensemble Functions

被引:5
|
作者
Parmezan, Antonio R. S. [1 ]
Souza, Vinicius M. A. [2 ]
Batista, Gustavo E. A. P. A. [3 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, BR-13566590 Sao Carlos, Brazil
[2] Pontificia Univ Catolica Parana, Grad Program Informat, BR-80215901 Curitiba, Parana, Brazil
[3] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
关键词
Forecasting; multi-step-ahead prediction; pattern sequence similarity; univariate analysis; ARTIFICIAL NEURAL-NETWORKS; MACHINE LEARNING-MODELS; DIFFERENTIAL EVOLUTION; ALGORITHM;
D O I
10.1109/ACCESS.2022.3192849
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid advance of scientific research in data mining has led to the adaptation of conventional pattern extraction methods to the context of time series analysis. The forecasting (or prediction) task has been supported mainly by regression algorithms based on artificial neural networks, support vector machines, and k-Nearest Neighbors (kNN). However, some studies provided empirical evidence that similarity-based methods, i.e. variations of kNN, constitute a promising approach compared with more complex predictive models from both machine learning and statistics. Although the scientific community has made great strides in increasing the visibility of these easy-to-fit and impressively accurate algorithms, previous work has failed to recognize the right invariances needed for this task. We propose a novel extension of kNN, namely kNN - Time Series Prediction with Invariances (kNN-TSPI), that differs from the literature by combining techniques to obtain amplitude and offset invariance, complexity invariance, and treatment of trivial matches. Our predictor enables more meaningful matches between reference queries and data subsequences. From a comprehensive evaluation with real-world datasets, we demonstrate that kNN-TSPI is a competitive algorithm against two conventional similarity-based approaches and, most importantly, against 11 popular predictors. To assist future research and provide a better understanding of similarity-based method behaviors, we also explore different settings of kNN-TSPI regarding invariances to distortions in time series, distance measures, complexity-invariant distances, and ensemble functions. Results show that kNN-TSPI stands out for its robustness and stability both concerning the parameter k and the accuracy of the projection horizon trends.
引用
收藏
页码:78022 / 78043
页数:22
相关论文
共 50 条
  • [31] An empirical evaluation of similarity measures for time series classification
    Serra, Joan
    Arcos, Josep Ll.
    KNOWLEDGE-BASED SYSTEMS, 2014, 67 : 305 - 314
  • [32] An Experimental Evaluation of Similarity Measures for Uncertain Time Series
    Orang, Mahsa
    Shiri, Nematollaah
    PROCEEDINGS OF THE 18TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM (IDEAS14), 2014, : 261 - 264
  • [33] On-line Elastic Similarity Measures for time series
    Oregi, Izaskun
    Perez, Aritz
    Del Ser, Javier
    Lozano, Jose A.
    PATTERN RECOGNITION, 2019, 88 : 506 - 517
  • [34] Time Series Subsequence Similarity Search Under Dynamic Time Warping Distance on the Intel Many-core Accelerators
    Movchan, Aleksandr
    Zymbler, Mikhail
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2015, 2015, 9371 : 295 - 306
  • [35] Similarity search and performance prediction of shield tunnels in operation through time series data mining
    Zhu, Hehua
    Wang, Xin
    Chen, Xueqin
    Zhang, Lianyang
    AUTOMATION IN CONSTRUCTION, 2020, 114
  • [36] Research on time-series based and similarity search based methods for PV power prediction
    Jiang, Meng
    Ding, Kun
    Chen, Xiang
    Cui, Liu
    Zhang, Jingwei
    Yang, Zenan
    Cang, Yi
    Cao, Shang
    ENERGY CONVERSION AND MANAGEMENT, 2024, 308
  • [37] An Efficient Similarity Search For Financial Multivariate Time Series
    Zhou, Dazhuo
    Li, Minqiang
    Yan, Hongcan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11161 - 11164
  • [38] Fast online similarity search for uncertain time series
    Ma R.
    Zheng D.
    Yan L.
    Journal of Computing and Information Technology, 2020, 28 (01): : 1 - 17
  • [39] Similarity search on time series based on threshold queries
    Assfalg, J
    Kriegel, HP
    Kröger, P
    Kunath, P
    Pryakhin, A
    Renz, M
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 276 - 294
  • [40] GPU Acceleration of Similarity Search for Uncertain Time Series
    Hwang, Jun
    Kozawa, Yusuke
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    2014 17TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2014), 2014, : 626 - 631