Time Series Prediction via Similarity Search: Exploring Invariances, Distance Measures and Ensemble Functions

被引:5
|
作者
Parmezan, Antonio R. S. [1 ]
Souza, Vinicius M. A. [2 ]
Batista, Gustavo E. A. P. A. [3 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, BR-13566590 Sao Carlos, Brazil
[2] Pontificia Univ Catolica Parana, Grad Program Informat, BR-80215901 Curitiba, Parana, Brazil
[3] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
关键词
Forecasting; multi-step-ahead prediction; pattern sequence similarity; univariate analysis; ARTIFICIAL NEURAL-NETWORKS; MACHINE LEARNING-MODELS; DIFFERENTIAL EVOLUTION; ALGORITHM;
D O I
10.1109/ACCESS.2022.3192849
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid advance of scientific research in data mining has led to the adaptation of conventional pattern extraction methods to the context of time series analysis. The forecasting (or prediction) task has been supported mainly by regression algorithms based on artificial neural networks, support vector machines, and k-Nearest Neighbors (kNN). However, some studies provided empirical evidence that similarity-based methods, i.e. variations of kNN, constitute a promising approach compared with more complex predictive models from both machine learning and statistics. Although the scientific community has made great strides in increasing the visibility of these easy-to-fit and impressively accurate algorithms, previous work has failed to recognize the right invariances needed for this task. We propose a novel extension of kNN, namely kNN - Time Series Prediction with Invariances (kNN-TSPI), that differs from the literature by combining techniques to obtain amplitude and offset invariance, complexity invariance, and treatment of trivial matches. Our predictor enables more meaningful matches between reference queries and data subsequences. From a comprehensive evaluation with real-world datasets, we demonstrate that kNN-TSPI is a competitive algorithm against two conventional similarity-based approaches and, most importantly, against 11 popular predictors. To assist future research and provide a better understanding of similarity-based method behaviors, we also explore different settings of kNN-TSPI regarding invariances to distortions in time series, distance measures, complexity-invariant distances, and ensemble functions. Results show that kNN-TSPI stands out for its robustness and stability both concerning the parameter k and the accuracy of the projection horizon trends.
引用
收藏
页码:78022 / 78043
页数:22
相关论文
共 50 条
  • [21] Exploring knowledge benchmarking using time-series directional distance functions and bibliometrics
    Nepomuceno, Thyago Celso C.
    de Carvalho, Victor Diogho H.
    Nepomuceno, Kessia Thais C.
    Costa, Ana Paula C. S.
    EXPERT SYSTEMS, 2023, 40 (01)
  • [22] Fast Similarity Search of Multi-dimensional Time Series via Segment Rotation
    Gong, Xudong
    Xiong, Yan
    Huang, Wenchao
    Chen, Lei
    Lu, Qiwei
    Hu, Yiqing
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT1, 2015, 9049 : 108 - 124
  • [23] Applying Statistical Models and Parametric Distance Measures for Music Similarity Search
    Lukashevich, Hanna
    Dittmar, Christian
    Bastuck, Christoph
    ADVANCES IN DATA ANALYSIS, DATA HANDLING AND BUSINESS INTELLIGENCE, 2010, : 409 - 418
  • [24] On general purpose time series similarity measures and their use as kernel functions in support vector machines
    Pree, Helmuth
    Herwig, Benjamin
    Gruber, Thiemo
    Sick, Bernhard
    David, Klaus
    Lukowicz, Paul
    INFORMATION SCIENCES, 2014, 281 : 478 - 495
  • [25] Time series classification via divergence measures between probability density functions
    Mello, Carlos E.
    Carvalho, Andre S. T.
    Lyra, Adria
    Pedreira, Carlos E.
    PATTERN RECOGNITION LETTERS, 2019, 125 : 42 - 48
  • [26] Iterated time series prediction with ensemble models
    Wichard, JD
    Ogorzalek, M
    PROCEEDINGS OF THE 23RD IASTED INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION, AND CONTROL, 2004, : 512 - 517
  • [27] Time series prediction based on ensemble ANFIS
    Chen, DW
    Zhang, JP
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 3552 - 3556
  • [28] Temporal Link Prediction Using Time Series of Quasi-Local Node Similarity Measures
    Ozcan, Alper
    Oguducu, Sule Gunduz
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 381 - 386
  • [29] Classification of time series data with nonlinear similarity measures
    Schreiber, T
    Schmitz, A
    PHYSICAL REVIEW LETTERS, 1997, 79 (08) : 1475 - 1478
  • [30] A Comparative Study of Similarity Measures for Time Series Classification
    Yoshida, Sho
    Chakraborty, Basabi
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2017, 10091 : 397 - 408