Time Series Prediction via Similarity Search: Exploring Invariances, Distance Measures and Ensemble Functions

Cited by: 5
Authors
Parmezan, Antonio R. S. [1]
Souza, Vinicius M. A. [2]
Batista, Gustavo E. A. P. A. [3]
Affiliations
[1] Univ Sao Paulo, Inst Math & Comp Sci, BR-13566590 Sao Carlos, Brazil
[2] Pontificia Univ Catolica Parana, Grad Program Informat, BR-80215901 Curitiba, Parana, Brazil
[3] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
Keywords
Forecasting; multi-step-ahead prediction; pattern sequence similarity; univariate analysis; ARTIFICIAL NEURAL-NETWORKS; MACHINE LEARNING-MODELS; DIFFERENTIAL EVOLUTION; ALGORITHM
DOI
10.1109/ACCESS.2022.3192849
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The rapid advance of scientific research in data mining has led to the adaptation of conventional pattern extraction methods to the context of time series analysis. The forecasting (or prediction) task has been supported mainly by regression algorithms based on artificial neural networks, support vector machines, and k-Nearest Neighbors (kNN). However, some studies provided empirical evidence that similarity-based methods, i.e. variations of kNN, constitute a promising approach compared with more complex predictive models from both machine learning and statistics. Although the scientific community has made great strides in increasing the visibility of these easy-to-fit and impressively accurate algorithms, previous work has failed to recognize the right invariances needed for this task. We propose a novel extension of kNN, namely kNN - Time Series Prediction with Invariances (kNN-TSPI), that differs from the literature by combining techniques to obtain amplitude and offset invariance, complexity invariance, and treatment of trivial matches. Our predictor enables more meaningful matches between reference queries and data subsequences. From a comprehensive evaluation with real-world datasets, we demonstrate that kNN-TSPI is a competitive algorithm against two conventional similarity-based approaches and, most importantly, against 11 popular predictors. To assist future research and provide a better understanding of similarity-based method behaviors, we also explore different settings of kNN-TSPI regarding invariances to distortions in time series, distance measures, complexity-invariant distances, and ensemble functions. Results show that kNN-TSPI stands out for its robustness and stability both concerning the parameter k and the accuracy of the projection horizon trends.
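The three ingredients named in the abstract (amplitude and offset invariance, complexity invariance, and treatment of trivial matches) can be illustrated with a small similarity-based forecaster. The following is a minimal sketch under stated assumptions, not the authors' kNN-TSPI implementation: this record does not give the algorithm's details, so the function names (`z_normalize`, `cid_distance`, `knn_forecast`), the one-step-ahead setup, the rule for skipping overlapping neighbours, and the mean used as ensemble function are all illustrative choices. The complexity correction follows the commonly used complexity-invariant distance, which scales the Euclidean distance by the ratio of the two complexity estimates.

```python
import numpy as np


def z_normalize(x):
    """Rescale to zero mean and unit variance (amplitude and offset invariance)."""
    x = np.asarray(x, dtype=float)
    std = x.std()
    if std < 1e-12:  # constant subsequence: avoid division by zero
        return np.zeros_like(x)
    return (x - x.mean()) / std


def complexity(x):
    """Complexity estimate: root of the summed squared successive differences."""
    return np.sqrt(np.sum(np.diff(x) ** 2))


def cid_distance(a, b):
    """Euclidean distance scaled by a complexity correction factor."""
    ca, cb = complexity(a), complexity(b)
    factor = max(ca, cb) / max(min(ca, cb), 1e-12)
    return np.linalg.norm(a - b) * factor


def knn_forecast(series, window, k):
    """One-step-ahead forecast from the k most similar past subsequences.

    Assumption of this sketch: the last `window` observations form the query,
    and the ensemble function is the plain mean of the neighbours' next values.
    """
    series = np.asarray(series, dtype=float)
    n = len(series)
    if n <= window:
        raise ValueError("series must be longer than the window")

    query = series[-window:]
    q_mean, q_std = query.mean(), query.std()
    q_norm = z_normalize(query)

    # Score every subsequence that still has a known next value.
    candidates = []
    for i in range(n - window):
        sub = z_normalize(series[i:i + window])
        candidates.append((cid_distance(q_norm, sub), i))
    candidates.sort()

    # Trivial-match treatment (illustrative rule): skip any candidate that
    # overlaps a neighbour that has already been selected.
    chosen = []
    for _, i in candidates:
        if all(abs(i - j) >= window for j in chosen):
            chosen.append(i)
        if len(chosen) == k:
            break

    # Express each neighbour's next value in that neighbour's normalized
    # scale, average, then map back to the scale of the query.
    next_values = []
    for i in chosen:
        sub = series[i:i + window]
        s = sub.std()
        next_values.append(0.0 if s < 1e-12 else (series[i + window] - sub.mean()) / s)
    return float(np.mean(next_values) * q_std + q_mean)
```

For example, `knn_forecast(np.sin(2 * np.pi * np.arange(205) / 20), window=20, k=3)` forecasts the next point of a noise-free sine wave; because neighbours are compared after z-normalization, rescaling or shifting the whole series rescales or shifts the forecast accordingly.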
Pages: 78022-78043
Page count: 22
Related papers
50 in total
  • [41] Set-based Similarity Search for Time Series
    Peng, Jinglin
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2039 - 2052
  • [43] Time Series Similarity Search Methods for Sensor Data
    Jawale, Anupama
    Magar, Ganesh
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (02) : 120 - 129
  • [44] Similarity Search of Time Series Trajectories Based on Shape
    Gawde, Gajanan
    Pawar, Jyoti
    PROCEEDINGS OF THE ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA (CODS-COMAD'18), 2018, : 340 - 343
  • [45] Parallelization of similarity search in large time series databases
    Qiao, Jonathan
    Ye, Yang
    Zhang, Chaoyang
    FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1, 2006, : 355 - +
  • [46] Cluster-Based Similarity Search in Time Series
    Karamitopoulos, Leonidas
    Evangelidis, Georgios
    PROCEEDINGS OF THE 2009 FOURTH BALKAN CONFERENCE IN INFORMATICS, 2009, : 113 - 118
  • [47] AN APPROACH FOR TIME SERIES SIMILARITY SEARCH BASED ON LUCENE
    Chang, Min
    Lou, Yuansheng
    Qiu, Lei
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 210 - 214
  • [48] Underlying techniques of efficient similarity search on time series
    Feng, Yu-Cai
    Jiang, Tao
    Li, Guo-Hui
    Zhu, Hong
    Jisuanji Xuebao/Chinese Journal of Computers, 2009, 32 (11) : 2107 - 2122
  • [49] Similarity search in time series databases using moments
    Toshniwal, D
    Joshi, RC
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA'04), 2004, : 164 - 171
  • [50] Improving Time Series Prediction via Modification of Dynamic Weighted Majority in Ensemble Learning
    Loderer, Marek
    Pavlik, Peter
    Rozinajova, Viera
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 651 - 660