Ranking and significance of variable-length similarity-based time series motifs

被引:3
|
作者
Serra, Joan [1 ,2 ]
Serra, Isabel [3 ]
Corral, Alvaro [3 ]
Lluis Arcos, Josep [2 ]
机构
[1] Telefon Res, Barcelona, Spain
[2] Artificial Intelligence Res Inst IIIA CSIC, Barcelona, Spain
[3] Ctr Recerca Matemat, Barcelona, Spain
关键词
Time series; Motif ranking; Distance modeling; Beta distribution; CLASSIFICATION;
D O I
10.1016/j.eswa.2016.02.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The detection of very similar patterns in a time series, commonly called motifs, has received continuous and increasing attention from diverse scientific communities. In particular, recent approaches for discovering similar motifs of different lengths have been proposed. In this work, we show that such variable-length similarity-based motifs cannot be directly compared, and hence ranked, by their normalized dissimilarities. Specifically, we find that length-normalized motif dissimilarities still have intrinsic dependencies on the motif length, and that lowest dissimilarities are particularly affected by this dependency. Moreover, we find that such dependencies are generally non-linear and change with the considered data set and dissimilarity measure. Based on these findings, we propose a solution to rank (previously obtained) motifs of different lengths and measure their significance. This solution relies on a compact but accurate model of the dissimilarity space, using a beta distribution with three parameters that depend on the motif length in a non-linear way. We believe the incomparability of variable-length dissimilarities could have an impact beyond the field of time series, and that similar modeling strategies as the one used here could be of help in a more broad context and in diverse application scenarios. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:452 / 460
页数:9
相关论文
共 50 条
  • [41] Reduced data similarity-based matching for time series patterns alignment
    Boucheham, Bachir
    PATTERN RECOGNITION LETTERS, 2010, 31 (07) : 629 - 638
  • [42] On Convolutional Autoencoders to Speed Up Similarity-Based Time Series Mining
    Aragao da Silva, Yuri Gabriel
    Silva, Diego Furtado
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4769 - 4778
  • [43] Similarity-Based Searching in Multi-Parameter Time Series Databases
    Lehman, L. H.
    Saeed, M.
    Moody, G. B.
    Mark, R. G.
    COMPUTERS IN CARDIOLOGY 2008, VOLS 1 AND 2, 2008, : 653 - 656
  • [44] Similarity-based visualization of time series collections: An application to analysis of streamflows
    Alencar, A. B.
    Paulovich, F. V.
    Minghim, R.
    Andrade, M. G.
    Oliveira, M. C. R.
    PROCEEDINGS OF THE 12TH INTERNATIONAL INFORMATION VISUALISATION, 2008, : 280 - 286
  • [45] Sieve bootstrap with variable-length Markov chains for stationary categorical time series - Comment
    Lahiri, SN
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (458) : 460 - 462
  • [46] Sieve bootstrap with variable-length Markov chains for stationary categorical time series -: Rejoinder
    Bühlmann, P
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (458) : 466 - 471
  • [47] Manifold Learning for Multivariate Variable-Length Sequences With an Application to Similarity Search
    Ho, Shen-Shyang
    Dai, Peng
    Rudzicz, Frank
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (06) : 1333 - 1344
  • [48] Variable-Length Multivariate Time Series Classification Using ROCKET: A Case Study of Incident Detection
    Bier, Agnieszka
    Jastrzebska, Agnieszka
    Olszewski, Pawel
    IEEE ACCESS, 2022, 10 : 95701 - 95715
  • [49] ACCURACY OF DETERMINATION OF THE TIME POSITION OF VARIABLE-LENGTH PULSES
    KHAITUN, FI
    ZAMANSKAYA, IE
    SOVIET JOURNAL OF OPTICAL TECHNOLOGY, 1980, 47 (11): : 683 - 684
  • [50] Characterization of medical time series using fuzzy similarity-based fractal dimensions
    Sarkar, M
    Leong, TY
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2003, 27 (02) : 201 - 222