AFFINITY: Efficiently Querying Statistical Measures on Time-Series Data

被引:0
|
作者
Sathe, Saket [1 ]
Aberer, Karl [1 ]
机构
[1] EPFL, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Computing statistical measures for large databases of time series is a fundamental primitive for querying and mining time-series data [1]-[6]. This primitive is gaining importance with the increasing number and rapid growth of time series databases. In this paper, we introduce a framework for efficient computation of statistical measures by exploiting the concept of affine relationships. Affine relationships can be used to infer statistical measures for time series, from other related time series, instead of computing them directly; thus, reducing the overall computational cost significantly. The resulting methods exhibit at least one order of magnitude improvement over the best known methods. To the best of our knowledge, this is the first work that presents an unified approach for computing and querying several statistical measures at once. Our approach exploits affine relationships using three key components. First, the AFCLST algorithm clusters the time-series data, such that high-quality affine relationships could be easily found. Second, the SYMEX algorithm uses the clustered time series and efficiently computes the desired affine relationships. Third, the SCAPE index structure produces a many-fold improvement in the performance of processing several statistical queries by seamlessly indexing the affine relationships. Finally, we establish the effectiveness of our approaches by performing comprehensive experimental evaluation on real datasets.
引用
收藏
页码:841 / 852
页数:12
相关论文
共 50 条
  • [1] Bounded similarity querying for time-series data
    Goldin, DQ
    Millstein, TD
    Kutlu, A
    INFORMATION AND COMPUTATION, 2004, 194 (02) : 203 - 241
  • [2] Using signature files for querying time-series data
    Andre-Jonsson, H
    Badal, DZ
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1263 : 211 - 220
  • [3] Structural Periodic Measures for Time-Series Data
    Michail Vlachos
    Philip S. Yu
    Vittorio Castelli
    Christopher Meek
    Data Mining and Knowledge Discovery, 2006, 12 : 1 - 28
  • [4] Structural periodic measures for time-series data
    Vlachos, M
    Yu, PS
    Castelli, V
    Meek, C
    DATA MINING AND KNOWLEDGE DISCOVERY, 2006, 12 (01) : 1 - 28
  • [5] Statistical methodological review for time-series data
    Rahardja, Dewi
    JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2020, 23 (08): : 1445 - 1461
  • [6] Practical Measures of Integrated Information for Time-Series Data
    Barrett, Adam B.
    Seth, Anil K.
    PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (01)
  • [7] AirExplorer: visual exploration of air quality data based on time-series querying
    Qu, Dezhan
    Lin, Xiaoli
    Ren, Ke
    Liu, Quanle
    Zhang, Huijie
    JOURNAL OF VISUALIZATION, 2020, 23 (06) : 1129 - 1145
  • [8] AirExplorer: visual exploration of air quality data based on time-series querying
    Dezhan Qu
    Xiaoli Lin
    Ke Ren
    Quanle Liu
    Huijie Zhang
    Journal of Visualization, 2020, 23 : 1129 - 1145
  • [9] Relaxed Selection Techniques for Querying Time-Series Graphs
    Holz, Christian
    Feiner, Steven
    UIST 2009: PROCEEDINGS OF THE 22ND ANNUAL ACM SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY, 2009, : 213 - 222
  • [10] Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures
    Ding, Hui
    Trajcevski, Goce
    Scheuermann, Peter
    Wang, Xiaoyue
    Keogh, Eamonn
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (02): : 1542 - 1552