Speed Up of Index Creation for Time-Series Similarity Search with Handling Distortions

被引:0
|
作者
Gil, Myeong-Seon [1 ]
Kim, Bum-Soo [1 ]
Moon, Yang-Sae [1 ]
Choi, Mi-Jung [1 ]
机构
[1] Kangwon Natl Univ, Dept Comp Sceince, Chunchon 200701, Kangwon, South Korea
关键词
Time-series databases; Data mining; Subsequence matching; Distortion-free time-series; Index construction; SUBSEQUENCE MATCHING ALGORITHM; MOVING AVERAGE TRANSFORM; DATABASES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we address the problem of constructing a multidimensional index for time-series similarity search with handling distortions, called distortion-free subsequence matching. A naive algorithm for index construction in distortion-free subsequence matching is a very time-consuming process since it generates a huge number of data subsequences to consider all possible positions and all possible query lengths. In this paper, we formally analyze the index construction step and discuss how to improve the performance of each step. To improve the performance, we present a concept of DF-bucket, which stores the intermediate results and reuse them repeatedly in the next steps. We also present a novel notion of store-and-reuse principle, and using the principle we build a multidimensional index much faster than a naive algorithm.
引用
收藏
页码:401 / 408
页数:8
相关论文
共 50 条
  • [31] A Review of Missing Values Handling Methods on Time-Series Data
    Pratama, Irfan
    Permanasari, Adhistya Erna
    Ardiyanto, Igi
    Indrayani, Rini
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2016,
  • [32] Research on Similarity Matching for Multiple Granularities Time-Series Data
    Hao, Wenning
    Zhao, Enlai
    Zhang, Hongjun
    Chen, Gang
    Jin, Dawei
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 549 - 556
  • [33] ALoT: A Time-Series Similarity Measure Based on Alignment of Textures
    Ogul, Hasan
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 576 - 585
  • [34] Dictionary-Based Compression for Long Time-Series Similarity
    Lang, Willis
    Morse, Michael
    Patel, Jignesh M.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (11) : 1609 - 1622
  • [35] Spectral similarity for analysis of DNA microarray time-series data
    Yan, Hong
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2006, 1 (02) : 150 - 161
  • [36] A Multidimensional Time-Series Similarity Measure With Applications to Eldercare Monitoring
    Hajihashemi, Zahra
    Popescu, Mihail
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2016, 20 (03) : 953 - 962
  • [37] A Time-series Similarity Method for QRS Morphology Variation Analysis
    Wang, Liping
    Yao, Junjie
    Zhang, Wenjie
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 419 - 426
  • [38] The influence of global constraints on similarity measures for time-series databases
    Kurbalija, Vladimir
    Radovanovic, Milos
    Geler, Zoltan
    Ivanovic, Mirjana
    KNOWLEDGE-BASED SYSTEMS, 2014, 56 : 49 - 67
  • [39] Parameter-Free Search of Time-Series Discord
    Luo, Wei
    Gallagher, Marcus
    Wiles, Janet
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2013, 28 (02) : 300 - 310
  • [40] TIME-SERIES ANALYSIS OF VISUAL-SEARCH STRATEGIES
    BROGAN, D
    PERCEPTION, 1984, 13 (01) : A28 - A28