ULISSE: ULtra compact Index for Variable-Length Similarity SEarch in Data Series

被引:9
|
作者
Linardi, Michele [1 ]
Palpanas, Themis [1 ]
机构
[1] Paris Descartes Univ, LIPADE, Paris, France
关键词
D O I
10.1109/ICDE.2018.00149
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data series similarity search is an important operation and at the core of several analysis tasks and applications related to data series collections. Despite the fact that data series indexes enable fast similarity search, all existing indexes can only answer queries of a single length (fixed at index construction time), which is a severe limitation. In this work, we propose ULISSE, the first data series index structure designed for answering similarity search queries of variable length. Our contribution is two-fold. First, we introduce a novel representation technique, which effectively and succinctly summarizes multiple sequences of different length. Based on the proposed index, we describe efficient algorithms for approximate and exact similarity search, combining disk based index visits and in-memory sequential scans. We experimentally evaluate our approach using several synthetic and real datasets. The results show that ULISSE is several times (and up to orders of magnitude) more efficient in terms of both space and time cost, when compared to competing approaches.
引用
收藏
页码:1356 / 1359
页数:4
相关论文
共 50 条
  • [21] A compact multi-resolution index for variable length queries in time series databases
    Srividya Kadiyala
    Nematollaah Shiri
    Knowledge and Information Systems, 2008, 15 : 131 - 147
  • [22] A compact multi-resolution index for variable length queries in time series databases
    Kadiyala, Srividya
    Shiri, Nematollaah
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 15 (02) : 131 - 147
  • [23] Exploring variable-length time series motifs in one hundred million length scale
    Yifeng Gao
    Jessica Lin
    Data Mining and Knowledge Discovery, 2018, 32 : 1200 - 1228
  • [24] Exploring variable-length time series motifs in one hundred million length scale
    Gao, Yifeng
    Lin, Jessica
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (05) : 1200 - 1228
  • [25] GrammarViz 3.0: Interactive Discovery of Variable-Length Time Series Patterns
    Senin, Pavel
    Lin, Jessica
    Wang, Xing
    Oates, Tim
    Gandhi, Sunil
    Boedihardjo, Arnold P.
    Chen, Crystal
    Frankenstein, Susan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (01)
  • [26] Modelling of chaotic time series using a variable-length windowing approach
    Tekbas, ÖH
    CHAOS SOLITONS & FRACTALS, 2006, 29 (02) : 277 - 281
  • [27] A Variable-Length Network Encoding Protocol for Big Genomic Data
    Aledhari, Mohammed
    Hefeida, Mohamed S.
    Saeed, Fahad
    WIRED/WIRELESS INTERNET COMMUNICATIONS, WWIC 2016, 2016, 9674 : 212 - 224
  • [29] A framework for discovering variable-length motifs in medical data streams
    Sun, Le
    He, Jinyuan
    Wang, Chen
    Ma, Jiangang
    Dong, Hai
    Zhang, Yanchun
    Journal of Computers (Taiwan), 2019, 30 (01): : 105 - 113
  • [30] Analysis of Variable-Length Codes for Integer Encoding in Hyperspectral Data Compression with thek2-Raster Compact Data Structure
    Chow, Kevin
    Tzamarias, Dion Eustathios Olivier
    Hernandez-Cabronero, Miguel
    Blanes, Ian
    Serra-Sagrista, Joan
    REMOTE SENSING, 2020, 12 (12)