Dynamic Time Warping Under Product Quantization, With Applications to Time-Series Data Similarity Search

被引:12
|
作者
Zhang, Haowen [1 ]
Dong, Yabo [1 ]
Li, Jing [1 ]
Xu, Duanqing [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310012, Peoples R China
关键词
Time series analysis; Databases; Time measurement; Internet of Things; Search problems; Quantization (signal); Activity recognition; DTW barycenter averaging (DBA); dynamic time warping (DTW); filter-and-refine; product quantization (PQ); time series; AVERAGING METHOD; CLASSIFICATION; DISTANCE;
D O I
10.1109/JIOT.2021.3132017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The similarity search on sensor data generated by a myriad of sensing devices is a frequently encountered problem in the era of the Internet of Things (IoT). This sensor data generally appear in the form of time series, a temporally ordered sequence of real numbers obtained regularly in time. It has been widely accepted that the dynamic time warping (DTW) currently is the most prevalent similarity measure in the time-series mining community, mainly due to its flexibility and broad applicability. However, calculating DTW between two time series has quadratic time complexity, leading to unsatisfactory efficiency when performing the similarity search over the large time-series data set. The main contribution of this article is to propose a method called product quantization (PQ)-based DTW (PQDTW) for fast time-series approximate similarity search under DTW. The PQ, a well-known approximate nearest neighbor search approach, is used in PQDTW. Nevertheless, the conventional PQ is developed with the Euclidean distance and is not designed for DTW. To solve this problem, the DTW barycenter averaging (DBA) technique is utilized to adapt the PQ for DTW before using it. We employ PQDTW along with the filter-and-refine framework to efficiently and accurately perform the time-series similarity search. Our method can reasonably reduce many DTW computations in the filtering phase; thus, the query process is accelerated. We compare PQDTW with related popular algorithms using public time-series data sets. Experimental results verify that the proposal achieves the best tradeoff between query efficiency and retrieval accuracy compared to the competitors.
引用
收藏
页码:11814 / 11826
页数:13
相关论文
共 50 条
  • [21] A method for measuring similarity of time series based on series decomposition and dynamic time warping
    Zhang, Qingzhen
    Zhang, Chaoqi
    Cui, Langfu
    Han, Xiaoxuan
    Jin, Yang
    Xiang, Gang
    Shi, Yan
    [J]. APPLIED INTELLIGENCE, 2023, 53 (06) : 6448 - 6463
  • [22] Addressing Big Data Time Series: Mining Trillions of Time Series Subsequences Under Dynamic Time Warping
    Rakthanmanon, Thanawin
    Campana, Bilson
    Mueen, Abdullah
    Batista, Gustavo
    Westover, Brandon
    Zhu, Qiang
    Zakaria, Jesin
    Keogh, Eamonn
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2013, 7 (03)
  • [23] Data Augmentation with Suboptimal Warping for Time-Series Classification
    Kamycki, Krzysztof
    Kapuscinski, Tomasz
    Oszust, Mariusz
    [J]. SENSORS, 2020, 20 (01)
  • [24] Speeding up similarity search under dynamic time warping by pruning unpromising alignments
    Silva, Diego F.
    Giusti, Rafael
    Keogh, Eamonn
    Batista, Gustavo E. A. P. A.
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (04) : 988 - 1016
  • [25] Speeding up similarity search under dynamic time warping by pruning unpromising alignments
    Diego F. Silva
    Rafael Giusti
    Eamonn Keogh
    Gustavo E. A. P. A. Batista
    [J]. Data Mining and Knowledge Discovery, 2018, 32 : 988 - 1016
  • [26] Multivariate Time Series Similarity Measure Based on Weighted Dynamic Time Warping
    Zhao, Jinhui
    Ju, Rusheng
    Xie, Xu
    Ye, Yanqing
    [J]. PROCEEDINGS OF 2020 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MACHINE VISION AND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND MACHINE LEARNING, IPMV 2020, 2020, : 173 - 179
  • [27] Applying trie-structure to improve dynamic time warping on time-series stock data analysis
    Chen, AP
    Chen, YC
    Yeh, CY
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, VOLS 1AND 2, 2004, : 31 - 36
  • [28] Bounded similarity querying for time-series data
    Goldin, DQ
    Millstein, TD
    Kutlu, A
    [J]. INFORMATION AND COMPUTATION, 2004, 194 (02) : 203 - 241
  • [29] Optimizing dynamic time warping’s window width for time series data mining applications
    Hoang Anh Dau
    Diego Furtado Silva
    François Petitjean
    Germain Forestier
    Anthony Bagnall
    Abdullah Mueen
    Eamonn Keogh
    [J]. Data Mining and Knowledge Discovery, 2018, 32 : 1074 - 1120
  • [30] Optimizing dynamic time warping's window width for time series data mining applications
    Hoang Anh Dau
    Silva, Diego Furtado
    Petitjean, Francois
    Forestier, Germain
    Bagnall, Anthony
    Mueen, Abdullah
    Keogh, Eamonn
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (04) : 1074 - 1120