Parallelization of Searching and Mining Time Series Data using Dynamic Time Warping

被引:0
|
作者
Shabib, Ahmed [1 ]
Narang, Anish [1 ]
Niddodi, Chaitra Prasad [1 ]
Das, Madhura [1 ]
Pradeep, Rachita [1 ]
Shenoy, Varun [1 ]
Auradkar, Prafullata [1 ]
Vignesh, T. S. [1 ]
Sitaram, Dinkar [1 ]
机构
[1] PES Inst Technol, Dept Comp Sci & Engn, Bangalore, Karnataka, India
关键词
Time series; Dynamic time warping; Multicore; Spark;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Among the various algorithms present for data mining, the UCR Dynamic Time Warping (DTW) suite provided a solution to search and mine large data sets of time series data more efficiently as compared to the previously existing method of using Euclidean Distance. The UCR DTW algorithm was developed for a single CPU core. In this paper, we consider 2 methods of parallelizing the DTW algorithm. First, we consider a multi-core implementation, followed by a cluster implementation using Spark. From the multi-core implementation, we achieve nearly linear speedup. In the Spark implementation, we find that a straightforward implementation of DTW does not perform well. This is because; a major step in DTW is parallel computation of a lower bound. This paradigm is not supported well by Spark, which supports (i) broadcast variables that are broadcasts of read-only variables (ii) accumulation variables that represent distributed sums. We show how to compute distributed lower bounds efficiently in Spark and achieve nearly linear speedup with DTW in a Spark computation as well.
引用
收藏
页码:343 / 348
页数:6
相关论文
共 50 条
  • [1] On-line and dynamic time warping for time series data mining
    Hailin Li
    International Journal of Machine Learning and Cybernetics, 2015, 6 : 145 - 153
  • [2] On-line and dynamic time warping for time series data mining
    Li, Hailin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (01) : 145 - 153
  • [3] Time works well: Dynamic time warping based on time weighting for time series data mining
    Li, Hailin
    INFORMATION SCIENCES, 2021, 547 : 592 - 608
  • [4] Dynamic time warping based on cubic spline interpolation for time series data mining
    Li, Hailin
    Wan, Xiaoji
    Liang, Ye
    Gao, Shile
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 19 - 26
  • [5] Addressing Big Data Time Series: Mining Trillions of Time Series Subsequences Under Dynamic Time Warping
    Rakthanmanon, Thanawin
    Campana, Bilson
    Mueen, Abdullah
    Batista, Gustavo
    Westover, Brandon
    Zhu, Qiang
    Zakaria, Jesin
    Keogh, Eamonn
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2013, 7 (03)
  • [6] Embedding of time series data by using Dynamic Time Warping distances
    Graduate School of Information Sciences, Hiroshima City University, Hiroshima, 731-3194, Japan
    不详
    不详
    不详
    不详
    不详
    不详
    不详
    不详
    不详
    Syst Comput Jpn, 2006, 3 (1-9):
  • [7] Optimizing dynamic time warping’s window width for time series data mining applications
    Hoang Anh Dau
    Diego Furtado Silva
    François Petitjean
    Germain Forestier
    Anthony Bagnall
    Abdullah Mueen
    Eamonn Keogh
    Data Mining and Knowledge Discovery, 2018, 32 : 1074 - 1120
  • [8] Optimizing dynamic time warping's window width for time series data mining applications
    Hoang Anh Dau
    Silva, Diego Furtado
    Petitjean, Francois
    Forestier, Germain
    Bagnall, Anthony
    Mueen, Abdullah
    Keogh, Eamonn
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (04) : 1074 - 1120
  • [9] A local segmented dynamic time warping distance measure algorithm for time series data mining
    Dong, Xiao-Li
    Gu, Cheng-Kui
    Wang, Zheng-Ou
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1247 - +
  • [10] Fuzzy clustering of time series data using dynamic time warping distance
    Izakian, Hesam
    Pedrycz, Witold
    Jamal, Iqbal
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 39 : 235 - 244