Optimizing dynamic time warping's window width for time series data mining applications

Cited by: 52
Authors
Dau, Hoang Anh [1]
Silva, Diego Furtado [2]
Petitjean, Francois [3]
Forestier, Germain [4]
Bagnall, Anthony [5]
Mueen, Abdullah [6]
Keogh, Eamonn [1]
Affiliations
[1] Univ Calif Riverside, Riverside, CA 92521 USA
[2] Univ Fed Sao Carlos, Sao Carlos, SP, Brazil
[3] Monash Univ, Melbourne, Vic, Australia
[4] Univ Haute Alsace, Mulhouse, France
[5] Univ East Anglia, Norwich, Norfolk, England
[6] Univ New Mexico, Albuquerque, NM 87131 USA
Funding
Australian Research Council; UK Engineering and Physical Sciences Research Council
Keywords
Time series; Clustering; Classification; Dynamic time warping; Semi-supervised learning
DOI
10.1007/s10618-018-0565-y
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Dynamic Time Warping (DTW) is a highly competitive distance measure for most time series data mining problems. Obtaining the best performance from DTW requires setting its only parameter, the maximum amount of warping (w). In the supervised case with ample data, w is typically set by cross-validation in the training stage. However, this method is likely to yield suboptimal results for small training sets. For the unsupervised case, learning via cross-validation is not possible because we do not have access to labeled data. Many practitioners have thus resorted to assuming that "the larger the better", and they use the largest value of w permitted by the computational resources. However, as we will show, in most circumstances this is a naive approach that produces inferior clusterings. Moreover, the best warping window width is generally non-transferable between the two tasks, i.e., for a single dataset, practitioners cannot simply apply the best w learned for classification to clustering or vice versa. In addition, we will demonstrate that the appropriate amount of warping depends not only on the data structure, but also on the dataset size. Thus, even if a practitioner knows the best setting for a given dataset, they will likely be at a loss if they apply that setting to a larger version of that data. All these issues seem largely unknown or at least unappreciated in the community. In this work, we demonstrate the importance of setting DTW's warping window width correctly, and we also propose novel methods to learn this parameter in both supervised and unsupervised settings. The algorithms we propose to learn w can produce significant improvements in classification accuracy and clustering quality. We demonstrate the correctness of our novel observations and the utility of our ideas by testing them with more than one hundred publicly available datasets. Our forceful results allow us to make a perhaps unexpected claim: an underappreciated "low hanging fruit" in optimizing DTW's performance can produce improvements that make it an even stronger baseline, closing most or all of the improvement gap of the more sophisticated methods proposed in recent years.
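To make the role of w concrete, here is a minimal sketch of the conventional baseline the abstract mentions: choosing w by leave-one-out cross-validation with a 1-nearest-neighbor classifier. It is not the novel method proposed in the paper. The function names (`dtw_distance`, `loocv_accuracy`, `select_window`) are hypothetical, and the squared-difference point cost, the Sakoe-Chiba style band (|i - j| <= w), and the tie-breaking rule are assumptions made for illustration.

```python
import math


def dtw_distance(a, b, w):
    """DTW distance between two sequences, constrained to a band |i - j| <= w.

    Assumption: squared difference as the point-wise cost, square root at the end.
    With w = 0 and equal lengths this reduces to the Euclidean distance.
    """
    n, m = len(a), len(b)
    # A warping path can only exist if the band covers the length difference.
    w = max(w, abs(n - m))
    prev = [math.inf] * (m + 1)
    prev[0] = 0.0
    for i in range(1, n + 1):
        curr = [math.inf] * (m + 1)
        # Only cells inside the band are filled; the rest stay at infinity.
        for j in range(max(1, i - w), min(m, i + w) + 1):
            cost = (a[i - 1] - b[j - 1]) ** 2
            curr[j] = cost + min(prev[j], prev[j - 1], curr[j - 1])
        prev = curr
    return math.sqrt(prev[m])


def loocv_accuracy(series, labels, w):
    """Leave-one-out accuracy of a 1-nearest-neighbor classifier under DTW with window w."""
    correct = 0
    for i, query in enumerate(series):
        best_dist, best_label = math.inf, None
        for j, candidate in enumerate(series):
            if i == j:
                continue  # hold out the query itself
            d = dtw_distance(query, candidate, w)
            if d < best_dist:
                best_dist, best_label = d, labels[j]
        correct += best_label == labels[i]
    return correct / len(series)


def select_window(series, labels, candidate_ws):
    """Return (w, accuracy) with the highest leave-one-out accuracy.

    Assumption: ties are broken in favor of the smallest w.
    """
    best_w, best_acc = None, -1.0
    for w in sorted(candidate_ws):
        acc = loocv_accuracy(series, labels, w)
        if acc > best_acc:
            best_w, best_acc = w, acc
    return best_w, best_acc


# Toy data: two classes that differ in peak height, with the peak shifted by one step.
train = [
    [0, 0, 1, 0, 0, 0], [0, 0, 0, 1, 0, 0],  # class "low"
    [0, 0, 2, 0, 0, 0], [0, 0, 0, 2, 0, 0],  # class "high"
]
train_labels = ["low", "low", "high", "high"]
best_w, best_acc = select_window(train, train_labels, [0, 1, 2])
print(best_w, best_acc)
```

On this toy set, no warping (w = 0) pairs each series with the wrong class because the shifted peaks do not line up, while a window of one step is enough to align them. With only a handful of labeled series, as the abstract cautions, an estimate of this kind can be unreliable.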
Pages: 1074-1120
Page count: 47
Related papers
50 in total
  • [41] Speed up dynamic time warping of multivariate time series
    Li, Zhengxin
    Zhang, Fengming
    Nie, Feiping
    Li, Hailin
    Wang, Jian
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (03) : 2593 - 2603
  • [42] Correlation based dynamic time warping of multivariate time series
    Banko, Zoltan
    Abonyi, Janos
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (17) : 12814 - 12823
  • [43] A Scalable Segmented Dynamic Time Warping for Time Series Classification
    Ma, Ruizhe
    Ahmadzadeh, Azim
    Boubrahimi, Soukaina Filali
    Angryk, Rafal A.
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2019, PT II, 2019, 11509 : 407 - 419
  • [44] Applications of Data Mining to Time Series of Electrical Disturbance Data
    Cornforth, David
    [J]. 2009 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, VOLS 1-8, 2009, : 2179 - 2186
  • [45] Data mining time series with applications to crime analysis
    Brown, DE
    Oxford, RB
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 1453 - 1458
  • [46] Multivariate Time Series Data Clustering Method Based on Dynamic Time Warping and Affinity Propagation
    Wan, Xiaoji
    Li, Hailin
    Zhang, Liping
    Wu, Yenchun Jim
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [47] On clustering multimedia time series data using K-means and dynamic time warping
    Niennattrakul, Vit
    Ratanamahatana, Chotirat Ann
    [J]. MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 733 - +
  • [48] Electrocardiogram data mining based on frame classification by dynamic time warping matching
    Zhang, Gong
    Kinsner, Witold
    Huang, Bin
    [J]. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2009, 12 (06) : 701 - 707
  • [49] Subsequence Partition of Time Series based on Different Width Time Window
    Guo, Hongwei
    Zhang, Jianliang
    Liu, Yanchi
    Gao, Xuedong
    [J]. 2010 INTERNATIONAL CONFERENCE ON E-EDUCATION, E-BUSINESS, E-MANAGEMENT AND E-LEARNING: IC4E 2010, PROCEEDINGS, 2010, : 57 - 60
  • [50] Adaptively constrained dynamic time warping for time series classification and clustering
    Li, Huanhuan
    Liu, Jingxian
    Yang, Zaili
    Liu, Ryan Wen
    Wu, Kefeng
    Wan, Yuan
    [J]. INFORMATION SCIENCES, 2020, 534 : 97 - 116