A global averaging method for dynamic time warping, with applications to clustering

被引:648
|
作者
Petitjean, Francois [1 ,2 ,3 ]
Ketterlin, Alain [1 ,2 ]
Gancarski, Pierre [1 ,2 ]
机构
[1] LSIIT, UMR 7005, Pole API, F-67412 Illkirch Graffenstaden, France
[2] Univ Strasbourg, F-67084 Strasbourg, France
[3] Ctr Natl Etud Spatiales, F-31401 Toulouse 9, France
关键词
Sequence analysis; Time series clustering; Dynamic time warping; Distance-based clustering; Time series averaging; DTW barycenter averaging; Global averaging; Satellite image time series; SEQUENCE; SERIES; ALIGNMENT;
D O I
10.1016/j.patcog.2010.09.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining sequential data is an old topic that has been revived in the last decade, due to the increasing availability of sequential datasets. Most works in this field are centred on the definition and use of a distance (or, at least, a similarity measure) between sequences of elements. A measure called dynamic time warping (DTW) seems to be currently the most relevant for a large panel of applications. This article is about the use of DTW in data mining algorithms, and focuses on the computation of an average of a set of sequences. Averaging is an essential tool for the analysis of data. For example, the K-MEANS clustering algorithm repeatedly computes such an average, and needs to provide a description of the clusters it forms. Averaging is here a crucial step, which must be sound in order to make algorithms work accurately. When dealing with sequences, especially when sequences are compared with DTW, averaging is not a trivial task. Starting with existing techniques developed around DTW, the article suggests an analysis framework to classify averaging techniques. It then proceeds to study the two major questions lifted by the framework. First, we develop a global technique for averaging a set of sequences. This technique is original in that it avoids using iterative pairwise averaging. It is thus insensitive to ordering effects. Second, we describe a new strategy to reduce the length of the resulting average sequence. This has a favourable impact on performance, but also on the relevance of the results. Both aspects are evaluated on standard datasets, and the evaluation shows that they compare favourably with existing methods. The article ends by describing the use of averaging in clustering. The last section also introduces a new application domain, namely the analysis of satellite image time series, where data mining techniques provide an original approach. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:678 / 693
页数:16
相关论文
共 50 条
  • [1] Adaptive Global Time Sequence Averaging Method Using Dynamic Time Warping
    Liu, Yu-Tao
    Zhang, Yong-An
    Zeng, Ming
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (08) : 2129 - 2142
  • [2] Clustering time series with Granular Dynamic Time Warping method
    Yu, Fusheng
    Dong, Keqiang
    Chen, Fei
    Jiang, Yongke
    Zeng, Wenyi
    [J]. GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 393 - +
  • [3] Grey Incidence Clustering Method Based on Dynamic Time Warping
    Dai, Jin
    Yan, Yi
    Hui, Feng
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 1675 - 1680
  • [4] Inaccuracies of shape averaging method using dynamic time warping for time series data
    Niennattrakul, Vit
    Ratanamahatana, Chotirat Ann
    [J]. COMPUTATIONAL SCIENCE - ICCS 2007, PT 1, PROCEEDINGS, 2007, 4487 : 513 - +
  • [5] Clustering Paths With Dynamic Time Warping
    Koschke, Rainer
    Steinbeck, Marcel
    [J]. EIGHTH IEEE WORKING CONFERENCE ON SOFTWARE VISUALIZATION (VISSOFT 2020), 2020, : 89 - 99
  • [6] Time Series Clustering Based on Dynamic Time Warping
    Wang, Weizeng
    Lyu, Gaofan
    Shi, Yuliang
    Liang, Xun
    [J]. PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 487 - 490
  • [7] Averaging Methods using Dynamic Time Warping for Time Series Classification
    Datta, Shreyasi
    Karmakar, Chandan K.
    Palaniswami, Marimuthu
    [J]. 2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 2794 - 2798
  • [8] Revisiting inaccuracies of time series averaging under dynamic time warping
    Jain, Brijnesh
    [J]. PATTERN RECOGNITION LETTERS, 2019, 125 : 418 - 424
  • [9] Grey incidence clustering method based on multidimensional dynamic time warping distance
    Dai, Jin
    Yan, Yi
    He, Yuhong
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2017, 28 (05) : 946 - 954
  • [10] Grey incidence clustering method based on multidimensional dynamic time warping distance
    Jin Dai
    Yi Yan
    Yuhong He
    [J]. Journal of Systems Engineering and Electronics, 2017, 28 (05) : 946 - 954