Clustering Time Series with k-Medoids Based Algorithms

被引:0
|
作者
Holder, Christopher [1 ]
Guijo-Rubio, David [1 ,2 ]
Bagnall, Anthony [1 ]
机构
[1] Univ East Anglia, Sch Comp Sci, Norwich NR4 7TQ, Norfolk, England
[2] Univ Cordoba, Dept Comp Sci, Cordoba, Spain
基金
英国工程与自然科学研究理事会;
关键词
Time series; clustering; k-means; k-medoids; PAM; UCR archive; STATISTICAL COMPARISONS; CLASSIFIERS;
D O I
10.1007/978-3-031-49896-1_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time Series Clustering (TSCL) involves grouping unlabelled time series into homogeneous groups. A popular approach to TSCL is to use the partitional clustering algorithms k-means or k-medoids in conjunction with an elastic distance function such as Dynamic Time Warping (DTW). We explore TSCL using nine different elastic distance measures. Both partitional algorithms characterise clusters with an exemplar series, but use different techniques to do so: k-means uses an averaging algorithm to find an exemplar, whereas k-medoids chooses a training case (medoid). Traditionally, the arithmetic mean of a collection of time series was used with k-means. However, this ignores any offset. In 2011, an averaging technique specific to DTW, called DTW Barycentre Averaging (DBA), was proposed. Since, k-means with DBA has been the algorithm of choice for the majority of partition-based TSCL and much of the research using medoids-based approaches for TSCL stopped. We revisit k-medoids based TSCL with a range of elastic distance measures. Our results show k-medoids approaches are significantly better than k-means on a standard test suite, independent of the elastic distance measure used. We also compare the most commonly used alternating k-medoids approach against the Partition Around Medoids (PAM) algorithm. PAM significantly outperforms the default k-medoids for all nine elastic measures used. Additionally, we evaluate six variants of PAM designed to speed up TSCL. Finally, we show PAM with the best elastic distance measure is significantly better than popular alternative TSCL algorithms, including the k-means DBA approach, and competitive with the best deep learning algorithms.
引用
收藏
页码:39 / 55
页数:17
相关论文
共 50 条
  • [1] Application of the k-medoids Partitioning Algorithm for Clustering of Time Series Data
    Radovanovic, Ana
    Ye, Xinlin
    Milanovic, Jovica, V
    Milosavljevic, Nina
    Storchi, Riccardo
    [J]. 2020 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE (ISGT-EUROPE 2020): SMART GRIDS: KEY ENABLERS OF A GREEN POWER SYSTEM, 2020, : 645 - 649
  • [2] Spatial Clustering with Obstacles Constraints Based on Genetic Algorithms and K-Medoids
    Zhang, Xueping
    Wang, Jiayao
    Wu, Fang
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (10): : 109 - 114
  • [3] K-medoids Clustering Based on MapReduce and Optimal Search of Medoids
    Zhu, Ying-ting
    Wang, Fu-zhang
    Shan, Xing-hua
    Lv, Xiao-yan
    [J]. 2014 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2014), 2014, : 573 - 577
  • [4] Comparison between K-Means and K-Medoids Clustering Algorithms
    Madhulatha, Tagaram Soni
    [J]. ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, 2011, 198 : 472 - 481
  • [5] A novel spatial clustering with obstacles constraints based on genetic algorithms and K-Medoids
    Zhang, Xueping
    Wang, Jiayao
    Wu, Fang
    Fan, Zhongshan
    Li, Xiaoqing
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, 2006, : 605 - 610
  • [6] A K-medoids Based Clustering Scheme with an Application to Document Clustering
    Onan, Aytug
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 354 - 359
  • [7] An Efficient Implementation of Anytime K-medoids Clustering for Time Series under Dynamic Time Warping
    Van The Huy
    Duong Tuan Anh
    [J]. PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 22 - 29
  • [8] Parallel K-Medoids Clustering Algorithm Based on Hadoop
    Jiang, Yaobin
    Zhang, Jiongmin
    [J]. 2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 649 - 652
  • [9] A genetic k-medoids clustering algorithm
    Weiguo Sheng
    Xiaohui Liu
    [J]. Journal of Heuristics, 2006, 12 : 447 - 466
  • [10] Convex fuzzy k-medoids clustering
    Pinheiro, Daniel N.
    Aloise, Daniel
    Blanchard, Simon J.
    [J]. FUZZY SETS AND SYSTEMS, 2020, 389 : 66 - 92