k-Shape: Efficient and Accurate Clustering of Time Series

Cited by: 425
Authors
Paparrizos, John [1]
Gravano, Luis [1]
Affiliations
[1] Columbia Univ, New York, NY 10027 USA
Keywords
ALGORITHM; REPRESENTATION;
DOI
10.1145/2723372.2737793
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The proliferation and ubiquity of temporal data across many disciplines has generated substantial interest in the analysis and mining of time series. Clustering is one of the most popular data mining methods, not only due to its exploratory power, but also as a preprocessing step or subroutine for other techniques. In this paper, we present k-Shape, a novel algorithm for time-series clustering. k-Shape relies on a scalable iterative refinement procedure, which creates homogeneous and well-separated clusters. As its distance measure, k-Shape uses a normalized version of the cross-correlation measure in order to consider the shapes of time series while comparing them. Based on the properties of that distance measure, we develop a method to compute cluster centroids, which are used in every iteration to update the assignment of time series to clusters. To demonstrate the robustness of k-Shape, we perform an extensive experimental evaluation of our approach against partitional, hierarchical, and spectral clustering methods, with combinations of the most competitive distance measures. k-Shape outperforms all scalable approaches in terms of accuracy. Furthermore, k-Shape also outperforms all non-scalable (and hence impractical) combinations, with one exception that achieves similar accuracy results. However, unlike k-Shape, this combination requires tuning of its distance measure and is two orders of magnitude slower than k-Shape. Overall, k-Shape emerges as a domain-independent, highly accurate, and highly efficient clustering approach for time series with broad applications.
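The distance measure mentioned in the abstract can be illustrated with a minimal Python sketch. It assumes the common formulation in which the cross-correlation over all shifts is divided by the product of the two series' Euclidean norms, and the distance is one minus the maximum of that normalized sequence. The function names `z_normalize` and `shape_based_distance` are hypothetical, and the direct `numpy.correlate` call stands in for whatever faster computation an actual implementation may use; this is not the authors' code.

```python
import numpy as np


def z_normalize(series):
    """Rescale a series to zero mean and unit variance (assumed preprocessing)."""
    series = np.asarray(series, dtype=float)
    std = series.std()
    if std == 0:
        # A constant series has no shape; return all zeros instead of dividing by zero.
        return np.zeros_like(series)
    return (series - series.mean()) / std


def shape_based_distance(x, y):
    """One minus the maximum normalized cross-correlation over all shifts.

    Values lie in [0, 2]; 0 means the two series align perfectly at some shift.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    denom = np.linalg.norm(x) * np.linalg.norm(y)
    if denom == 0:
        # Convention chosen for this sketch: an all-zero series is treated as uncorrelated.
        return 1.0
    # mode="full" evaluates the cross-correlation at every possible shift.
    cross_corr = np.correlate(x, y, mode="full")
    return 1.0 - cross_corr.max() / denom


if __name__ == "__main__":
    pulse = [0, 0, 1, 2, 1, 0, 0, 0]
    shifted_pulse = [0, 0, 0, 0, 1, 2, 1, 0]
    # The same pulse at a different offset should give a distance close to 0.
    print(shape_based_distance(pulse, shifted_pulse))
```

Because the maximum is taken over all shifts, two series with the same shape at different offsets are treated as near-identical, which is the behavior the abstract attributes to comparing shapes rather than point-by-point values.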
Pages: 1855 - 1870
Page count: 16
Related papers
50 in total
  • [1] k-Shape: Efficient and Accurate Clustering of Time Series
    Paparrizos, John
    Gravano, Luis
    [J]. SIGMOD RECORD, 2016, 45 (01) : 69 - 76
  • [2] Technical Perspective - k-Shape: Efficient and Accurate Clustering of Time Series
    Ives, Zachary G.
    [J]. SIGMOD RECORD, 2016, 45 (01) : 68 - 68
  • [3] Accelerating k-Shape Time Series Clustering Algorithm Using GPU
    Wang, Xun
    Song, Ruibao
    Xiao, Junmin
    Li, Tong
    Li, Xueqi
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (10) : 2718 - 2734
  • [4] Fuzzy Classification for Time Series Data Based on K-Shape
    Li, Hailin
    Jia, Ruiying
    Tan, Guanyin
    [J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2021, 50 (06) : 899 - 906
  • [5] Improving Load Forecasting Based on Deep Learning and K-shape Clustering
    Fahiman, Fateme
    Erfani, Sarah M.
    Rajasegarar, Sutharshan
    Palaniswami, Marimuthu
    Leckie, Christopher
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4134 - 4141
  • [6] Shape-based clustering of synthetic Stokes profiles using k-means and k-Shape
    Moe, Thore E.
    Pereira, Tiago M. D.
    Calvo, Flavio
    Leenaarts, Jorrit
    [J]. ASTRONOMY & ASTROPHYSICS, 2023, 675
  • [7] Harmonic Pollution Zoning Method Based on Improved k-Shape Clustering
    Zhang, Min
    Wang, Jinhao
    Chang, Xiao
    Gao, Le
    Guo, Xiangyu
    Tang, Wenchu
    Wang, Hanwen
    [J]. IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2024,
  • [8] k-Shape clustering for extracting macro-patterns in intracranial pressure signals
    Martinez-Tejada, Isabel
    Riedel, Casper Schwartz
    Juhler, Marianne
    Andresen, Morten
    Wilhjelm, Jens E.
    [J]. FLUIDS AND BARRIERS OF THE CNS, 2022, 19 (01)
  • [9] Spatial-temporal short-term load forecasting framework via K-shape time series clustering method and graph convolutional networks
    Wu, Zeqing
    Mu, Yunfei
    Deng, Shuai
    Li, Yang
    [J]. ENERGY REPORTS, 2022, 8 : 8752 - 8766
  • [10] Improving Aggregated Load Forecasting Using Evidence Accumulation k-Shape Clustering
    Zhang, Yufan
    Liu, Yuquan
    Yu, Zhiwen
    Xiong, Wen
    Wang, Li
    Ai, Qian
    Li, Zhaoyu
    Huang, Kaiyi
    Hao, Ran
    Jiang, Ziqing
    [J]. 2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,