Time series clustering in linear time complexity

Cited by: 9
Authors
Li, Xiaosheng [1]
Lin, Jessica [1]
Zhao, Liang [1]
Affiliations
[1] George Mason Univ, 4400 Univ Dr, Fairfax, VA 22030 USA
Keywords
Time series; Clustering; Linear time; Symbolic representation; Representation; Alignment
DOI
10.1007/s10618-021-00798-w
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the increasing power of data storage and advances in data generation and collection technologies, large volumes of time series data have become available, and their content changes rapidly. Data mining methods therefore need low time complexity to handle such large, fast-changing data. This article presents a novel time series clustering algorithm that has linear time complexity. The proposed algorithm partitions the data by checking randomly selected symbolic patterns in the time series. We provide theoretical analysis showing that this process can reveal group structures in the data. We evaluate the proposed algorithm extensively on all 128 datasets from the well-known UCR time series archive and compare it with state-of-the-art approaches using statistical analysis. The results show that the proposed method achieves better accuracy than the rival methods. We also conduct experiments to explore how the parameters and configuration of the algorithm affect the final clustering results.
Pages: 2369-2388
Page count: 20