Time series clustering in linear time complexity

Cited by: 9
Authors
Li, Xiaosheng [1]
Lin, Jessica [1]
Zhao, Liang [1]
Affiliations
[1] George Mason University, 4400 University Dr, Fairfax, VA 22030, USA
Keywords
Time series; Clustering; Linear time; Symbolic representation; Representation; Alignment
DOI
10.1007/s10618-021-00798-w
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the increasing power of data storage and advances in data generation and collection technologies, large volumes of time series data become available and the content is changing rapidly. This requires data mining methods to have low time complexity to handle the huge and fast-changing data. This article presents a novel time series clustering algorithm that has linear time complexity. The proposed algorithm partitions the data by checking some randomly selected symbolic patterns in the time series. We provide theoretical analysis to show that group structures in the data can be revealed from this process. We evaluate the proposed algorithm extensively on all 128 datasets from the well-known UCR time series archive, and compare with the state-of-the-art approaches with statistical analysis. The results show that the proposed method achieves better accuracy compared with other rival methods. We also conduct experiments to explore how the parameters and configuration of the algorithm can affect the final clustering results.
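The partitioning idea in the abstract (splitting the data by whether a series contains a randomly selected symbolic pattern) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a SAX-style symbolization with a four-letter alphabet, and the names `sax_word`, `patterns`, and `split_by_random_pattern`, along with the `window` and `segments` parameters, are hypothetical choices made for illustration. It also assumes `window` is divisible by `segments` and that every series is at least `window` points long.

```python
import random
import statistics

# Breakpoints that cut a standard normal distribution into 4 equiprobable
# regions (assumed alphabet size of 4; an illustrative choice).
BREAKPOINTS = (-0.6745, 0.0, 0.6745)
ALPHABET = "abcd"


def sax_word(window, segments):
    """Turn one window of values into a short symbolic word."""
    mean = statistics.fmean(window)
    std = statistics.pstdev(window)
    # Z-normalize; a constant window maps to all zeros.
    if std == 0:
        normalized = [0.0] * len(window)
    else:
        normalized = [(v - mean) / std for v in window]
    seg_len = len(window) // segments
    letters = []
    for i in range(segments):
        chunk = normalized[i * seg_len:(i + 1) * seg_len]
        avg = statistics.fmean(chunk)  # piecewise aggregate value
        idx = sum(avg > b for b in BREAKPOINTS)  # region index 0..3
        letters.append(ALPHABET[idx])
    return "".join(letters)


def patterns(series, window=8, segments=4):
    """Set of symbolic words seen in all sliding windows of one series."""
    return {
        sax_word(series[i:i + window], segments)
        for i in range(len(series) - window + 1)
    }


def split_by_random_pattern(dataset, rng, window=8, segments=4):
    """Pick one observed pattern at random and split series indices by it."""
    bags = [patterns(s, window, segments) for s in dataset]
    vocabulary = sorted(set().union(*bags))
    pattern = rng.choice(vocabulary)
    has = [i for i, bag in enumerate(bags) if pattern in bag]
    lacks = [i for i, bag in enumerate(bags) if pattern not in bag]
    return pattern, has, lacks


if __name__ == "__main__":
    rng = random.Random(0)
    data = [
        [0, 0, 0, 0, 1, 1, 1, 1, 0, 0, 0, 0, 1, 1, 1, 1],
        [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15],
        [5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5],
    ]
    pattern, has, lacks = split_by_random_pattern(data, rng)
    print(pattern, has, lacks)
```

With the window length and word length fixed, each series is scanned once, so the cost of one split grows linearly with the total number of data points. A single split like this is only one partition; how such partitions are combined into final clusters is described in the paper itself and is not reproduced here.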
Pages: 2369 - 2388
Page count: 20
Related papers
50 in total
  • [1] Time series clustering in linear time complexity
    Xiaosheng Li
    Jessica Lin
    Liang Zhao
    [J]. Data Mining and Knowledge Discovery, 2021, 35 : 2369 - 2388
  • [2] Linear Time Complexity Time Series Clustering with Symbolic Pattern Forest
    Li, Xiaosheng
    Lin, Jessica
    Zhao, Liang
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2930 - 2936
  • [3] PISD: A linear complexity distance beats dynamic time warping on time series classification and clustering
    Tran, Minh-Tuan
    Le, Xuan-May
    Huynh, Van-Nam
    Yoon, Sung-Eui
    [J]. Engineering Applications of Artificial Intelligence, 2024, 138
  • [4] Clustering time series by linear dependency
    Andrés M. Alonso
    Daniel Peña
    [J]. Statistics and Computing, 2019, 29 : 655 - 676
  • [5] Clustering time series by linear dependency
    Alonso, Andres M.
    Pena, Daniel
    [J]. STATISTICS AND COMPUTING, 2019, 29 (04) : 655 - 676
  • [6] Linear Time Complexity Time Series Classification with Bag-of-Pattern-Features
    Li, Xiaosheng
    Lin, Jessica
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 277 - 286
  • [7] A Novel Fuzzy Time Series Forecasting Model Based on Multiple Linear Regression and Time Series Clustering
    Zhang, Yanpeng
    Qu, Hua
    Wang, Weipeng
    Zhao, Jihong
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [8] pdc: An R Package for Complexity-Based Clustering of Time Series
    Brandmaier, Andreas M.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2015, 67 (05) : 1 - 23
  • [9] Enhancing Linear Time Complexity Time Series Classification with Hybrid Bag-Of-Patterns
    Liang, Shen
    Zhang, Yanchun
    Ma, Jiangang
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 717 - 735
  • [10] CLUSTERING OF TIME SERIES USING A HIERARCHICAL LINEAR DYNAMICAL SYSTEM
    Cinar, Goktug T.
    Principe, Jose C.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,