Time-Series Clustering Based on the Characterization of Segment Typologies

被引:26
|
作者
Guijo-Rubio, David [1 ]
Manuel Duran-Rosal, Antonio [2 ]
Antonio Gutierrez, Pedro [1 ]
Troncoso, Alicia [3 ]
Hervas-Martinez, Cesar [1 ]
机构
[1] Univ Cordoba, Dept Comp Sci & Numer Anal, Cordoba 14071, Spain
[2] Univ Loyola Andalucia, Dept Quantitat Methods, Cordoba 14004, Spain
[3] Univ Pablo Olavide, Dept Comp Languages & Syst, Seville 41013, Spain
关键词
Time series analysis; Hidden Markov models; Clustering algorithms; Time measurement; Autoregressive processes; Data mining; Proposals; feature extraction; segmentation; time-series clustering; AVERAGING METHOD; DISTANCE;
D O I
10.1109/TCYB.2019.2962584
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Time-series clustering is the process of grouping time series with respect to their similarity or characteristics. Previous approaches usually combine a specific distance measure for time series and a standard clustering method. However, these approaches do not take the similarity of the different subsequences of each time series into account, which can be used to better compare the time-series objects of the dataset. In this article, we propose a novel technique of time-series clustering consisting of two clustering stages. In a first step, a least-squares polynomial segmentation procedure is applied to each time series, which is based on a growing window technique that returns different-length segments. Then, all of the segments are projected into the same dimensional space, based on the coefficients of the model that approximates the segment and a set of statistical features. After mapping, a first hierarchical clustering phase is applied to all mapped segments, returning groups of segments for each time series. These clusters are used to represent all time series in the same dimensional space, after defining another specific mapping process. In a second and final clustering stage, all the time-series objects are grouped. We consider internal clustering quality to automatically adjust the main parameter of the algorithm, which is an error threshold for the segmentation. The results obtained on 84 datasets from the UCR Time Series Classification Archive have been compared against three state-of-the-art methods, showing that the performance of this methodology is very promising, especially on larger datasets.
引用
收藏
页码:5409 / 5422
页数:14
相关论文
共 50 条
  • [1] Fuzzy clustering based segmentation of time-series
    Abonyi, J
    Feil, B
    Nemeth, S
    Arva, P
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS V, 2003, 2810 : 275 - 285
  • [2] A methodology for index tracking based on time-series clustering
    Focardi, SM
    Fabozzi, FJ
    [J]. QUANTITATIVE FINANCE, 2004, 4 (04) : 417 - 425
  • [3] Mobile Networks Classification Based on Time-Series Clustering
    Lu, Shun
    Qian, Bing
    Zhao, Long-Gang
    Sun, Qiong
    [J]. 2022 IEEE 5TH INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION ENGINEERING, ICECE, 2022, : 65 - 71
  • [4] Incremental Clustering of Time-Series by Fuzzy Clustering
    Aghabozorgi, Saeed
    Saybani, Mahmoud Reza
    Teh, Ying Wah
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2012, 28 (04) : 671 - 688
  • [5] Fast and Accurate Time-Series Clustering
    Paparrizos, John
    Gravano, Luis
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2017, 42 (02):
  • [6] Clustering of multivariate time-series data
    Singhal, A
    Seborg, DE
    [J]. PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3931 - 3936
  • [7] Time-series clustering - A decade review
    Aghabozorgi, Saeed
    Shirkhorshidi, Ali Seyed
    Teh Ying Wah
    [J]. INFORMATION SYSTEMS, 2015, 53 : 16 - 38
  • [8] Clustering multivariate time-series data
    Singhal, A
    Seborg, DE
    [J]. JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 427 - 438
  • [9] A clustering model for time-series forecasting
    Coric, Rebeka
    Dumic, Mateja
    Jelic, Slobodan
    [J]. 2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1105 - 1109
  • [10] Deep Time-Series Clustering: A Review
    Alqahtani, Ali
    Ali, Mohammed
    Xie, Xianghua
    Jones, Mark W.
    [J]. ELECTRONICS, 2021, 10 (23)