Time-series clustering - A decade review

被引:990
|
作者
Aghabozorgi, Saeed [1 ]
Shirkhorshidi, Ali Seyed [1 ]
Teh Ying Wah [1 ]
机构
[1] Univ Malaya, Dept Informat Syst, Fac Comp Sci & Informat Technol, Kuala Lumpur 50603, Malaysia
关键词
Clustering; Time-series; Distance measure; Evaluation measure; Representations; GENE-EXPRESSION DATA; SIMILARITY SEARCH; DIMENSIONALITY REDUCTION; AVERAGING METHOD; REPRESENTATION; ALGORITHMS; MODEL; CLASSIFICATION; COMPRESSION; RECOGNITION;
D O I
10.1016/j.is.2015.04.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is a solution for classifying enormous data when there is not any early knowledge about classes. With emerging new concepts like cloud computing and big data and their vast applications in recent years, research works have been increased on unsupervised solutions like clustering algorithms to extract knowledge from this avalanche of data. Clustering time-series data has been used in diverse scientific areas to discover patterns which empower data analysts to extract valuable information from complex and massive datasets. In case of huge datasets, using supervised classification solutions is almost impossible, while clustering can solve this problem using unsupervised approaches. In this research work, the focus is on time-series data, which is one of the popular data types in clustering problems and is broadly used from gene expression data in biology to stock market analysis in finance. This review will expose four main components of time-series clustering and is aimed to represent an updated investigation on the trend of improvements in efficiency, quality and complexity of clustering time-series approaches during the last decade and enlighten new paths for future works. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:16 / 38
页数:23
相关论文
共 50 条
  • [1] Deep Time-Series Clustering: A Review
    Alqahtani, Ali
    Ali, Mohammed
    Xie, Xianghua
    Jones, Mark W.
    [J]. ELECTRONICS, 2021, 10 (23)
  • [2] Incremental Clustering of Time-Series by Fuzzy Clustering
    Aghabozorgi, Saeed
    Saybani, Mahmoud Reza
    Teh, Ying Wah
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2012, 28 (04) : 671 - 688
  • [3] Fast and Accurate Time-Series Clustering
    Paparrizos, John
    Gravano, Luis
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2017, 42 (02):
  • [4] Clustering of multivariate time-series data
    Singhal, A
    Seborg, DE
    [J]. PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3931 - 3936
  • [5] Clustering multivariate time-series data
    Singhal, A
    Seborg, DE
    [J]. JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 427 - 438
  • [6] A clustering model for time-series forecasting
    Coric, Rebeka
    Dumic, Mateja
    Jelic, Slobodan
    [J]. 2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1105 - 1109
  • [7] Clustering short time-series microarray
    Ping, Loh Wei
    Abu Hasan, Yahya
    [J]. INTERNATIONAL CONFERENCE ON MATHEMATICAL BIOLOGY 2007, 2008, 971 : 39 - 46
  • [8] Time-series Clustering by Approximate Prototypes
    Hautamaki, Ville
    Nykanen, Pekka
    Franti, Pasi
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 644 - 647
  • [9] Translational symmetry in subsequence time-series clustering
    Ide, Tsuyoshi
    [J]. NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2007, 4384 : 5 - 18
  • [10] Clustering to Forecast Sparse Time-Series Data
    Jha, Abhay
    Ray, Shubhankar
    Seaman, Brian
    Dhillon, Inderjit S.
    [J]. 2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1388 - 1399