Incremental Clustering for Time Series Data based on an Improved Leader Algorithm

被引:4
|
作者
Huynh Thi Thu Thuy [1 ]
Duong Tuan Anh [1 ]
Vo Thi Ngoc Chau [1 ]
机构
[1] Ho Chi Minh City Univ Technol, Fac Comp Sci & Engn, Ho Chi Minh City, Vietnam
关键词
time series; incremental time series clustering; Leader algorithm; streaming time series;
D O I
10.1109/rivf.2019.8713702
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Incremental clustering is one of the most important tasks in streaming data mining. As for streaming time series, incremental clustering can be used in some advanced time series data mining tasks such as anomaly detection and motif discovery. In this paper, we propose a novel incremental clustering method for time series. The proposed method is an improved variant of the Leader algorithm, called I-Leader. Leader is a well-known incremental clustering method which is a single-pass distance-based partitional clustering method. However, Leader suffers from some weaknesses such as producing not good clustering result which consists of a large number of clusters and therefore requiring a lot of time for clustering. To overcome these weaknesses of Leader, in I-Leader we employ a data summarization technique and use this data summarization to maintain the high quality for the clusters. We have conducted some experiments on several datasets to compare the performance of the I-Leader algorithm with that of the original Leader in time series clustering. Besides, we compare I-Leader with k-Means. The experimental results show that our I-Leader algorithm runs faster than both Leader and k-Means. I-Leader is also more effective than both of these algorithms with better clustering results.
引用
收藏
页码:13 / 18
页数:6
相关论文
共 50 条
  • [1] Fuzzy clustering algorithm for time series based on adaptive incremental learning
    Wang, Wei
    Hu, Xiaohui
    Wang, Mingye
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (04) : 3991 - 3998
  • [2] An algorithm for time series data mining based on clustering
    Wu, Shaozhi
    Wu, Yue
    Wang, Ying
    Ye, Yalan
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 2155 - +
  • [3] Clustering Algorithm Based on Time Series Similarity to Web Data Clustering
    Yang Yan
    Yao Hua-Xiong
    Li Rong
    [J]. PROCEEDINGS OF THE 2015 4TH NATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING ( NCEECE 2015), 2016, 47 : 1373 - 1377
  • [4] The unordered time series fuzzy clustering algorithm based on the adaptive incremental learning
    Xu, Huanchun
    Hou, Rui
    Fan, Jinfeng
    Zhou, Liang
    Yue, Hongxuan
    Wang, Liusheng
    Liu, Jiayue
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (04) : 3783 - 3791
  • [5] A clustering algorithm for time series data
    Yin, Jian
    Zhou, Duanning
    Xie, Qiong-Qiong
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2006, : 119 - +
  • [6] Similarity Webpage Denoising Data Clustering Algorithm Based on Time Series
    Hang Chun-mei
    Wu Yang-yang
    [J]. 2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 984 - 986
  • [7] Data Incremental Clustering Algorithm based on Differential Privacy
    Gao, Qing
    Wang, Xiujun
    Gao, Yan
    Tao, Tao
    [J]. 2023 IEEE 9TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2023,
  • [8] An Artificial Immune Based Incremental Data Clustering Algorithm
    Xiao, Xin
    [J]. ADVANCES IN CIVIL ENGINEERING II, PTS 1-4, 2013, 256-259 : 2935 - 2938
  • [9] Iterative incremental clustering of time series
    Lin, J
    Vlachos, M
    Keogh, E
    Gunopulos, D
    [J]. ADVANCES IN DATABASE TECHNOLOGY - EDBT 2004, PROCEEDINGS, 2004, 2992 : 106 - 122
  • [10] Incremental fuzzy clustering of time series *
    Wang, Ling
    Xu, Peipei
    Ma, Qian
    [J]. FUZZY SETS AND SYSTEMS, 2021, 421 : 62 - 76