A Nested Two-Stage Clustering Method for Structured Temporal Sequence Data

被引:0
|
作者
Liang Wang
Vignesh Narayanan
Yao-Chi Yu
Yikyung Park
Jr-Shin Li
机构
[1] Washington University in St. Louis,
[2] Washington University School of Medicine,undefined
来源
关键词
Clustering; Optimal transport; Dynamic time warping; Structured temporal sequence;
D O I
暂无
中图分类号
学科分类号
摘要
Mining patterns of temporal sequence data is an important problem across many disciplines. Under appropriate preprocessing procedures, a structured temporal sequence can be organized into a probability measure or a time series representation, which grants a potential to reveal distinctive temporal pattern characteristics. In this paper, we propose a nested two-stage clustering method that integrates optimal transport and the dynamic time warping distances to learn the distributional and dynamic shape-based dissimilarity at the respective stage. The proposed clustering algorithm preserves both the distribution and shape patterns present in the data, which are critical for the datasets composed of structured temporal sequences. The effectiveness of the method is tested against existing agglomerative and K-shape-based clustering algorithms on Monte Carlo simulated synthetic datasets, and the performance is compared through various cluster validation metrics. Furthermore, we apply the developed method to real-world datasets from three domains: temporal dietary records, online retail sales, and smart meter energy profiles. The expressiveness of the cluster and subcluster centroid patterns shows significant promise of our method for structured temporal sequence data mining.
引用
收藏
页码:1627 / 1662
页数:35
相关论文
共 50 条
  • [1] A Nested Two-Stage Clustering Method for Structured Temporal Sequence Data
    Wang, Liang
    Narayanan, Vignesh
    Yu, Yao-Chi
    Park, Yikyung
    Li, Jr-Shin
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (07) : 1627 - 1662
  • [2] Two-stage pruning method for gram-based categorical sequence clustering
    Yuan, Liang
    Wang, Wenjian
    Chen, Lifei
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (04) : 631 - 640
  • [3] Two-stage pruning method for gram-based categorical sequence clustering
    Liang Yuan
    Wenjian Wang
    Lifei Chen
    [J]. International Journal of Machine Learning and Cybernetics, 2019, 10 : 631 - 640
  • [4] Two-stage nested partitions method for stochastic optimization
    Olafsson, S
    [J]. METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2004, 6 (01) : 5 - 27
  • [5] Two-Stage Nested Partitions Method for Stochastic Optimization
    Sigurdur Ólafsson
    [J]. Methodology And Computing In Applied Probability, 2004, 6 : 5 - 27
  • [6] Scalable nested partition-based clustering with two-stage sampling
    Kim, Jumi
    Yang, Jaekyung
    [J]. ASIA LIFE SCIENCES, 2015, : 233 - 248
  • [7] Research on two-stage fuzzy clustering method
    School of Computer, Taiyuan University of Science and Technology, Taiyuan 030024, China
    不详
    [J]. Harbin Gongcheng Daxue Xuebao, 2006, SUPPL. (552-557):
  • [8] Research on Two-Stage Fuzzy Clustering Method for Spectrum Data Based on PSO
    Cai Jiang-hui
    Zhang Ji-fu
    Zhao Xu-jun
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2009, 29 (04) : 1137 - 1141
  • [9] Sequence Classification: A Regression Based Generalization of Two-stage Clustering
    Farin, Nusrat Jahan
    Mansoor, Nafees
    Momen, Sifat
    Mobin, Iftekharul
    Mohammed, Nabeel
    [J]. 2016 INTERNATIONAL WORKSHOP ON COMPUTATIONAL INTELLIGENCE (IWCI), 2016, : 126 - 130
  • [10] PSOHS: an efficient two-stage approach for data clustering
    Hatamlou, Abdolreza
    Hatamlou, Masoumeh
    [J]. MEMETIC COMPUTING, 2013, 5 (02) : 155 - 161