MODEL-BASED LONGITUDINAL CLUSTERING WITH VARYING CLUSTER ASSIGNMENTS

被引:6
|
作者
Sewell, Daniel K. [1 ]
Chen, Yuguo [2 ]
Bernhard, William [3 ]
Sulkin, Tracy [3 ]
机构
[1] Univ Iowa, Dept Biostat, 145 N Riverside Dr, Iowa City, IA 52242 USA
[2] Univ Illinois, Dept Stat, 725 South Wright St, Champaign, IL 61820 USA
[3] Univ Illinois, Dept Polit Sci, 1407 W Gregory Dr, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
Cluster analysis; EM algorithm; multinomial logistic regression; normal mixture models; time series; EM ALGORITHM; LATENT;
D O I
10.5705/ss.2014.205
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
It is often of interest to perform clustering on longitudinal data, yet it is difficult to formulate an intuitive model for which estimation is computationally feasible. We propose a model-based clustering method for clustering objects that are observed over time. The proposed model can be viewed as an extension of the normal mixture model for clustering to longitudinal data. While existing models only account for clustering effects, we propose modeling the distribution of the observed values of each object as a blending of a cluster effect and an individual effect, hence also giving an estimate of how much the behavior of an object is determined by the cluster to which it belongs. Further, it is important to detect how explanatory variables affect the clustering. An advantage of our method is that it can handle multiple explanatory variables of any type through a linear modeling of the cluster transition probabilities. We implement the generalized EM algorithm using several recursive relationships to greatly decrease the computational cost. The accuracy of our estimation method is illustrated in a simulation study, and U.S. Congressional data is analyzed.
引用
收藏
页码:205 / 233
页数:29
相关论文
共 50 条
  • [1] Model-based clustering for longitudinal data
    De la Cruz-Mesia, Rolando
    Quintanab, Fernando A.
    Marshall, Guillermo
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (03) : 1441 - 1457
  • [2] Model-based clustering of longitudinal data
    McNicholas, Paul D.
    Murphy, T. Brendan
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (01): : 153 - 168
  • [3] Model-based time-varying clustering of multivariate longitudinal data with covariates and outliers
    Maruotti, Antonello
    Punzo, Antonio
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 113 : 475 - 496
  • [4] Bayesian model-based clustering for longitudinal ordinal data
    Roy Costilla
    Ivy Liu
    Richard Arnold
    Daniel Fernández
    [J]. Computational Statistics, 2019, 34 : 1015 - 1038
  • [5] Bayesian model-based clustering for longitudinal ordinal data
    Costilla, Roy
    Liu, Ivy
    Arnold, Richard
    Fernandez, Daniel
    [J]. COMPUTATIONAL STATISTICS, 2019, 34 (03) : 1015 - 1038
  • [6] Scalable model-based cluster analysis using clustering features
    Jin, HD
    Leung, KS
    Wong, ML
    Xu, ZB
    [J]. PATTERN RECOGNITION, 2005, 38 (05) : 637 - 649
  • [7] Model-Based Clustering
    McNicholas, Paul D.
    [J]. JOURNAL OF CLASSIFICATION, 2016, 33 (03) : 331 - 373
  • [8] Model-Based Clustering
    Gormley, Isobel Claire
    Murphy, Thomas Brendan
    Raftery, Adrian E.
    [J]. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2023, 10 : 573 - 595
  • [9] Model-Based Clustering
    Paul D. McNicholas
    [J]. Journal of Classification, 2016, 33 : 331 - 373
  • [10] Model-based clustering via linear cluster-weighted models
    Ingrassia, Salvatore
    Minotti, Simona C.
    Punzo, Antonio
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 159 - 182