Directed Clustering of Multivariate Data Based on Linear or Quadratic Latent Variable Models

被引:0
|
作者
Zhang, Yingjuan [1 ]
Einbeck, Jochen [1 ,2 ]
机构
[1] Univ Durham, Dept Math Sci, Durham DH1 3LE, England
[2] Univ Durham, Durham Res Methods Ctr, Durham DH1 3LE, England
关键词
clustering; mixture model; latent variable model; dimension reduction; expectation-maximization algorithm; model selection; fundamental diagram; MAXIMUM-LIKELIHOOD; DISCRIMINANT-ANALYSIS;
D O I
10.3390/a17080358
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider situations in which the clustering of some multivariate data is desired, which establishes an ordering of the clusters with respect to an underlying latent variable. As our motivating example for a situation where such a technique is desirable, we consider scatterplots of traffic flow and speed, where a pattern of consecutive clusters can be thought to be linked by a latent variable, which is interpretable as traffic density. We focus on latent structures of linear or quadratic shapes, and present an estimation methodology based on expectation-maximization, which estimates both the latent subspace and the clusters along it. The directed clustering approach is summarized in two algorithms and applied to the traffic example outlined. Connections to related methodology, including principal curves, are briefly drawn.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Generalized Linear Latent Variable Models for Multivariate Count and Biomass Data in Ecology
    Niku, Jenni
    Warton, David I.
    Hui, Francis K. C.
    Taskinen, Sara
    [J]. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2017, 22 (04) : 498 - 522
  • [2] Generalized Linear Latent Variable Models for Multivariate Count and Biomass Data in Ecology
    Jenni Niku
    David I. Warton
    Francis K. C. Hui
    Sara Taskinen
    [J]. Journal of Agricultural, Biological and Environmental Statistics, 2017, 22 : 498 - 522
  • [3] Generalized linear latent variable models for repeated measures of spatially correlated multivariate data
    Zhu, J
    Eickhoff, JC
    Yan, P
    [J]. BIOMETRICS, 2005, 61 (03) : 674 - 683
  • [4] gllvm: Fast analysis of multivariate abundance data with generalized linear latent variable models in r
    Niku, Jenni
    Hui, Francis K. C.
    Taskinen, Sara
    Warton, David I.
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2019, 10 (12): : 2173 - 2182
  • [5] Hidden Markov Latent Variable Models with Multivariate Longitudinal Data
    Song, Xinyuan
    Xia, Yemao
    Zhu, Hongtu
    [J]. BIOMETRICS, 2017, 73 (01) : 313 - 323
  • [6] Bayesian analysis of transformation latent variable models with multivariate censored data
    Song, Xin-Yuan
    Pan, Deng
    Liu, Peng-Fei
    Cai, Jing-Heng
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (05) : 2337 - 2358
  • [7] GEE-Assisted Variable Selection for Latent Variable Models with Multivariate Binary Data
    Hui, Francis K. C.
    Muller, Samuel
    Welsh, A. H.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 1252 - 1263
  • [8] Polya-gamma data augmentation and latent variable models for multivariate binomial data
    Holmes, John B.
    Schofield, Matthew R.
    Barker, Richard J.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2022, 71 (01) : 194 - 218
  • [9] Latent variable models for probabilistic graph clustering
    Lin, JK
    [J]. BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2004, 735 : 187 - 194
  • [10] Quadratic prediction problems in multivariate linear models
    Liu, Xu-Qing
    Wang, Dong-Dong
    Rong, Jian-Ying
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2009, 100 (02) : 291 - 300