Exponential family mixed membership models for soft clustering of multivariate data

被引:1
|
作者
White, Arthur [1 ]
Murphy, Thomas Brendan [2 ,3 ]
机构
[1] Univ Dublin, Sch Comp Sci & Stat, Trinity Coll Dublin, Dublin 2, Ireland
[2] Univ Coll Dublin, Sch Math & Stat, Dublin 4, Ireland
[3] Univ Coll Dublin, Insight Res Ctr, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
Mixed membership models; Model based clustering; Mixture models; Variational Bayes; DISABILITY; INFERENCE;
D O I
10.1007/s11634-016-0267-5
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
For several years, model-based clustering methods have successfully tackled many of the challenges presented by data-analysts. However, as the scope of data analysis has evolved, some problems may be beyond the standard mixture model framework. One such problem is when observations in a dataset come from overlapping clusters, whereby different clusters will possess similar parameters for multiple variables. In this setting, mixed membership models, a soft clustering approach whereby observations are not restricted to single cluster membership, have proved to be an effective tool. In this paper, a method for fitting mixed membership models to data generated by a member of an exponential family is outlined. The method is applied to count data obtained from an ultra running competition, and compared with a standard mixture model approach.
引用
收藏
页码:521 / 540
页数:20
相关论文
共 50 条
  • [31] Multivariate generalized linear mixed models for underdispersed count data
    da Silva, Guilherme Parreira
    Laureano, Henrique Aparecido
    Petterle, Ricardo Rasmussen
    Ribeiro Jr, Paulo Justiniano
    Bonat, Wagner Hugo
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (14) : 2410 - 2427
  • [32] Multivariate Latent Growth Models for Mixed Data with Covariate Effects
    Bianconcini, S.
    Cagnone, S.
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2012, 41 (16-17) : 3079 - 3093
  • [33] High dimensional multivariate mixed models for binary questionnaire data
    Fieuws, Steffen
    Verbeke, Geert
    Boen, Filip
    Delecluse, Christophe
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2006, 55 : 449 - 460
  • [34] Optimal designs for multivariate logistic mixed models with longitudinal data
    Jiang, Hong-Yan
    Yue, Rong-Xian
    Zhou, Xiao-Dong
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2019, 48 (04) : 850 - 864
  • [35] On Fuzzy Clustering for Categorical Multivariate Data Induced by Polya Mixture Models
    Kanzawa, Yuchi
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2017), 2017, 10571 : 89 - 102
  • [36] Spatial generalized linear mixed models with multivariate CAR models for areal data
    Torabi, Mahmoud
    [J]. SPATIAL STATISTICS, 2014, 10 : 12 - 26
  • [37] Interpreting Multivariate Membership Degrees of Fuzzy Clustering Methods: a Strategy
    Pimentel, Bruno A.
    de Souto, Marcilio C. P.
    de Souza, Renata M. C. R.
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2800 - 2804
  • [38] JOINT MIXED MEMBERSHIP MODELING OF MULTIVARIATE LONGITUDINAL AND SURVIVAL DATA FOR LEARNING THE INDIVIDUALIZED DISEASE PROGRESSION
    He, Yuyang
    Song, Xinyuan
    Kang, Kai
    [J]. ANNALS OF APPLIED STATISTICS, 2024, 18 (03): : 1924 - 1946
  • [39] A NEW FAMILY OF MULTIVARIATE EXPONENTIAL-DISTRIBUTIONS
    FANG, KT
    FANG, BQ
    [J]. KEXUE TONGBAO, 1986, 31 (21): : 1510 - 1511
  • [40] MULTIVARIATE DISCRETE EXPONENTIAL FAMILY OF DISTRIBUTIONS AND THEIR PROPERTIES
    KHATRI, CG
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1983, 12 (08) : 877 - 893