Exponential family mixed membership models for soft clustering of multivariate data

被引:1
|
作者
White, Arthur [1 ]
Murphy, Thomas Brendan [2 ,3 ]
机构
[1] Univ Dublin, Sch Comp Sci & Stat, Trinity Coll Dublin, Dublin 2, Ireland
[2] Univ Coll Dublin, Sch Math & Stat, Dublin 4, Ireland
[3] Univ Coll Dublin, Insight Res Ctr, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
Mixed membership models; Model based clustering; Mixture models; Variational Bayes; DISABILITY; INFERENCE;
D O I
10.1007/s11634-016-0267-5
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
For several years, model-based clustering methods have successfully tackled many of the challenges presented by data-analysts. However, as the scope of data analysis has evolved, some problems may be beyond the standard mixture model framework. One such problem is when observations in a dataset come from overlapping clusters, whereby different clusters will possess similar parameters for multiple variables. In this setting, mixed membership models, a soft clustering approach whereby observations are not restricted to single cluster membership, have proved to be an effective tool. In this paper, a method for fitting mixed membership models to data generated by a member of an exponential family is outlined. The method is applied to count data obtained from an ultra running competition, and compared with a standard mixture model approach.
引用
收藏
页码:521 / 540
页数:20
相关论文
共 50 条
  • [41] TESTS OF COMPOSITE HYPOTHESES FOR MULTIVARIATE EXPONENTIAL FAMILY
    MATTHES, TK
    TRUAX, DR
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1967, 38 (03): : 681 - &
  • [42] Multivariate lifetime distributions for the exponential dispersion family
    Alai, Daniel H.
    [J]. SCANDINAVIAN ACTUARIAL JOURNAL, 2019, (05) : 387 - 405
  • [43] A multivariate generalization of the power exponential family of distributions
    Gomez, E
    Gomez-Villegas, MA
    Marin, JM
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1998, 27 (03) : 589 - 600
  • [44] Discriminative Mixed-membership Models
    Shan, Hanhuai
    Banerjee, Arindam
    Oza, Nikunj C.
    [J]. 2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 466 - +
  • [45] Clustering of multivariate geostatistical data
    Fouedjio, Francky
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2020, 12 (05)
  • [46] Network Clustering Analysis Using Mixture Exponential-Family Random Graph Models and Its Application in Genetic Interaction Data
    Wang, Yishu
    Fang, Huaying
    Yang, Dejie
    Zhao, Hongyu
    Deng, Minghua
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (05) : 1743 - 1752
  • [47] Variational Inference over Nonstationary Data Streams for Exponential Family Models †
    Masegosa, Andres R.
    Ramos-Lopez, Dario
    Salmeron, Antonio
    Langseth, Helge
    Nielsen, Thomas D.
    [J]. MATHEMATICS, 2020, 8 (11) : 1 - 27
  • [48] Multivariate GARCH models with correlation clustering
    So, Mike K. P.
    Yip, Iris W. H.
    [J]. Journal of Forecasting, 2012, 31 (05): : 443 - 468
  • [49] Multivariate GARCH Models with Correlation Clustering
    So, Mike K. P.
    Yip, Iris W. H.
    [J]. JOURNAL OF FORECASTING, 2012, 31 (05) : 443 - 468
  • [50] Directed Clustering of Multivariate Data Based on Linear or Quadratic Latent Variable Models
    Zhang, Yingjuan
    Einbeck, Jochen
    [J]. ALGORITHMS, 2024, 17 (08)