Calibrating covariate informed product partition models

被引:0
|
作者
Garritt L. Page
Fernando A. Quintana
机构
[1] Brigham Young University,Department of Statistics
[2] Pontificia Universidad Católica de Chile,Departamento de Estadística
来源
Statistics and Computing | 2018年 / 28卷
关键词
High-dimensional covariate space; Prediction; Covariate-based clustering; Mixture of experts; Random partition models;
D O I
暂无
中图分类号
学科分类号
摘要
Covariate informed product partition models incorporate the intuitively appealing notion that individuals or units with similar covariate values a priori have a higher probability of co-clustering than those with dissimilar covariate values. These methods have been shown to perform well if the number of covariates is relatively small. However, as the number of covariates increase, their influence on partition probabilities overwhelm any information the response may provide in clustering and often encourage partitions with either a large number of singleton clusters or one large cluster resulting in poor model fit and poor out-of-sample prediction. This same phenomenon is observed in Bayesian nonparametric regression methods that induce a conditional distribution for the response given covariates through a joint model. In light of this, we propose two methods that calibrate the covariate-dependent partition model by capping the influence that covariates have on partition probabilities. We demonstrate the new methods’ utility using simulation and two publicly available datasets.
引用
收藏
页码:1009 / 1031
页数:22
相关论文
共 50 条
  • [1] Calibrating covariate informed product partition models
    Page, Garritt L.
    Quintana, Fernando A.
    STATISTICS AND COMPUTING, 2018, 28 (05) : 1009 - 1031
  • [2] DISCOVERING INTERACTIONS USING COVARIATE INFORMED RANDOM PARTITION MODELS
    Page, Garritt L.
    Quintana, Fernando A.
    Rosner, Gary L.
    ANNALS OF APPLIED STATISTICS, 2021, 15 (01): : 1 - 21
  • [3] Spatial Product Partition Models
    Page, Garritt L.
    Quintana, Fernando A.
    BAYESIAN ANALYSIS, 2016, 11 (01): : 265 - 298
  • [4] Product partition models for normal means
    Crowley, EM
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (437) : 192 - 198
  • [5] Product partition models with correlated parameters
    Monteiro, Joao V. D.
    Assuncao, Renato M.
    Loschi, Rosangela H.
    BAYESIAN ANALYSIS, 2011, 6 (04): : 691 - 725
  • [6] Bayesian clustering and product partition models
    Quintana, FA
    Iglesias, PL
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2003, 65 : 557 - 574
  • [7] Modal Clustering in a Class of Product Partition Models
    Dahl, David B.
    BAYESIAN ANALYSIS, 2009, 4 (02): : 243 - 264
  • [8] PRODUCT PARTITION MODELS FOR CHANGE POINT PROBLEMS
    BARRY, D
    HARTIGAN, JA
    ANNALS OF STATISTICS, 1992, 20 (01): : 260 - 279
  • [9] Statistical modelling using product partition models
    Jordan, Claire
    Livingstone, Vicki
    Barry, Daniel
    STATISTICAL MODELLING, 2007, 7 (03) : 275 - 295
  • [10] Physics-Informed Machine Learning for Calibrating Macroscopic Traffic Flow Models
    Tang, Yu
    Jin, Li
    Ozbay, Kaan
    TRANSPORTATION SCIENCE, 2024, 58 (06) : 1389 - 1402