Robust Bayesian model selection for variable clustering with the Gaussian graphical model

被引:0
|
作者
Daniel Andrade
Akiko Takeda
Kenji Fukumizu
机构
[1] SOKENDAI (The Graduate University for Advanced Studies),Security Research Laboratories
[2] NEC,Department of Creative Informatics
[3] The University of Tokyo,undefined
[4] RIKEN Center for Advanced Intelligence Project,undefined
[5] The Institute of Statistical Mathematics,undefined
来源
Statistics and Computing | 2020年 / 30卷
关键词
Clustering; Gaussian graphical model; Model selection; Variational approximation;
D O I
暂无
中图分类号
学科分类号
摘要
Variable clustering is important for explanatory analysis. However, only few dedicated methods for variable clustering with the Gaussian graphical model have been proposed. Even more severe, small insignificant partial correlations due to noise can dramatically change the clustering result when evaluating for example with the Bayesian information criteria (BIC). In this work, we try to address this issue by proposing a Bayesian model that accounts for negligible small, but not necessarily zero, partial correlations. Based on our model, we propose to evaluate a variable clustering result using the marginal likelihood. To address the intractable calculation of the marginal likelihood, we propose two solutions: one based on a variational approximation and another based on MCMC. Experiments on simulated data show that the proposed method is similarly accurate as BIC in the no noise setting, but considerably more accurate when there are noisy partial correlations. Furthermore, on real data the proposed method provides clustering results that are intuitively sensible, which is not always the case when using BIC or its extensions.
引用
收藏
页码:351 / 376
页数:25
相关论文
共 50 条
  • [41] Gaussian Graphical Model-Based Clustering of Time Series Data
    Obata, Kohei
    [J]. PROCEEDINGS OF THE 17TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2024, 2024, : 1148 - 1149
  • [42] Bayesian feature and model selection for Gaussian mixture models
    Constantinopoulos, C
    Titsias, MK
    Likas, A
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (06) : 1013 - U1
  • [43] Robust variable selection in the logistic regression model
    Jiang, Yunlu
    Zhang, Jiantao
    Huang, Yingqiang
    Zou, Hang
    Huang, Meilan
    Chen, Fanhong
    [J]. HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2021, 50 (05): : 1572 - 1582
  • [44] Bayesian Model Selection for Change Point Detection and Clustering
    Mazhar, Othmane
    Rojas, Cristian R.
    Fischione, Carlo
    Hesamzadeh, Mohammad Reza
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [45] Bayesian variable selection and model averaging in the arbitrage pricing theory model
    Ouysse, Rachida
    Kohn, Robert
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) : 3249 - 3268
  • [46] Iterative feature selection in Gaussian mixture clustering with automatic model selection
    Zeng, Hong
    Cheung, Yiu-Ming
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2277 - 2282
  • [47] LATENT VARIABLE GRAPHICAL MODEL SELECTION VIA CONVEX OPTIMIZATION
    Chandrasekaran, Venkat
    Parrilo, Pablo A.
    Willsky, Alan S.
    [J]. ANNALS OF STATISTICS, 2012, 40 (04): : 1935 - 1967
  • [48] Comparing Model Selection and Regularization Approaches to Variable Selection in Model-Based Clustering
    Celeux, Gilles
    Martin-Magniette, Marie-Laure
    Maugis-Rabusseau, Cathy
    Raftery, Adrian E.
    [J]. JOURNAL OF THE SFDS, 2014, 155 (02): : 57 - 71
  • [49] Family-wise error rate control in Gaussian graphical model selection via distributionally robust optimization
    Tran, Chau
    Cisneros-Velarde, Pedro
    Oh, Sang-Yun
    Petersen, Alexander
    [J]. STAT, 2022, 11 (01):
  • [50] Variable Selection for Clustering with Gaussian Mixture Models
    Maugis, Cathy
    Celeux, Gilles
    Martin-Magniette, Marie-Laure
    [J]. BIOMETRICS, 2009, 65 (03) : 701 - 709