Sparse Bayesian infinite factor models

Cited by: 245
Authors
Bhattacharya, A. [1]
Dunson, D. B. [1]
Affiliations
[1] Duke Univ, Dept Stat Sci, Durham, NC 27708 USA
Funding
National Institutes of Health (USA);
Keywords
Adaptive Gibbs sampling; Factor analysis; High-dimensional data; Multiplicative gamma process; Parameter expansion; Regularization; Shrinkage; PRIOR DISTRIBUTIONS; SURVIVAL; SELECTION; ARTICLE; NUMBER;
DOI
10.1093/biomet/asr013
Chinese Library Classification
Q [Biological Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
We focus on sparse modelling of high-dimensional covariance matrices using Bayesian latent factor models. We propose a multiplicative gamma process shrinkage prior on the factor loadings which allows introduction of infinitely many factors, with the loadings increasingly shrunk towards zero as the column index increases. We use our prior on a parameter-expanded loading matrix to avoid the order dependence typical in factor analysis models and develop an efficient Gibbs sampler that scales well as data dimensionality increases. The gain in efficiency is achieved by the joint conjugacy property of the proposed prior, which allows block updating of the loadings matrix. We propose an adaptive Gibbs sampler for automatically truncating the infinite loading matrix through selection of the number of important factors. Theoretical results are provided on the support of the prior and truncation approximation bounds. A fast algorithm is proposed to produce approximate Bayes estimates. Latent factor regression methods are developed for prediction and variable selection in applications with high-dimensional correlated predictors. Operating characteristics are assessed through simulation studies, and the approach is applied to predict survival times from gene expression data.
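The abstract describes the multiplicative gamma process prior only in words. As a minimal sketch, assuming the form in which it is usually stated (loadings lambda_jh ~ N(0, 1 / (phi_jh * tau_h)), local precisions phi_jh ~ Gamma(nu/2, rate nu/2), and column precisions tau_h = delta_1 * ... * delta_h with delta_1 ~ Gamma(a1, 1) and delta_l ~ Gamma(a2, 1) for l >= 2), the following Python snippet draws one truncated loading matrix from the prior. The function name, the hyperparameter values a1, a2, nu, and the truncation level H are illustrative assumptions, not settings taken from this record.

```python
import numpy as np


def sample_mgp_loadings(p, H, a1=2.0, a2=3.0, nu=3.0, seed=0):
    """Draw a p x H loading matrix from a truncated multiplicative gamma
    process prior (hypothetical helper; hyperparameters are assumptions)."""
    rng = np.random.default_rng(seed)

    # delta_1 ~ Gamma(a1, 1), delta_l ~ Gamma(a2, 1) for l >= 2
    delta = np.empty(H)
    delta[0] = rng.gamma(shape=a1, scale=1.0)
    delta[1:] = rng.gamma(shape=a2, scale=1.0, size=H - 1)

    # Column precisions: tau_h is the cumulative product of delta_1..delta_h
    tau = np.cumprod(delta)

    # Local precisions phi_jh ~ Gamma(nu/2, rate=nu/2), i.e. scale = 2/nu
    phi = rng.gamma(shape=nu / 2.0, scale=2.0 / nu, size=(p, H))

    # lambda_jh ~ N(0, 1 / (phi_jh * tau_h)); tau broadcasts over rows
    Lambda = rng.normal(size=(p, H)) / np.sqrt(phi * tau)
    return Lambda, tau, delta


if __name__ == "__main__":
    Lambda, tau, delta = sample_mgp_loadings(p=50, H=10)
    # Column-wise mean squared loading, to inspect shrinkage across columns
    print(np.mean(Lambda**2, axis=0))
```

With a2 > 1 the column precisions tau_h tend to grow with h, so later columns are typically shrunk more strongly towards zero, which is the behaviour the abstract describes. Any single draw is random, so the printed column averages need not decrease monotonically.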
Pages: 291-306
Number of pages: 16
Related papers
50 in total
  • [31] Sparse Bayesian models: Bankruptcy-predictors of choice?
    Ribeiro, Bernardete
    Vieira, Armando
    das Neves, Joao Carvalho
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 3377 - +
  • [32] A Bayesian Sparse Factor Model with Adaptive Posterior Concentration
    Ohn, Ilsang
    Lin, Lizhen
    Kim, Yongdai
    BAYESIAN ANALYSIS, 2024, 19 (04): : 1277 - 1301
  • [33] On the truncation criteria in infinite factor models
    Schiavon, Lorenzo
    Canale, Antonio
    STAT, 2020, 9 (01):
  • [34] Bayesian tests of global factor models
    Fletcher, Jonathan
    JOURNAL OF EMPIRICAL FINANCE, 2018, 48 : 279 - 289
  • [35] On the identifiability of Bayesian factor analytic models
    Panagiotis Papastamoulis
    Ioannis Ntzoufras
    Statistics and Computing, 2022, 32
  • [36] On the identifiability of Bayesian factor analytic models
    Papastamoulis, Panagiotis
    Ntzoufras, Ioannis
    STATISTICS AND COMPUTING, 2022, 32 (02)
  • [37] Bayesian estimation of sparse dynamic factor models with order-independent and ex-post mode identification
    Kaufmann, Sylvia
    Schumacher, Christian
    JOURNAL OF ECONOMETRICS, 2019, 210 (01) : 116 - 134
  • [38] Infinite Dropout for training Bayesian models from data streams
    Van-Son Nguyen
    Duc-Tung Nguyen
    Linh Ngo Van
    Khoat Than
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 125 - 134
  • [39] Copula based factorization in Bayesian multivariate infinite mixture models
    Burda, Martin
    Prokhorov, Artem
    JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 127 : 200 - 213
  • [40] Bayesian infinite mixture models for wind speed distribution estimation
    Wang, Yun
    Li, Yifen
    Zou, Runmin
    Song, Dongran
    ENERGY CONVERSION AND MANAGEMENT, 2021, 236