Generalized Spike-and-Slab Priors for Bayesian Group Feature Selection Using Expectation Propagation

Cited by: 0
Authors
Hernandez-Lobato, Daniel [1]
Hernandez-Lobato, Jose Miguel [2]
Dupont, Pierre [3]
Affiliations
[1] Univ Autonoma Madrid, Dept Comp Sci, E-28049 Madrid, Spain
[2] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[3] Catholic Univ Louvain, Machine Learning Grp, ICTEAM, B-1348 Louvain, Belgium
Keywords
group feature selection; generalized spike-and-slab priors; expectation propagation; sparse linear model; approximate inference; sequential experimental design; signal reconstruction; GROUP-LASSO; VARIABLE SELECTION; REGRESSION; SHRINKAGE; DESIGN
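DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract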
We describe a Bayesian method for group feature selection in linear regression problems. The method is based on a generalized version of the standard spike-and-slab prior distribution which is often used for individual feature selection. Exact Bayesian inference under the prior considered is infeasible for typical regression problems. However, approximate inference can be carried out efficiently using Expectation Propagation (EP). A detailed analysis of the generalized spike-and-slab prior shows that it is well suited for regression problems that are sparse at the group level. Furthermore, this prior can be used to introduce prior knowledge about specific groups of features that are a priori believed to be more relevant. An experimental evaluation compares the performance of the proposed method with those of group LASSO, Bayesian group LASSO, automatic relevance determination and additional variants used for group feature selection. The results of these experiments show that a model based on the generalized spike-and-slab prior and the EP algorithm has state-of-the-art prediction performance in the problems analyzed. Furthermore, this model is also very useful to carry out sequential experimental design (also known as active learning), where the data instances that are most informative are iteratively included in the training set, reducing the number of instances needed to obtain a particular level of prediction accuracy.
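As a rough illustration of the kind of prior the abstract refers to, the sketch below draws one coefficient vector from a group-level spike-and-slab prior: each group has a binary latent variable, and all coefficients in a group are either exactly zero (spike) or drawn from a zero-mean Gaussian (slab). The function name, the argument names, the Bernoulli and Gaussian parameterization, and the example values are assumptions made for illustration; they are not taken from this record, and the sketch does not implement the EP inference step.

```python
import numpy as np

def sample_group_spike_and_slab(groups, p_active, slab_var, rng):
    """Draw one coefficient vector from a group-level spike-and-slab prior.

    groups   : integer array; groups[j] is the group index of feature j
    p_active : array; assumed prior probability that each group is relevant
    slab_var : assumed variance of the zero-mean Gaussian slab
    rng      : numpy.random.Generator
    """
    groups = np.asarray(groups)
    p_active = np.asarray(p_active, dtype=float)
    # One binary latent variable per group (True means the group is active).
    z = rng.random(p_active.shape[0]) < p_active
    # Candidate slab draw for every coefficient.
    slab = rng.normal(0.0, np.sqrt(slab_var), size=groups.shape[0])
    # Coefficients of inactive groups are set to exactly zero (the spike).
    w = np.where(z[groups], slab, 0.0)
    return w, z

# Hypothetical example: 8 features split into 3 groups.
rng = np.random.default_rng(0)
groups = np.array([0, 0, 0, 1, 1, 2, 2, 2])
p_active = np.array([0.9, 0.1, 0.5])
w, z = sample_group_spike_and_slab(groups, p_active, slab_var=1.0, rng=rng)
```

Raising an entry of `p_active` is one simple way to encode a prior belief that a particular group is more likely to be relevant.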
Pages: 1891-1945
Number of pages: 55
Related papers
50 in total
  • [31] Online Bayesian Sparse Learning with Spike and Slab Priors
    Fang, Shikai
    Zhe, Shandian
    Lee, Kuang-chih
    Zhang, Kai
    Neville, Jennifer
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 142 - 151
  • [32] Simultaneous Variable and Covariance Selection With the Multivariate Spike-and-Slab LASSO
    Deshpande, Sameer K.
    Rockova, Veronika
    George, Edward I.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (04) : 921 - 931
  • [33] Which UGC features drive web purchase intent? A spike-and-slab Bayesian Variable Selection Approach
    Owusu, Richard A.
    Mutshinda, Crispin M.
    Antai, Imoh
    Dadzie, Kofi Q.
    Winston, Evelyn M.
    [J]. INTERNET RESEARCH, 2016, 26 (01) : 22 - 37
  • [34] MULTI-TASK IMAGE CLASSIFICATION VIA COLLABORATIVE, HIERARCHICAL SPIKE-AND-SLAB PRIORS
    Mousavi, Hojjat S.
    Srinivas, Umamahesh
    Monga, Vishal
    Suo, Yuanming
    Dao, Minh
    Tran, Trac D.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4236 - 4240
  • [35] A MAJORIZATION-MINIMIZATION APPROACH TO VARIABLE SELECTION USING SPIKE AND SLAB PRIORS
    Yen, Tso-Jung
    [J]. ANNALS OF STATISTICS, 2011, 39 (03) : 1748 - 1775
  • [36] Enhancing Nonlinear Subspace Identification Using Sparse Bayesian Learning with Spike and Slab Priors
    Rui Zhu
    Sufang Chen
    Dong Jiang
    Shitao Xie
    Lei Ma
    Stefano Marchesiello
    Dario Anastasio
    [J]. Journal of Vibration Engineering & Technologies, 2024, 12 : 3021 - 3031
  • [37] Enhancing Nonlinear Subspace Identification Using Sparse Bayesian Learning with Spike and Slab Priors
    Zhu, Rui
    Chen, Sufang
    Jiang, Dong
    Xie, Shitao
    Ma, Lei
    Marchesiello, Stefano
    Anastasio, Dario
    [J]. JOURNAL OF VIBRATION ENGINEERING & TECHNOLOGIES, 2024, 12 (03) : 3021 - 3031
  • [38] The Spike-and-Slab Lasso Generalized Linear Models for Prediction and Associated Genes Detection
    Tang, Zaixiang
    Shen, Yueping
    Zhang, Xinyan
    Yi, Nengjun
    [J]. GENETICS, 2017, 205 (01) : 77 - +
  • [39] Fast Bayesian variable selection for high dimensional linear models: Marginal solo spike and slab priors
    Chen, Su
    Walker, Stephen G.
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (01) : 284 - 309
  • [40] Bayesian Quantile Regression Based on the Empirical Likelihood with Spike and Slab Priors
    Xi, Ruibin
    Li, Yunxiao
    Hu, Yiming
    [J]. BAYESIAN ANALYSIS, 2016, 11 (03) : 821 - 855