A novel variational Bayesian method for variable selection in logistic regression models

Cited by: 21
Authors
Zhang, Chun-Xia [1]
Xu, Shuang [1]
Zhang, Jiang-She [1]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Variable selection; Logistic regression; Sparse model; Variational Bayes; Indicator model; High-dimensional data; PENALIZED REGRESSION; EM ALGORITHM; INFERENCE;
DOI
10.1016/j.csda.2018.08.025
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
With high-dimensional data emerging in various domains, sparse logistic regression models have attracted considerable interest from researchers. Variable selection plays a key role in both improving prediction accuracy and enhancing the interpretability of the fitted models. Bayesian variable selection approaches enjoy many advantages, such as high selection accuracy and the ease of incorporating many kinds of prior knowledge. However, because Bayesian methods generally make inference from the posterior distribution with Markov chain Monte Carlo (MCMC) techniques, they become intractable in high-dimensional situations due to the large search space. To address this issue, a novel variational Bayesian method for variable selection in high-dimensional logistic regression models is presented. The proposed method is based on the indicator model, in which each covariate is equipped with a binary latent variable indicating whether it is important. A Bernoulli-type prior is adopted for the latent indicator variable. For the hyperparameter of the Bernoulli prior, we provide two schemes to determine its optimal value so that the model can achieve sparsity adaptively. To identify important variables and make predictions, an efficient variational Bayesian approach is employed to make inference from the posterior distribution. Experiments conducted on both synthetic and publicly available data show that the new method outperforms or is very competitive with other popular counterparts. (C) 2018 Elsevier B.V. All rights reserved.
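To make the indicator model in the abstract concrete, the following is a minimal sketch that simulates data from one such model. It is not the authors' implementation and contains no variational inference. The Gaussian prior on the coefficients, the function name `simulate_indicator_logistic`, and all parameter names and default values (`pi`, `slab_sd`, and so on) are assumptions made for illustration; the abstract specifies only the binary indicators and their Bernoulli-type prior.

```python
import numpy as np


def simulate_indicator_logistic(n=200, p=50, pi=0.1, slab_sd=2.0, seed=0):
    """Draw one data set from an indicator-model logistic regression.

    Assumed generative story (illustrative only):
      gamma_j ~ Bernoulli(pi)            latent inclusion indicator
      beta_j  ~ Normal(0, slab_sd^2)     coefficient (assumed Gaussian prior)
      y_i     ~ Bernoulli(sigmoid(x_i^T (gamma * beta)))
    """
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, p))        # design matrix
    gamma = rng.random(p) < pi             # Bernoulli(pi) indicators
    beta = rng.normal(0.0, slab_sd, p)     # coefficients before masking
    eta = X @ (gamma * beta)               # only selected covariates contribute
    prob = 1.0 / (1.0 + np.exp(-eta))      # logistic link
    y = (rng.random(n) < prob).astype(int)
    return X, y, gamma, beta


X, y, gamma, beta = simulate_indicator_logistic()
print("indices of covariates marked important:", np.flatnonzero(gamma))
```

Here a smaller `pi` yields a sparser model, which is the role the abstract assigns to the hyperparameter of the Bernoulli prior.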
Pages: 1-19
Page count: 19