Using the EM algorithm for Bayesian variable selection in logistic regression models with related covariates

被引:9
|
作者
Koslovsky, M. D. [1 ]
Swartz, M. D. [1 ]
Leon-Novelo, L. [1 ]
Chan, W. [1 ]
Wilkinson, A. V. [2 ]
机构
[1] UTHealth, Dept Biostat, 1200 Pressler St, Houston, TX 77030 USA
[2] UTHealth, Dept Epidemiol, Austin, TX USA
关键词
Bayesian inference; binary outcomes; deterministic annealing; expectation-maximization; grouped covariates; heredity constraint; inheritance property; variable selection; 62F15; 62J12; 68U20; GROUP LASSO; YOUTH; SMOKING; BINARY;
D O I
10.1080/00949655.2017.1398255
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We develop a Bayesian variable selection method for logistic regression models that can simultaneously accommodate qualitative covariates and interaction terms under various heredity constraints. We use expectation-maximization variable selection (EMVS) with a deterministic annealing variant as the platform for our method, due to its proven flexibility and efficiency. We propose a variance adjustment of the priors for the coefficients of qualitative covariates, which controls false-positive rates, and a flexible parameterization for interaction terms, which accommodates user-specified heredity constraints. This method can handle all pairwise interaction terms as well as a subset of specific interactions. Using simulation, we show that this method selects associated covariates better than the grouped LASSO and the LASSO with heredity constraints in various exploratory research scenarios encountered in epidemiological studies. We apply our method to identify genetic and non-genetic risk factors associated with smoking experimentation in a cohort of Mexican-heritage adolescents.
引用
收藏
页码:575 / 596
页数:22
相关论文
共 50 条
  • [1] Bayesian variable selection for logistic regression
    Tian, Yiqing
    Bondell, Howard D.
    Wilson, Alyson
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2019, 12 (05) : 378 - 393
  • [2] A novel variational Bayesian method for variable selection in logistic regression models
    Zhang, Chun-Xia
    Xu, Shuang
    Zhang, Jiang-She
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 133 : 1 - 19
  • [3] Prior elicitation, variable selection and Bayesian computation for logistic regression models
    Chen, MH
    Ibrahim, JG
    Yiannoutsos, C
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 : 223 - 242
  • [4] Variable selection in logistic regression models
    Zellner, D
    Keller, F
    Zellner, GE
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2004, 33 (03) : 787 - 805
  • [5] An Ensemble EM Algorithm for Bayesian Variable Selection
    Wang, Jin
    Ouyang, Yunbo
    Ji, Yuan
    Liang, Feng
    [J]. BAYESIAN ANALYSIS, 2022, 17 (03): : 879 - 900
  • [6] Bayesian variable selection for the Cox regression model with missing covariates
    Joseph G. Ibrahim
    Ming-Hui Chen
    Sungduk Kim
    [J]. Lifetime Data Analysis, 2008, 14 : 496 - 520
  • [7] Bayesian variable selection for the Cox regression model with missing covariates
    Ibrahim, Joseph G.
    Chen, Ming-Hui
    Kim, Sungduk
    [J]. LIFETIME DATA ANALYSIS, 2008, 14 (04) : 496 - 520
  • [8] Bayesian variable selection for regression models
    Kuo, L
    Mallick, B
    [J]. AMERICAN STATISTICAL ASSOCIATION - 1996 PROCEEDINGS OF THE SECTION ON BAYESIAN STATISTICAL SCIENCE, 1996, : 170 - 175
  • [9] Application of Bayesian variable selection in logistic regression model
    Bangchang, Kannat Na
    [J]. AIMS MATHEMATICS, 2024, 9 (05): : 13336 - 13345
  • [10] Variable selection for linear regression models with random covariates
    Nkiet, GM
    [J]. COMPTES RENDUS DE L ACADEMIE DES SCIENCES SERIE I-MATHEMATIQUE, 2001, 333 (12): : 1105 - 1110