Penalized logistic regression for detecting gene interactions

被引:247
|
作者
Park, Mee Young [1 ]
Hastie, Trevor [2 ,3 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
关键词
discrete factors; gene interactions; high dimensional; logistic regression; L-2-regularization;
D O I
10.1093/biostatistics/kxm010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose using a variant of logistic regression (LR) with L-2-regularization to fit gene-gene and gene environment interaction models. Studies have shown that many common diseases are influenced by interaction of certain genes. LR models with quadratic penalization not only correctly characterizes the influential genes along with their interaction structures but also yields additional benefits in handling high-dimensional, discrete factors with a binary response. We illustrate the advantages of using an L-2-regularization scheme and compare its performance with that of "multifactor dimensionality reduction" and "FlexTree," 2 recent tools for identifying gene-gene interactions. Through simulated and real data sets, we demonstrate that our method outperforms other methods in the identification of the interaction structures as well as prediction accuracy. In addition, we validate the significance of the factors selected through bootstrap analyses.
引用
收藏
页码:30 / 50
页数:21
相关论文
共 50 条
  • [41] Comparison of multivariate adaptive regression splines and logistic regression in detecting SNP–SNP interactions and their application in prostate cancer
    Hui-Yi Lin
    Wenquan Wang
    Yung-Hsin Liu
    Seng-Jaw Soong
    Timothy P. York
    Leann Myers
    Jennifer J. Hu
    Journal of Human Genetics, 2008, 53 : 802 - 811
  • [42] Penalized wavelet nonparametric univariate logistic regression for irregular spaced data
    Amato, Umberto
    Antoniadis, Anestis
    De Feis, Italia
    Gijbels, Irene
    STATISTICS, 2023, 57 (05) : 1037 - 1060
  • [43] Approximate Bayesian logistic regression via penalized likelihood by data augmentation
    Discacciati, Andrea
    Orsini, Nicola
    Greenland, Sander
    Stata Journal, 2015, 15 (03): : 712 - 736
  • [44] A Double-Penalized Estimator to Combat Separation and Multicollinearity in Logistic Regression
    Guan, Ying
    Fu, Guang-Hui
    MATHEMATICS, 2022, 10 (20)
  • [45] Stacked penalized logistic regression for selecting views in multi -view learning
    van Loon, Wouter
    Fokkema, Marjolein
    Szabo, Botond
    de Rooij, Mark
    INFORMATION FUSION, 2020, 61 : 113 - 123
  • [46] PENALIZED MAXIMUM-LIKELIHOOD ESTIMATION IN LOGISTIC-REGRESSION AND DISCRIMINATION
    ANDERSON, JA
    BLAIR, V
    BIOMETRIKA, 1982, 69 (01) : 123 - 136
  • [47] Paths Following Algorithm for Penalized Logistic Regression Using SCAD and MCP
    Jiang, Wenhua
    Zhang, Cun-Hui
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2014, 43 (05) : 1064 - 1077
  • [48] Active learning for penalized logistic regression via sequential experimental design
    Wang, Jing
    Park, Eunsik
    NEUROCOMPUTING, 2017, 222 : 183 - 190
  • [49] Maximum softly-penalized likelihood for mixed effects logistic regression
    Sterzinger, Philipp
    Kosmidis, Ioannis
    STATISTICS AND COMPUTING, 2023, 33 (02)
  • [50] Identification of Grouped Rare and Common Variants via Penalized Logistic Regression
    Ayers, Kristin L.
    Cordell, Heather J.
    GENETIC EPIDEMIOLOGY, 2013, 37 (06) : 592 - 602