Penalized logistic regression for detecting gene interactions

Cited by: 247
Authors
Park, Mee Young [1]
Hastie, Trevor [2, 3]
Affiliations
[1] Google Inc, Mountain View, CA 94043 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
Keywords
discrete factors; gene interactions; high dimensional; logistic regression; L2-regularization
DOI
10.1093/biostatistics/kxm010
Chinese Library Classification number
Q [Biological sciences]
Subject classification codes
07; 0710; 09
Abstract
We propose using a variant of logistic regression (LR) with L2-regularization to fit gene-gene and gene-environment interaction models. Studies have shown that many common diseases are influenced by interactions among certain genes. LR models with quadratic penalization not only correctly characterize the influential genes along with their interaction structures but also yield additional benefits in handling high-dimensional, discrete factors with a binary response. We illustrate the advantages of using an L2-regularization scheme and compare its performance with that of "multifactor dimensionality reduction" and "FlexTree," two recent tools for identifying gene-gene interactions. Through simulated and real data sets, we demonstrate that our method outperforms other methods in both the identification of interaction structures and prediction accuracy. In addition, we validate the significance of the selected factors through bootstrap analyses.
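The idea in the abstract, a quadratic (L2) penalty on a logistic regression over dummy-coded discrete factors and their interactions, can be illustrated with a minimal sketch. This is not the authors' implementation: the full dummy coding of two three-level genotypes, the plain gradient-descent solver, the simulated data, and the names `design_row` and `fit_l2_logistic` are all assumptions made for illustration.

```python
import math
import random

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def design_row(g1, g2):
    """Dummy-code two 3-level genotypes (0, 1, 2) plus all 9 interaction cells.
    Every level gets its own column (assumed coding); the penalty keeps the
    resulting over-parameterized model well defined."""
    d1 = [1.0 if g1 == k else 0.0 for k in range(3)]
    d2 = [1.0 if g2 == k else 0.0 for k in range(3)]
    inter = [a * b for a in d1 for b in d2]
    return d1 + d2 + inter  # 3 + 3 + 9 = 15 columns

def fit_l2_logistic(X, y, lam, n_iter=500):
    """Minimize -loglik(b0, b) + (lam / 2) * ||b||^2 by gradient descent.
    The intercept b0 is left unpenalized."""
    n, p = len(X), len(X[0])
    b0, b = 0.0, [0.0] * p
    # With this coding each row has three active dummies plus the intercept,
    # so the gradient's Lipschitz constant is at most n + lam.
    step = 1.0 / (n + lam)
    for _ in range(n_iter):
        g0, g = 0.0, [lam * bj for bj in b]  # penalty part of the gradient
        for xi, yi in zip(X, y):
            r = sigmoid(b0 + sum(bj * xj for bj, xj in zip(b, xi))) - yi
            g0 += r
            for j, xj in enumerate(xi):
                if xj:
                    g[j] += r * xj
        b0 -= step * g0
        b = [bj - step * gj for bj, gj in zip(b, g)]
    return b0, b

# Simulated data (hypothetical): risk is elevated only when both genes
# carry genotype 2, i.e. a pure interaction effect.
rng = random.Random(0)
genotypes = [(rng.randrange(3), rng.randrange(3)) for _ in range(200)]
y = [1.0 if rng.random() < (0.8 if (g1, g2) == (2, 2) else 0.2) else 0.0
     for g1, g2 in genotypes]
X = [design_row(g1, g2) for g1, g2 in genotypes]

b0_weak, b_weak = fit_l2_logistic(X, y, lam=0.1)
b0_strong, b_strong = fit_l2_logistic(X, y, lam=50.0)
norm_weak = sum(v * v for v in b_weak)
norm_strong = sum(v * v for v in b_strong)
print("squared coefficient norm, lam=0.1:", norm_weak)
print("squared coefficient norm, lam=50: ", norm_strong)
```

Comparing the two fits shows the basic effect of the penalty: a larger `lam` shrinks the coefficient vector toward zero. In practice `lam` would be chosen by a data-driven criterion such as cross-validation, which this sketch omits.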
Pages: 30-50
Page count: 21
Related papers
50 in total
  • [1] Classification of gene microarrays by penalized logistic regression
    Zhu, J
    Hastie, T
    BIOSTATISTICS, 2004, 5 (03) : 427 - 443
  • [2] Detecting Gene-gene Interactions in Complex Diseases using Lasso Penalized Regression
    Keildson, Sarah L.
    Morris, Andrew P.
    Farrall, Martin
    GENETIC EPIDEMIOLOGY, 2010, 34 (08) : 932 - 932
  • [3] A SCREENING-TESTING APPROACH FOR DETECTING GENE-ENVIRONMENT INTERACTIONS USING SEQUENTIAL PENALIZED AND UNPENALIZED MULTIPLE LOGISTIC REGRESSION
    Frost, H. Robert
    Andrew, Angeline S.
    Karagas, Margaret R.
    Moore, Jason H.
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2015 (PSB), 2015, : 183 - 194
  • [4] Gene and pathway identification with Lp penalized Bayesian logistic regression
    Liu, Zhenqiu
    Gartenhaus, Ronald B.
    Tan, Ming
    Jiang, Feng
    Jiao, Xiaoli
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [5] Power of multifactor dimensionality reduction and penalized logistic regression for detecting gene-gene Interaction in a case-control study
    He, Hua
    Oetting, William S.
    Brott, Marcia J.
    Basu, Saonli
    BMC MEDICAL GENETICS, 2009, 10
  • [6] Structured Penalized Logistic Regression for Gene Selection in Gene Expression Data Analysis
    Liu, Cheng
    Wong, Hau San
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (01) : 312 - 321
  • [7] Penalized logistic regression with prior information for microarray gene expression classification
    Genc, Murat
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2024, 20 (01) : 107 - 122
  • [8] Multiclass-penalized logistic regression
    Nibbering, Didier
    Hastie, Trevor J.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 169
  • [9] Identifying gene-gene interactions using penalized tensor regression
    Wu, Mengyun
    Huang, Jian
    Ma, Shuangge
    STATISTICS IN MEDICINE, 2018, 37 (04) : 598 - 610
  • [10] Detecting Maternal-Fetal Genotype Interactions Associated With Conotruncal Heart Defects: A Haplotype-Based Analysis With Penalized Logistic Regression
    Li, Ming
    Erickson, Stephen W.
    Hobbs, Charlotte A.
    Li, Jingyun
    Tang, Xinyu
    Nick, Todd G.
    Macleod, Stewart L.
    Cleves, Mario A.
    GENETIC EPIDEMIOLOGY, 2014, 38 (03) : 198 - 208