Penalized logistic regression for detecting gene interactions

被引:247
|
作者
Park, Mee Young [1 ]
Hastie, Trevor [2 ,3 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
关键词
discrete factors; gene interactions; high dimensional; logistic regression; L-2-regularization;
D O I
10.1093/biostatistics/kxm010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose using a variant of logistic regression (LR) with L-2-regularization to fit gene-gene and gene environment interaction models. Studies have shown that many common diseases are influenced by interaction of certain genes. LR models with quadratic penalization not only correctly characterizes the influential genes along with their interaction structures but also yields additional benefits in handling high-dimensional, discrete factors with a binary response. We illustrate the advantages of using an L-2-regularization scheme and compare its performance with that of "multifactor dimensionality reduction" and "FlexTree," 2 recent tools for identifying gene-gene interactions. Through simulated and real data sets, we demonstrate that our method outperforms other methods in the identification of the interaction structures as well as prediction accuracy. In addition, we validate the significance of the factors selected through bootstrap analyses.
引用
收藏
页码:30 / 50
页数:21
相关论文
共 50 条
  • [31] Isolated-word recognition with penalized logistic regression machines
    Birkenes, Oystein
    Matsui, Tomoko
    Tanabe, Kunio
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 405 - 408
  • [32] Overlapping Haplotype Association Analysis via Penalized Logistic Regression
    Ayers, Kristin L.
    Cordell, Heather J.
    GENETIC EPIDEMIOLOGY, 2010, 34 (08) : 947 - 947
  • [33] A Penalized Logistic Regression Approach to Detection Based Phone Classification
    Siniscalchi, Sabato Marco
    Svendsen, Torbjorn
    Lee, Chin-Hui
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2390 - 2393
  • [34] Fitting Penalized Logistic Regression Models Using QR Factorization
    Klimaszewski, Jacek
    Korzen, Marcin
    COMPUTATIONAL SCIENCE - ICCS 2020, PT II, 2020, 12138 : 44 - 57
  • [35] PREDICTION OF THE NASH THROUGH PENALIZED MIXTURE OF LOGISTIC REGRESSION MODELS
    Morvan, Marie
    Devijver, Emilie
    Giacofci, Madison
    Monbet, Valerie
    ANNALS OF APPLIED STATISTICS, 2021, 15 (02): : 952 - 970
  • [36] Comparison of standard and penalized logistic regression in risk model development
    Yan, Yan
    Yang, Zhizhou
    Semenkovich, Tara R.
    Kozower, Benjamin D.
    Meyers, Bryan F.
    Nava, Ruben G.
    Kreisel, Daniel
    Puri, Varun
    JTCVS OPEN, 2022, 9 : 303 - 316
  • [38] An application of conditional logistic regression and multifactor dimensionality reduction for detecting gene-gene interactions on risk of myocardial infarction: The importance of model validation
    Coffey, CS
    Hebert, PR
    Ritchie, MD
    Krumholz, HM
    Gaziano, JM
    Ridker, PM
    Brown, NJ
    Vaughan, DE
    Moore, JH
    BMC BIOINFORMATICS, 2004, 5 (1)
  • [39] An application of conditional logistic regression and multifactor dimensionality reduction for detecting gene-gene Interactions on risk of myocardial infarction: The importance of model validation
    Christopher S Coffey
    Patricia R Hebert
    Marylyn D Ritchie
    Harlan M Krumholz
    J Michael Gaziano
    Paul M Ridker
    Nancy J Brown
    Douglas E Vaughan
    Jason H Moore
    BMC Bioinformatics, 5
  • [40] Variable selection in logistic regression for detecting SNP–SNP interactions: the rheumatoid arthritis example
    Hui-Yi Lin
    Renee Desmond
    S Louis Bridges
    Seng-jaw Soong
    European Journal of Human Genetics, 2008, 16 : 735 - 741