Genome-wide association analysis by lasso penalized logistic regression

被引:522
|
作者
Wu, Tong Tong [5 ]
Chen, Yi Fang [4 ]
Hastie, Trevor [3 ,4 ]
Sobel, Eric [1 ]
Lange, Kenneth [1 ,2 ]
机构
[1] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Biomath, Los Angeles, CA 90095 USA
[3] Stanford Univ, Dept Biostat, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[5] Univ Maryland, Dept Epidemiol & Biostat, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/btp041
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In ordinary regression, imposition of a lasso penalty makes continuous model selection straightforward. Lasso penalized regression is particularly advantageous when the number of predictors far exceeds the number of observations. Method: The present article evaluates the performance of lasso penalized logistic regression in case-control disease gene mapping with a large number of SNPs (single nucleotide polymorphisms) predictors. The strength of the lasso penalty can be tuned to select a predetermined number of the most relevant SNPs and other predictors. For a given value of the tuning constant, the penalized likelihood is quickly maximized by cyclic coordinate ascent. Once the most potent marginal predictors are identified, their two-way and higher order interactions can also be examined by lasso penalized logistic regression. Results: This strategy is tested on both simulated and real data. Our findings on coeliac disease replicate the previous SNP results and shed light on possible interactions among the SNPs.
引用
收藏
页码:714 / 721
页数:8
相关论文
共 50 条
  • [1] Penalized Regression and Risk Prediction in Genome-Wide Association Studies
    Austin, Erin
    Pan, Wei
    Shen, Xiaotong
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2013, 6 (04) : 315 - 328
  • [2] PENALIZED REGRESSION FOR GENOME-WIDE ASSOCIATION SCREENING OF SEQUENCE DATA
    Zhou, H.
    Alexander, D. H.
    Sehl, M. E.
    Sinsheimer, J. S.
    Sobel, E. M.
    Lange, K.
    [J]. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2011, 2011, : 106 - 117
  • [3] Mixed logistic regression in genome-wide association studies
    Jacqueline Milet
    David Courtin
    André Garcia
    Hervé Perdry
    [J]. BMC Bioinformatics, 21
  • [4] Mixed logistic regression in genome-wide association studies
    Milet, Jacqueline
    Courtin, David
    Garcia, Andre
    Perdry, Herve
    [J]. BMC BIOINFORMATICS, 2020, 21 (01)
  • [5] A Two-Stage Penalized Logistic Regression Approach to Case-Control Genome-Wide Association Studies
    Jingyuan Zhao
    Zehua Chen
    [J]. JOURNAL OF PROBABILITY AND STATISTICS, 2012, 2012
  • [6] SNP Selection in Genome-Wide and Candidate Gene Studies via Penalized Logistic Regression
    Ayers, Kristin L.
    Cordell, Heather J.
    [J]. GENETIC EPIDEMIOLOGY, 2010, 34 (08) : 879 - 891
  • [7] A LASSO penalized regression approach for genome-wide association analyses using related individuals: application to the Genetic Analysis Workshop 19 simulated data
    Charalampos Papachristou
    Carole Ober
    Mark Abney
    [J]. BMC Proceedings, 10 (Suppl 7)
  • [8] Analysis of genome-wide association data by large-scale Bayesian logistic regression
    Yuanjia Wang
    Nanshi Sha
    Yixin Fang
    [J]. BMC Proceedings, 3 (Suppl 7)
  • [9] Accounting for linkage disequilibrium in genome-wide association studies: A penalized regression method
    Liu, Jin
    Wang, Kai
    Ma, Shuangge
    Huang, Jian
    [J]. STATISTICS AND ITS INTERFACE, 2013, 6 (01) : 99 - 115
  • [10] Genome-wide association studies using a penalized moving-window regression
    Bao, Minli
    Wang, Kai
    [J]. BIOINFORMATICS, 2017, 33 (24) : 3887 - 3894