A Two-Stage Penalized Logistic Regression Approach to Case-Control Genome-Wide Association Studies

被引:1
|
作者
Jingyuan Zhao [1 ]
Zehua Chen [2 ]
机构
[1] Genome Inst Singapore, Human Genet, Singapore 138672, Singapore
[2] Natl Univ Singapore, Dept Stat & Appl Probabil, 3 Sci Dr 2, Singapore 117546, Singapore
关键词
D O I
10.1155/2012/642403
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a two-stage penalized logistic regression approach to case-control genome-wide association studies. This approach consists of a screening stage and a selection stage. In the screening stage, main-effect and interaction-effect features are screened by using L-1-penalized logistic like-lihoods. In the selection stage, the retained features are ranked by the logistic likelihood with the smoothly clipped absolute deviation (SCAD) penalty (Fan and Li, 2001) and Jeffrey's Prior penalty (Firth, 1993), a sequence of nested candidate models are formed, and the models are assessed by a family of extended Bayesian information criteria (J. Chen and Z. Chen, 2008). The proposed approach is applied to the analysis of the prostate cancer data of the Cancer Genetic Markers of Susceptibility (CGEMS) project in the National Cancer Institute, USA. Simulation studies are carried out to compare the approach with the pair-wise multiple testing approach (Marchini et al. 2005) and the LASSO-patternsearch algorithm (Shi et al. 2007)
引用
收藏
页数:15
相关论文
共 50 条
  • [1] A Novel Two-Stage Approach for Epistasis Detection in Genome-Wide Case-Control Studies
    Liao, Zhongli
    Zeng, Qingguang
    Liao, Bo
    Li, Xiong
    [J]. BIOCHEMICAL GENETICS, 2014, 52 (9-10) : 403 - 414
  • [2] Genome-wide association studies using an adaptive two-stage analysis for a case-control design
    Kijoung Song
    Qing Lu
    Xiwu Lin
    Dawn Waterworth
    Robert C Elston
    [J]. BMC Proceedings, 1 (Suppl 1)
  • [3] A General Framework for Two-Stage Analysis of Genome-wide Association Studies and Its Application to Case-Control Studies
    Wason, James M. S.
    Dudbridge, Frank
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 90 (05) : 760 - 773
  • [4] Genome-wide association analysis by lasso penalized logistic regression
    Wu, Tong Tong
    Chen, Yi Fang
    Hastie, Trevor
    Sobel, Eric
    Lange, Kenneth
    [J]. BIOINFORMATICS, 2009, 25 (06) : 714 - 721
  • [5] A Novel Powerful Joint Analysis with Data Fusion in Two-stage Case-Control Genome-wide Association Studies
    Pan, Dong-Dong
    Li, Zheng-Bang
    Li, Qi-Zhai
    Fung, Wing Kam
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2016, 45 (07) : 2362 - 2376
  • [6] A Novel Two-Stage Approach for Epistasis Detection in Genome-Wide Case–Control Studies
    Zhongli Liao
    Qingguang Zeng
    Bo Liao
    Xiong Li
    [J]. Biochemical Genetics, 2014, 52 : 403 - 414
  • [7] A Two-stage Gene-based Approach for Genome-wide Association Studies
    Zhao, Jingyuan
    Thalamuthu, Anbupalam
    [J]. GENETIC EPIDEMIOLOGY, 2010, 34 (08) : 977 - 978
  • [8] Penalized Regression and Risk Prediction in Genome-Wide Association Studies
    Austin, Erin
    Pan, Wei
    Shen, Xiaotong
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2013, 6 (04) : 315 - 328
  • [9] Mixed logistic regression in genome-wide association studies
    Jacqueline Milet
    David Courtin
    André Garcia
    Hervé Perdry
    [J]. BMC Bioinformatics, 21
  • [10] Optimal designs for two-stage genome-wide association studies
    Skoll, Andrew D.
    Scott, Laura J.
    Abecasis, Goncalo R.
    Boehnke, Michael
    [J]. GENETIC EPIDEMIOLOGY, 2007, 31 (07) : 776 - 788