Kernel Machine SNP-Set Analysis for Censored Survival Outcomes in Genome-Wide Association Studies

被引:49
|
作者
Lin, Xinyi [1 ]
Cai, Tianxi [1 ]
Wu, Michael C. [2 ]
Zhou, Qian [1 ]
Liu, Geoffrey [3 ]
Christiani, David C. [4 ,5 ,6 ]
Lin, Xihong [1 ]
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Univ N Carolina, Dept Biostat, Chapel Hill, NC USA
[3] Princess Margaret Hosp, Ontario Canc Inst, Toronto, ON M4X 1K9, Canada
[4] Harvard Univ, Sch Publ Hlth, Dept Environm Hlth, Boston, MA 02115 USA
[5] Harvard Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
[6] Harvard Univ, Massachusetts Gen Hosp, Sch Med, Dept Med, Boston, MA USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Cox model; genetic studies; gene-based analysis; kernel machine; multi-locus test; score test; single nucleotide polymorphism; LINKAGE DISEQUILIBRIUM; GENETIC PATHWAY; MIXED MODELS; REGRESSION; TESTS; CANCER; HAPLOTYPES; SIMILARITY; POWER;
D O I
10.1002/gepi.20610
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
In this article, we develop a powerful test for identifying single nucleotide polymorphism (SNP)-sets that are predictive of survival with data from genome-wide association studies. We first group typed SNPs into SNP-sets based on genomic features and then apply a score test to assess the overall effect of each SNP-set on the survival outcome through a kernel machine Cox regression framework. This approach uses genetic information from all SNPs in the SNP-set simultaneously and accounts for linkage disequilibrium (LD), leading to a powerful test with reduced degrees of freedom when the typed SNPs are in LD with each other. This type of test also has the advantage of capturing the potentially nonlinear effects of the SNPs, SNP-SNP interactions (epistasis), and the joint effects of multiple causal variants. By simulating SNP data based on the LD structure of real genes from the HapMap project, we demonstrate that our proposed test is more powerful than the standard single SNP minimum P-value-based test for association studies with censored survival outcomes. We illustrate the proposed test with a real data application. Genet. Epidemiol. 35:620-631, 2011. (C) 2011 Wiley Periodicals, Inc.
引用
收藏
页码:620 / 631
页数:12
相关论文
共 50 条
  • [41] GStream: Improving SNP and CNV Coverage on Genome-Wide Association Studies
    Alonso, Arnald
    Marsal, Sara
    Tortosa, Rauel
    Canela-Xandri, Oriol
    Julia, Antonio
    PLOS ONE, 2013, 8 (07):
  • [42] Evaluation of coverage variation of SNP chips for genome-wide association studies
    Mingyao Li
    Chun Li
    Weihua Guan
    European Journal of Human Genetics, 2008, 16 : 635 - 643
  • [43] A genome-wide SNP panel for use in mapping and association studies.
    Scott, KE
    Studebaker, JF
    Kuo, F
    Phillips, MS
    Bauer, P
    Ankener, WA
    Donaldson, MA
    Cardon, L
    Boyce-Jacino, MT
    AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 71 (04) : 463 - 463
  • [44] SNP Selection in Genome-Wide Association Studies via Penalized Support Vector Machine with MAX Test
    Kim, Jinseog
    Sohn, Insuk
    Kim, Dennis
    Jung, Sin-Ho
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2013, 2013
  • [45] An Analysis Pipeline for Genome-wide Association Studies
    Stefanov, Stefan
    Lautenberger, James
    Gold, Bert
    CANCER INFORMATICS, 2008, 6 : 455 - +
  • [46] REPLICABILITY ANALYSIS FOR GENOME-WIDE ASSOCIATION STUDIES
    Heller, Ruth
    Yekutieli, Daniel
    ANNALS OF APPLIED STATISTICS, 2014, 8 (01): : 481 - 498
  • [47] Power analysis for genome-wide association studies
    Robert J Klein
    BMC Genetics, 8
  • [48] Power analysis for genome-wide association studies
    Klein, Robert J.
    BMC GENETICS, 2007, 8 (1)
  • [49] The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies
    Barnett, Ian
    Mukherjee, Rajarshi
    Lin, Xihong
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) : 64 - 76
  • [50] How Much Are We Missing in SNP-by-SNP Analyses of Genome-wide Association Studies?
    Shi, Min
    Weinberg, Clarice R.
    EPIDEMIOLOGY, 2011, 22 (06) : 845 - 847