Imputing Phenotypes for Genome-wide Association Studies

被引:29
|
作者
Hormozdiari, Farhad [1 ]
Kang, Eun Yong [1 ]
Bilow, Michael [1 ]
Ben-David, Eyal [2 ]
Vulpe, Chris [3 ]
McLachlan, Stela [4 ]
Lusis, Aldons J. [2 ,5 ]
Han, Buhm [6 ,7 ]
Eskin, Eleazar [1 ,2 ]
机构
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[3] Univ Calif Berkeley, Dept Nutr Sci & Toxicol, Berkeley, CA 94720 USA
[4] Univ Edinburgh, Usher Inst Populat Hlth Sci & Informat, Ctr Populat Hlth Sci, Edinburgh EH8 9AG, Midlothian, Scotland
[5] Univ Calif Los Angeles, Dept Med, Los Angeles, CA 90095 USA
[6] Univ Ulsan, Coll Med, Dept Convergence Med, Seoul 05505, South Korea
[7] Asan Med Ctr, Asan Inst Life Sci, Seoul 05505, South Korea
基金
美国国家科学基金会;
关键词
LINEAR MIXED MODELS; LINKAGE DISEQUILIBRIUM; SUSCEPTIBILITY LOCI; COMPLEX TRAITS; IMPUTATION; DISEASE; METAANALYSIS; ALGORITHMS; INFERENCE; SELECTION;
D O I
10.1016/j.ajhg.2016.04.013
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genome-wide association studies (GWASs) have been successful in detecting variants correlated with phenotypes of clinical interest. However, the power to detect these variants depends on the number of individuals whose phenotypes are collected, and for phenotypes that are difficult to collect, the sample size might be insufficient to achieve the desired statistical power. The phenotype of interest is often difficult to collect, whereas surrogate phenotypes or related phenotypes are easier to collect and have already been collected in very large samples. This paper demonstrates how we take advantage of these additional related phenotypes to impute the phenotype of interest or target phenotype and then perform association analysis. Our approach leverages the correlation structure between phenotypes to perform the imputation. The correlation structure can be estimated from a smaller complete dataset for which both the target and related phenotypes have been collected. Under some assumptions, the statistical power can be computed analytically given the correlation structure of the phenotypes used in imputation. In addition, our method can impute the summary statistic of the target phenotype as a weighted linear combination of the summary statistics of related phenotypes. Thus, our method is applicable to datasets for which we have access only to summary statistics and not to the raw genotypes. We illustrate our approach by analyzing associated loci to triglycerides (TGs), body mass index (BMI), and systolic blood pressure (SBP) in the Northern Finland Birth Cohort dataset.
引用
收藏
页码:89 / 103
页数:15
相关论文
共 50 条
  • [41] Genome-wide association studies in neurology
    Tan, Meng-Shan
    Jiang, Teng
    Tan, Lan
    Yu, Jin-Tai
    [J]. ANNALS OF TRANSLATIONAL MEDICINE, 2014, 2 (12)
  • [42] Guidelines for Genome-Wide Association Studies
    Barsh, Gregory S.
    Copenhaver, Gregory P.
    Gibson, Greg
    Williams, Scott M.
    [J]. PLOS GENETICS, 2012, 8 (07):
  • [43] Genome-Wide Association Studies and Diet
    Ferguson, Lynnette R.
    [J]. JOURNAL OF NUTRIGENETICS AND NUTRIGENOMICS, 2009, 2 (4-5) : 191 - 191
  • [44] The road to genome-wide association studies
    Kruglyak, Leonid
    [J]. NATURE REVIEWS GENETICS, 2008, 9 (04) : 314 - 318
  • [45] Genome-wide association studies in ADHD
    Barbara Franke
    Benjamin M. Neale
    Stephen V. Faraone
    [J]. Human Genetics, 2009, 126 : 13 - 50
  • [46] Genome-wide association studies in pharmacogenomics
    Daly, Ann K.
    [J]. NATURE REVIEWS GENETICS, 2010, 11 (04) : 241 - 246
  • [47] Genome-Wide Association Studies of Cancer
    Stadler, Zsofia K.
    Thom, Peter
    Robson, Mark E.
    Weitzel, Jeffrey N.
    Kauff, Noah D.
    Hurley, Karen E.
    Devlin, Vincent
    Gold, Bert
    Klein, Robert J.
    Offit, Kenneth
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2010, 28 (27) : 4255 - 4267
  • [48] Genome-wide association studies in atherothrombosis
    Lotta, Luca Andrea
    [J]. EUROPEAN JOURNAL OF INTERNAL MEDICINE, 2010, 21 (02) : 74 - 78
  • [49] The road to genome-wide association studies
    Leonid Kruglyak
    [J]. Nature Reviews Genetics, 2008, 9 : 314 - 318
  • [50] Genome-Wide Association Studies in Atherosclerosis
    S. Sivapalaratnam
    M. M. Motazacker
    S. Maiwald
    G. K. Hovingh
    J. J. P. Kastelein
    M. Levi
    M. D. Trip
    G. M. Dallinga-Thie
    [J]. Current Atherosclerosis Reports, 2011, 13 : 225 - 232