Imputing Phenotypes for Genome-wide Association Studies

被引:29
|
作者
Hormozdiari, Farhad [1 ]
Kang, Eun Yong [1 ]
Bilow, Michael [1 ]
Ben-David, Eyal [2 ]
Vulpe, Chris [3 ]
McLachlan, Stela [4 ]
Lusis, Aldons J. [2 ,5 ]
Han, Buhm [6 ,7 ]
Eskin, Eleazar [1 ,2 ]
机构
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[3] Univ Calif Berkeley, Dept Nutr Sci & Toxicol, Berkeley, CA 94720 USA
[4] Univ Edinburgh, Usher Inst Populat Hlth Sci & Informat, Ctr Populat Hlth Sci, Edinburgh EH8 9AG, Midlothian, Scotland
[5] Univ Calif Los Angeles, Dept Med, Los Angeles, CA 90095 USA
[6] Univ Ulsan, Coll Med, Dept Convergence Med, Seoul 05505, South Korea
[7] Asan Med Ctr, Asan Inst Life Sci, Seoul 05505, South Korea
基金
美国国家科学基金会;
关键词
LINEAR MIXED MODELS; LINKAGE DISEQUILIBRIUM; SUSCEPTIBILITY LOCI; COMPLEX TRAITS; IMPUTATION; DISEASE; METAANALYSIS; ALGORITHMS; INFERENCE; SELECTION;
D O I
10.1016/j.ajhg.2016.04.013
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genome-wide association studies (GWASs) have been successful in detecting variants correlated with phenotypes of clinical interest. However, the power to detect these variants depends on the number of individuals whose phenotypes are collected, and for phenotypes that are difficult to collect, the sample size might be insufficient to achieve the desired statistical power. The phenotype of interest is often difficult to collect, whereas surrogate phenotypes or related phenotypes are easier to collect and have already been collected in very large samples. This paper demonstrates how we take advantage of these additional related phenotypes to impute the phenotype of interest or target phenotype and then perform association analysis. Our approach leverages the correlation structure between phenotypes to perform the imputation. The correlation structure can be estimated from a smaller complete dataset for which both the target and related phenotypes have been collected. Under some assumptions, the statistical power can be computed analytically given the correlation structure of the phenotypes used in imputation. In addition, our method can impute the summary statistic of the target phenotype as a weighted linear combination of the summary statistics of related phenotypes. Thus, our method is applicable to datasets for which we have access only to summary statistics and not to the raw genotypes. We illustrate our approach by analyzing associated loci to triglycerides (TGs), body mass index (BMI), and systolic blood pressure (SBP) in the Northern Finland Birth Cohort dataset.
引用
收藏
页码:89 / 103
页数:15
相关论文
共 50 条
  • [1] A Unifying Framework for Imputing Summary Statistics in Genome-Wide Association Studies
    Wu, Yue
    Eskin, Eleazar
    Sankararaman, Sriram
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (03) : 418 - 428
  • [2] Genome-wide association studies of cardiac electrical phenotypes
    Glinge, Charlotte
    Lahrouchi, Najim
    Jabbari, Reza
    Tfelt-Hansen, Jacob
    Bezzina, Connie R.
    [J]. CARDIOVASCULAR RESEARCH, 2020, 116 (09) : 1620 - 1634
  • [3] Analysis of multiple related phenotypes in genome-wide association studies
    Oh, Sohee
    Huh, Iksoo
    Lee, Seung Yeoun
    Park, Taesung
    [J]. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2016, 14 (05)
  • [4] Dimensional and transdiagnostic phenotypes in psychiatric genome-wide association studies
    Waszczuk, Monika A.
    Jonas, Katherine G.
    Bornovalova, Marina
    Breen, Gerome
    Bulik, Cynthia M.
    Docherty, Anna R.
    Eley, Thalia C.
    Hettema, John M.
    Kotov, Roman
    Krueger, Robert F.
    Lencz, Todd
    Li, James J.
    Vassos, Evangelos
    Waldman, Irwin D.
    [J]. MOLECULAR PSYCHIATRY, 2023, 28 (12) : 4943 - 4953
  • [5] Genome-wide association studies with high-dimensional phenotypes
    Marttinen, Pekka
    Gillberg, Jussi
    Havulinna, Aki
    Corander, Jukka
    Kaski, Samuel
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2013, 12 (04) : 413 - 431
  • [6] Dimensional and transdiagnostic phenotypes in psychiatric genome-wide association studies
    Monika A. Waszczuk
    Katherine G. Jonas
    Marina Bornovalova
    Gerome Breen
    Cynthia M. Bulik
    Anna R. Docherty
    Thalia C. Eley
    John M. Hettema
    Roman Kotov
    Robert F. Krueger
    Todd Lencz
    James J. Li
    Evangelos Vassos
    Irwin D. Waldman
    [J]. Molecular Psychiatry, 2023, 28 : 4943 - 4953
  • [7] Genome-wide association studies of brain imaging phenotypes in UK Biobank
    Elliott, Lloyd T.
    Sharp, Kevin
    Alfaro-Almagro, Fidel
    Shi, Sinan
    Miller, Karla L.
    Douaud, Gwenaelle
    Marchini, Jonathan
    Smith, Stephen M.
    [J]. NATURE, 2018, 562 (7726) : 210 - +
  • [8] phenosim - A software to simulate phenotypes for testing in genome-wide association studies
    Guenther, Torsten
    Gawenda, Inka
    Schmid, Karl J.
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [9] phenosim - A software to simulate phenotypes for testing in genome-wide association studies
    Torsten Günther
    Inka Gawenda
    Karl J Schmid
    [J]. BMC Bioinformatics, 12
  • [10] Genome-wide association studies of hematologic phenotypes: a window into human hematopoiesis
    Sankaran, Vijay G.
    Orkin, Stuart H.
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 2013, 23 (03) : 339 - 344