A clustering linear combination method for multiple phenotype association studies based on GWAS summary statistics

被引:2
|
作者
Wang, Meida [1 ]
Cao, Xuewei [1 ]
Zhang, Shuanglin [1 ]
Sha, Qiuying [1 ]
机构
[1] Michigan Technol Univ, Math Sci, Houghton, MI 49931 USA
关键词
TRAIT ANALYSIS; GENE; CLASSIFICATION; DISEASES; TESTS; MODEL; RARE;
D O I
10.1038/s41598-023-30415-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
There is strong evidence showing that joint analysis of multiple phenotypes in genome-wide association studies (GWAS) can increase statistical power when detecting the association between genetic variants and human complex diseases. We previously developed the Clustering Linear Combination (CLC) method and a computationally efficient CLC (ceCLC) method to test the association between multiple phenotypes and a genetic variant, which perform very well. However, both of these methods require individual-level genotypes and phenotypes that are often not easily accessible. In this research, we develop a novel method called sCLC for association studies of multiple phenotypes and a genetic variant based on GWAS summary statistics. We use the LD score regression to estimate the correlation matrix among phenotypes. The test statistic of sCLC is constructed by GWAS summary statistics and has an approximate Cauchy distribution. We perform a variety of simulation studies and compare sCLC with other commonly used methods for multiple phenotype association studies using GWAS summary statistics. Simulation results show that sCLC can control Type I error rates well and has the highest power in most scenarios. Moreover, we apply the newly developed method to the UK Biobank GWAS summary statistics from the XIII category with 70 related musculoskeletal system and connective tissue phenotypes. The results demonstrate that sCLC detects the most number of significant SNPs, and most of these identified SNPs can be matched to genes that have been reported in the GWAS catalog to be associated with those phenotypes. Furthermore, sCLC also identifies some novel signals that were missed by standard GWAS, which provide new insight into the potential genetic factors of the musculoskeletal system and connective tissue phenotypes.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A clustering linear combination method for multiple phenotype association studies based on GWAS summary statistics
    Meida Wang
    Xuewei Cao
    Shuanglin Zhang
    Qiuying Sha
    Scientific Reports, 13
  • [2] An Adaptive Association Test for Multiple Phenotypes with GWAS Summary Statistics
    Kim, Junghi
    Bai, Yun
    Pan, Wei
    GENETIC EPIDEMIOLOGY, 2015, 39 (08) : 651 - 663
  • [3] Control for population stratification in genetic association studies based on GWAS summary statistics
    Yan, Shijia
    Sha, Qiuying
    Zhang, Shuanglin
    GENETIC EPIDEMIOLOGY, 2022, 46 (08) : 604 - 614
  • [4] Meta-analysis of set-based multiple phenotype association test based on GWAS summary statistics from different cohorts
    Zhu, Lirong
    Zhang, Shuanglin
    Sha, Qiuying
    FRONTIERS IN GENETICS, 2024, 15
  • [5] Flexible Framework for Gene-Based Association Studies using GWAS Summary Statistics
    Belonogova, Nadezhda
    Svishcheva, Gulnara
    Tsepilov, Yakov
    Axenovich, Tatiana
    HUMAN HEREDITY, 2021, 85 (02) : 72 - 72
  • [6] Multiple phenotype association tests using summary statistics in genome-wide association studies
    Liu, Zhonghua
    Lin, Xihong
    BIOMETRICS, 2018, 74 (01) : 165 - 175
  • [7] Gene- and pathway-based association tests for multiple traits with GWAS summary statistics
    Kwak, Il-Youp
    Pan, Wei
    BIOINFORMATICS, 2017, 33 (01) : 64 - 71
  • [8] Gene- and Pathway-Based Association Tests for Multiple Traits with GWAS Summary Statistics
    Kwak, Il-Youp
    Pan, Wei
    GENETIC EPIDEMIOLOGY, 2016, 40 (07) : 648 - 648
  • [9] sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics
    Belonogova, Nadezhda M.
    Svishcheva, Gulnara R.
    Kirichenko, Anatoly, V
    Zorkoltseva, Irina, V
    Tsepilov, Yakov A.
    Axenovich, Tatiana, I
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (06)
  • [10] Comparison of Multiple Phenotype Association Tests Using Summary Statistics in Genome-wide Association Studies
    Sitlani, Colleen M.
    Baldassari, Antoine R.
    Highland, Heather M.
    Hodonsky, Chani J.
    McKnight, Barbara
    Avery, Christy L.
    GENETIC EPIDEMIOLOGY, 2019, 43 (07) : 909 - 910