A clustering linear combination method for multiple phenotype association studies based on GWAS summary statistics

被引:2
|
作者
Wang, Meida [1 ]
Cao, Xuewei [1 ]
Zhang, Shuanglin [1 ]
Sha, Qiuying [1 ]
机构
[1] Michigan Technol Univ, Math Sci, Houghton, MI 49931 USA
关键词
TRAIT ANALYSIS; GENE; CLASSIFICATION; DISEASES; TESTS; MODEL; RARE;
D O I
10.1038/s41598-023-30415-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
There is strong evidence showing that joint analysis of multiple phenotypes in genome-wide association studies (GWAS) can increase statistical power when detecting the association between genetic variants and human complex diseases. We previously developed the Clustering Linear Combination (CLC) method and a computationally efficient CLC (ceCLC) method to test the association between multiple phenotypes and a genetic variant, which perform very well. However, both of these methods require individual-level genotypes and phenotypes that are often not easily accessible. In this research, we develop a novel method called sCLC for association studies of multiple phenotypes and a genetic variant based on GWAS summary statistics. We use the LD score regression to estimate the correlation matrix among phenotypes. The test statistic of sCLC is constructed by GWAS summary statistics and has an approximate Cauchy distribution. We perform a variety of simulation studies and compare sCLC with other commonly used methods for multiple phenotype association studies using GWAS summary statistics. Simulation results show that sCLC can control Type I error rates well and has the highest power in most scenarios. Moreover, we apply the newly developed method to the UK Biobank GWAS summary statistics from the XIII category with 70 related musculoskeletal system and connective tissue phenotypes. The results demonstrate that sCLC detects the most number of significant SNPs, and most of these identified SNPs can be matched to genes that have been reported in the GWAS catalog to be associated with those phenotypes. Furthermore, sCLC also identifies some novel signals that were missed by standard GWAS, which provide new insight into the potential genetic factors of the musculoskeletal system and connective tissue phenotypes.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Gene-based Association Tests Using GWAS Summary Statistics and Incorporating eQTL
    Cao, Xuewei
    Wang, Xuexia
    Zhang, Shuanglin
    Sha, Qiuying
    GENETIC EPIDEMIOLOGY, 2021, 45 (07) : 746 - 746
  • [22] Gene-based association tests in family samples using GWAS summary statistics
    Wang, Peng
    Xu, Xiao
    Li, Ming
    Lou, Xiang-Yang
    Xu, Siqi
    Wu, Baolin
    Gao, Guimin
    Yin, Ping
    Liu, Nianjun
    GENETIC EPIDEMIOLOGY, 2024, 48 (03) : 103 - 113
  • [23] Gene-based association tests using GWAS summary statistics and incorporating eQTL
    Cao, Xuewei
    Wang, Xuexia
    Zhang, Shuanglin
    Sha, Qiuying
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [24] Joint analysis of multiple phenotypes using a clustering linear combination method based on hierarchical clustering
    Li, Xueling
    Zhang, Shuanglin
    Sha, Qiuying
    GENETIC EPIDEMIOLOGY, 2020, 44 (01) : 67 - 78
  • [25] An Omnibus Test for Detecting Multiple Phenotype Associations Based on GWAS Summary Level Data
    Liu, Wei
    Guo, Yunshan
    Liu, Zhonghua
    FRONTIERS IN GENETICS, 2021, 12
  • [26] Control for Population Stratification in Genetic Association Studies based on GWAS Summary Data
    Yan, Shijian
    Sha, Qiuying
    Zhang, Shuanglin
    GENETIC EPIDEMIOLOGY, 2021, 45 (07) : 760 - 760
  • [27] Approximate conditional phenotype analysis based on genome wide association summary statistics
    Wu, Peitao
    Wang, Biqi
    Lubitz, Steven A.
    Benjamin, Emelia J.
    Meigs, James B.
    Dupuis, Josee
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [28] Approximate conditional phenotype analysis based on genome wide association summary statistics
    Peitao Wu
    Biqi Wang
    Steven A. Lubitz
    Emelia J. Benjamin
    James B. Meigs
    Josée Dupuis
    Scientific Reports, 11
  • [29] PLEIO: a method to map and interpret pleiotropic loci with GWAS summary statistics
    Lee, Cue Hyunkyu
    Shi, Huwenbo
    Pasaniuc, Bogdan
    Eskin, Eleazar
    Han, Buhm
    AMERICAN JOURNAL OF HUMAN GENETICS, 2021, 108 (01) : 36 - 48
  • [30] Big Data Clustering based on Summary Statistics
    Fu, Junsong
    Liu, Yun
    Zhang, Zhenjiang
    Xiong, Fei
    2015 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE THEORY, SYSTEMS AND APPLICATIONS (CCITSA 2015), 2015, : 87 - 91