Correlation-based tests for the formal comparison of polygenic scores in multiple populations

被引:0
|
作者
Gunn, Sophia [1 ,2 ]
Lunetta, Kathryn L. [1 ]
机构
[1] Boston Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02215 USA
[2] New York Genome Ctr, New York, NY 10013 USA
来源
PLOS GENETICS | 2024年 / 20卷 / 04期
关键词
D O I
10.1371/journal.pgen.1011249
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Polygenic scores (PGS) are measures of genetic risk, derived from the results of genome wide association studies (GWAS). Previous work has proposed the coefficient of determination (R2) as an appropriate measure by which to compare PGS performance in a validation dataset. Here we propose correlation-based methods for evaluating PGS performance by adapting previous work which produced a statistical framework and robust test statistics for the comparison of multiple correlation measures in multiple populations. This flexible framework can be extended to a wider variety of hypothesis tests than currently available methods. We assess our proposed method in simulation and demonstrate its utility with two examples, assessing previously developed PGS for low-density lipoprotein cholesterol and height in multiple populations in the All of Us cohort. Finally, we provide an R package 'coranova' with both parametric and nonparametric implementations of the described methods. Polygenic scores (PGS) are measures of genetic risk of disease that have been widely embraced by the scientific community. While there are many methods available to develop PGS, we have limited tools by which to compare PGS performance. Previous work has proposed an R2-based approach which appropriately accounts for the correlation between PGS when comparing their performance. Here, we propose correlation-based tests which can assess multiple scores in multiple populations while accounting for the correlation between the scores. Our method is highly flexible and can be used by researchers to test any linear hypothesis of PGS performance, though we suggest three ANOVA-like tests as a starting point. We apply our method to PGS developed for LDL cholesterol and height in the All of Us cohort. In these examples, we demonstrate how our method can be used by researchers to compare and evaluate PGS in multiple populations. This approach will be particularly useful as we look to improve PGS performance in underrepresented populations in genetic research and need to evaluate PGS in multiple populations to appropriately assess PGS performance.
引用
下载
收藏
页数:17
相关论文
共 50 条
  • [21] Relationship of Major Depressive Disorder and Schizophrenia Polygenic Risk Scores to Suicide: A Comparison Between European and Asian Ancestry Populations
    Otsuka, Ikuo
    Galfalvy, Hanga
    Guo, Jia
    Akiyama, Masato
    Okazaki, Satoshi
    Terao, Chikashi
    Rujescu, Dan
    Turecki, Gustavo
    Hishimoto, Akitoyo
    Mann, J. John
    ARCHIVES OF SUICIDE RESEARCH, 2024,
  • [22] Comparison of derivative-based and correlation-based methods to estimate effective connectivity in neural networks
    Niklas Laasch
    Wilhelm Braun
    Lisa Knoff
    Jan Bielecki
    Claus C. Hilgetag
    Scientific Reports, 15 (1)
  • [23] Correlation-based feature selection of single cell transcriptomics data from multiple sources
    Nenad S. Mitić
    Saša N. Malkov
    Mirjana M. Maljković Ružičić
    Aleksandar N. Veljković
    Ivan Lj. Čukić
    Xin Lin
    Minjie Lyu
    Vladimir Brusić
    Journal of Big Data, 12 (1)
  • [24] Comparison of a Local Correlation-Based Transition Model with an eN-Method for Transition Prediction
    Seyfert, Cornelia
    Krumbein, Andreas
    NEW RESULTS IN NUMERICAL AND EXPERIMENTAL FLUID MECHANICS VIII, 2013, 121 : 541 - 548
  • [25] Gene-Based Association Tests Using New Polygenic Risk Scores and Incorporating Gene Expression Data
    Yan, Shijia
    Sha, Qiuying
    Zhang, Shuanglin
    GENES, 2022, 13 (07)
  • [26] Correlation-based analysis and generation of multiple spike trains using hawkes models with an exogenous input
    Krumin, Michael
    Reutsky, Inna
    Shoham, Shy
    Frontiers in Computational Neuroscience, 2010, 4 (NOV):
  • [27] CORRELATIONS OF SIGHTING-EYE DOMINANCE TESTS AND COMPARISON OF COMBINED SCORES IN CLASSROOM, CLINIC, AND OPHTHALMOLOGICAL POPULATIONS
    PALMER, LL
    PERCEPTUAL AND MOTOR SKILLS, 1977, 45 (03) : 1259 - 1263
  • [28] Correlation-based analysis and generation of multiple spike trains using Hawkes models with an exogenous input
    Krumin, Michael
    Reutsky, Inna
    Shoham, Shy
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2010, 4
  • [29] Heterogeneous Defect Prediction through Correlation-Based Selection of Multiple Source Projects and Ensemble Learning
    Kim, Eunseob
    Baik, Jongmoon
    Ryu, Duksan
    2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021), 2021, : 503 - 513
  • [30] An effective correlation-based compromise approach for multiple criteria decision analysis with Pythagorean fuzzy information
    Chen, Ting-Yu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (03) : 3529 - 3541