Comprehensive-GWAS: a pipeline for genome-wide association studies utilizing cross-validation to assess the predictivity of genetic variations

被引:1
|
作者
Dagasso, Gabrielle [1 ]
Yan, Yan [2 ]
Wang, Lipu [3 ]
Li, Longhai [4 ]
Kutcher, Randy [3 ]
Zhang, Wentao [5 ]
Jin, Lingling [6 ]
机构
[1] Thompson Rivers Univ, Dept Math & Stat, Kamloops, BC, Canada
[2] Thompson Rivers Univ, Dept Comp Sci, Kamloops, BC, Canada
[3] Univ Saskatchewan, Dept Plant Sci, Saskatoon, SK, Canada
[4] Univ Saskatchewan, Dept Math & Stat, Saskatoon, SK, Canada
[5] Natl Res Council Canada, Ottawa, ON, Canada
[6] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK, Canada
关键词
SOFTWARE; MODELS;
D O I
10.1109/BIBM49941.2020.9313355
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Genome-wide association studies is an important approach to associate genetic variations among individuals with a particular trait. Despite many GWAS programs have been developed based on different statistical models, their results could vary to a large extent. To obtain a more comprehensive and accurate set of associated SNPs with a trait, we present comprehensive-GWAS, a novel automated pipeline that allows a two-step wrapper model for seamless GWAS analyses between various programs involved in performing traditional GWAS analyses and machine learning methods with additional population structure analysis. It first performs population structure analysis, then executes multiple GWAS software and combines their results into a single SNP subset. After that, it selects relevant SNPs with high individual and/or joint effects from that SNP subset and assess the predictivity of the model using cross-validation by LASSO. The combined and validated "true" significant SNPs are output as Manhattan plot, QQ plot and statistical results for each trait. To demonstrate the utility of the comprehensive-GWAS pipeline, it was applied to 199 wheat varieties that were genotyped with 90K infinium SNP array and phenotyped for traits related to fusarium head blight (FHB) disease in greenhouse condition in the year 2019 with three replications. It pinpoints genome regions that are more likely to be responsible for FHB resistance. The results will contribute to characterizing the genetic architecture of wheat lines with the highest FHB resistance. The pipeline is publicly available at https://github.com/notTrivial/Comprehensive-GWAS.
引用
收藏
页码:1361 / 1367
页数:7
相关论文
共 50 条
  • [1] GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies
    Beck, Tim
    Hastings, Robert K.
    Gollapudi, Sirisha
    Free, Robert C.
    Brookes, Anthony J.
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2014, 22 (07) : 949 - 952
  • [2] GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies
    Tim Beck
    Robert K Hastings
    Sirisha Gollapudi
    Robert C Free
    Anthony J Brookes
    [J]. European Journal of Human Genetics, 2014, 22 : 949 - 952
  • [3] Genome-wide association studies (GWAS) and their importance in asthma
    Garcia-Sanchez, A.
    Isidoro-Garcia, M.
    Garcia-Solaesa, V.
    Sanz, C.
    Hernandez-Hernandez, L.
    Padron-Morales, J.
    Lorente-Toledano, F.
    Davila, I.
    [J]. ALLERGOLOGIA ET IMMUNOPATHOLOGIA, 2015, 43 (06) : 601 - 608
  • [4] Genome-wide association studies (GWAS) vs functional validation: the challenge of the post-GWAS era
    Martinez-Gil, Nuria
    Patino-Salazar, Juan David
    Rabionet, Raquel
    Grinberg, Daniel
    Balcells, Susanna
    [J]. REVISTA DE OSTEOPOROSIS Y METABOLISMO MINERAL, 2023, 15 (01) : 29 - 39
  • [5] Pro: Genome-Wide Association Studies (GWAS) in Asthma
    Weiss, Scott T.
    Silverman, Edwin K.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2011, 184 (06) : 631 - 633
  • [6] An Introduction to Genome-Wide Association Studies: GWAS for Dummies
    Uitterlinden, A. G.
    [J]. SEMINARS IN REPRODUCTIVE MEDICINE, 2016, 34 (04) : 196 - 204
  • [7] From genome-wide association studies (GWAS) to genome-wide polygenic scores (GPS)
    Jordan, Bertrand
    [J]. M S-MEDECINE SCIENCES, 2018, 34 (12): : 1116 - 1119
  • [8] An Analysis Pipeline for Genome-wide Association Studies
    Stefanov, Stefan
    Lautenberger, James
    Gold, Bert
    [J]. CANCER INFORMATICS, 2008, 6 : 455 - +
  • [9] Contributions of candidate-gene associations studies and genome-wide association studies (GWAS) to identification of genetic variations associated with asthma
    Friedrich, Frederico
    de Castro, Miguel Angelo Uflacker Lutz
    Herter, Eduardo da Costa
    Prestes, Laura Menestrino
    Pinto, Leonardo Araujo
    [J]. ANNALS OF TRANSLATIONAL MEDICINE, 2022,
  • [10] A Review of Prostate Cancer Genome-Wide Association Studies (GWAS)
    Benafif, Sarah
    Kote-Jarai, Zsofia
    Eeles, Rosalind A.
    [J]. CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2018, 27 (08) : 845 - 857