Functional annotation signatures of disease susceptibility loci improve SNP association analysis

Cited by: 12
Authors
Iversen, Edwin S. [1 ]
Lipton, Gary [1 ]
Clyde, Merlise A. [1 ]
Monteiro, Alvaro N. A. [2 ]
Affiliations
[1] Duke Univ, Dept Stat Sci, Durham, NC 27708 USA
[2] H Lee Moffitt Canc Ctr & Res Inst, Canc Epidemiol Program, Tampa, FL 33612 USA
Source
BMC GENOMICS | 2014 / Vol. 15
Funding
US National Science Foundation; US National Institutes of Health;
Keywords
Association study; GWAS; SNPs; Functional annotations; Bayesian analysis; ENCODE project; GENOME-WIDE ASSOCIATION; BINDING SITES; CANCER; VARIANTS; DISCOVERY; INFERENCE; DATABASE; BREAST; LENGTH; STATE;
DOI
10.1186/1471-2164-15-398
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Genetic association studies are conducted to discover genetic loci that contribute to an inherited trait, identify the variants behind these associations and ascertain their functional role in determining the phenotype. To date, functional annotations of the genetic variants have rarely played more than an indirect role in assessing evidence for association. Here, we demonstrate how these data can be systematically integrated into an association study's analysis plan.
Results: We developed a Bayesian statistical model for the prior probability of phenotype-genotype association that incorporates data from past association studies and publicly available functional annotation data regarding the susceptibility variants under study. The model takes the form of a binary regression of association status on a set of annotation variables whose coefficients were estimated through an analysis of associated SNPs in the GWAS Catalog (GC). The functional predictors examined included measures that have been demonstrated to correlate with the association status of SNPs in the GC and some whose utility in this regard is speculative: summaries of the UCSC Human Genome Browser ENCODE super-track data, dbSNP function class, sequence conservation summaries, proximity to genomic variants in the Database of Genomic Variants and known regulatory elements in the Open Regulatory Annotation database, PolyPhen-2 probabilities and RegulomeDB categories. Because we expected that only a fraction of the annotations would contribute to predicting association, we employed a penalized likelihood method to reduce the impact of non-informative predictors and evaluated the model's ability to predict GC SNPs not used to construct the model. We show that the functional data alone are predictive of a SNP's presence in the GC. Further, using data from a genome-wide study of ovarian cancer, we demonstrate that their use as prior data when testing for association is practical at the genome-wide scale and improves power to detect associations.
Conclusions: We show how diverse functional annotations can be efficiently combined to create 'functional signatures' that predict the a priori odds of a variant's association to a trait and how these signatures can be integrated into a standard genome-wide-scale association analysis, resulting in improved power to detect truly associated variants.
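The modelling idea in the abstract (a penalized binary regression of association status on annotation variables, whose fitted values serve as prior probabilities in an association test) can be illustrated with a small sketch. This is not the authors' implementation: the logistic link, the L1 penalty, the proximal-gradient fitting routine, the Bayes-factor update and every function name below are assumptions chosen for illustration, since the abstract only says "binary regression" and "penalized likelihood method".

```python
import numpy as np


def sigmoid(z):
    """Numerically stable logistic function."""
    return 0.5 * (1.0 + np.tanh(0.5 * z))


def fit_l1_logistic(X, y, lam=0.05, step=0.1, n_iter=2000):
    """Hypothetical helper: L1-penalized logistic regression fitted by
    proximal gradient descent. Returns (intercept, coefficients).
    The intercept is left unpenalized."""
    n, p = X.shape
    b0, beta = 0.0, np.zeros(p)
    for _ in range(n_iter):
        resid = sigmoid(b0 + X @ beta) - y   # derivative of the log-loss
        b0 -= step * resid.mean()
        beta = beta - step * (X.T @ resid) / n
        # Soft-thresholding shrinks weakly supported coefficients to zero.
        beta = np.sign(beta) * np.maximum(np.abs(beta) - step * lam, 0.0)
    return b0, beta


def prior_probability(X, b0, beta):
    """Prior probability of association implied by the annotations."""
    return sigmoid(b0 + X @ beta)


def posterior_probability(prior, bayes_factor):
    """Assumed update rule: posterior odds = Bayes factor * prior odds."""
    post_odds = bayes_factor * prior / (1.0 - prior)
    return post_odds / (1.0 + post_odds)


# Synthetic stand-in for annotation data (not real GWAS Catalog data):
# 5 annotation variables, of which only the first and fourth matter.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
true_beta = np.array([1.5, 0.0, 0.0, -1.0, 0.0])
y = (rng.random(500) < sigmoid(-1.0 + X @ true_beta)).astype(float)

b0, beta = fit_l1_logistic(X, y)
priors = prior_probability(X, b0, beta)
# The same Bayes factor yields different posteriors under different priors.
posteriors = posterior_probability(priors, bayes_factor=10.0)
```

Under these assumptions, a SNP whose annotations resemble those of previously associated SNPs starts with higher prior odds, so the same genotype evidence moves it further toward a declared association.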
Pages: 16
Related papers
50 in total
  • [1] Functional annotation signatures of disease susceptibility loci improve SNP association analysis
    Edwin S Iversen
    Gary Lipton
    Merlise A Clyde
    Alvaro NA Monteiro
    BMC Genomics, 15
  • [2] Functional Annotation of Putative Regulatory Elements at Cancer Susceptibility Loci
    Rosse, Stephanie A.
    Auer, Paul L.
    Carlson, Christopher S.
    CANCER INFORMATICS, 2014, 13 : 5 - 17
  • [3] Functional annotation of melanoma risk loci identifies novel susceptibility genes
    Fang, Shenying
    Lu, Jiachun
    Zhou, Xinke
    Wang, Yuling
    Ross, Merrick I.
    Gershenwald, Jeffrey E.
    Cormier, Janice N.
    Wargo, Jennifer
    Sui, Dawen
    Amos, Christopher I.
    Lee, Jeffrey E.
    CARCINOGENESIS, 2020, 41 (04) : 452 - 457
  • [4] Annotation of functional variation within non-MHC MS susceptibility loci through bioinformatics analysis
    F B S Briggs
    L J Leung
    L F Barcellos
    Genes & Immunity, 2014, 15 : 466 - 476
  • [5] Annotation of functional variation within non-MHC MS susceptibility loci through bioinformatics analysis
    Briggs, F. B. S.
    Leung, L. J.
    Barcellos, L. F.
    GENES AND IMMUNITY, 2014, 15 (07) : 466 - 476
  • [6] Analysis of Disease Association and Susceptibility for SNP Data Using Emotional Neural Networks
    Wang, Xiao
    Peng, Qinke
    Zhong, Tao
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2901 - 2905
  • [7] SNP arrays facilitate genotyping of non-synonymous SNP in MDS to identify disease susceptibility loci
    Jankowska, Anna M.
    Przychodzen, Bartlomiej P.
    Gondek, Lukasz P.
    Maciejewski, Jaroslaw P.
    BLOOD, 2007, 110 (11) : 714A - 714A
  • [9] Integrated Functional Genomic Analysis Enables Annotation of Kidney Genome-Wide Association Study Loci
    Sieber, Karsten B.
    Batorsky, Anna
    Siebenthall, Kyle
    Hudkins, Kelly L.
    Vierstra, Jeff D.
    Sullivan, Shawn
    Sur, Aakash
    McNulty, Michelle
    Sandstrom, Richard
    Reynolds, Alex
    Bates, Daniel
    Diegel, Morgan
    Dunn, Douglass
    Nelson, Jemma
    Buckley, Michael
    Kaul, Rajinder
    Sampson, Matthew G.
    Himmelfarb, Jonathan
    Alpers, Charles E.
    Waterworth, Dawn
    Akilesh, Shreeram
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2019, 30 (03) : 421 - 441
  • [10] The CD40 Kozak SNP: a new susceptibility loci for Graves' disease?
    Simmonds, MJ
    Heward, JM
    Franklyn, JA
    Gough, SCL
    CLINICAL ENDOCRINOLOGY, 2005, 63 (02) : 232 - 233