Disease-specific prioritization of non-coding GWAS variants based on chromatin accessibility

被引:0
|
作者
Liang, Qianqian [1 ,2 ,3 ]
Abraham, Abin [4 ]
Capra, John A. [5 ,6 ]
Kostka, Dennis [1 ,2 ]
机构
[1] Univ Pittsburgh, Sch Med, Dept Computat & Syst Biol, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Sch Med, Ctr Evolutionary Biol & Med, Pittsburgh, PA 15213 USA
[3] Univ Pittsburgh, Sch Publ Hlth, Dept Human Genet, Pittsburgh, PA USA
[4] Childrens Hosp Philadelphia, Philadelphia, PA USA
[5] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USA
[6] Univ Calif San Francisco, Bakar Computat Hlth Sci Inst, San Francisco, CA USA
来源
关键词
CELLS; ASSOCIATION; ANNOTATION; FRAMEWORK; INNATE;
D O I
10.1016/j.xhgg.2024.100310
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Non-protein-coding genetic variants are a major driver of the genetic risk for human disease; however, identifying which non-coding variants contribute to diseases and their mechanisms remains challenging. In silico variant prioritization methods quantify a variant's severity, but for most methods, the specific phenotype and disease context of the prediction remain poorly defined. For example, many commonly used methods provide a single, organism-wide score for each variant, while other methods summarize a variant's impact in certain tissues and/or cell types. Here, we propose a complementary disease-specific variant prioritization scheme, which is motivated by the observation that variants contributing to disease often operate through specific biological mechanisms. We combine tissue/cell-type-specific variant scores (e.g., GenoSkyline, FitCons2, DNA accessibility) into disease-specific scores with a logistic regression approach and apply it to - 25,000 non-coding variants spanning 111 diseases. We show that this disease-specific aggregation significantly improves the association of common non-coding genetic variants with disease (average precision: 0.151, baseline = 0.09), compared with organism-wide scores (GenoCanyon, LINSIGHT, GWAVA, Eigen, CADD; average precision: 0.129, baseline = 0.09). Further on, disease similarities based on data-driven aggregation weights highlight meaningful disease groups, and it provides information about tissues and cell types that drive these similarities. We also show that so-learned similarities are complementary to genetic similarities as quantified by genetic correlation. Overall, our approach demonstrates the strengths of disease-specific variant prioritization, leads to improvement in non-coding variant prioritization, and enables interpretable models that link variants to disease via specific tissues and/or cell types.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] CHROMATIN ACCESSIBILITY IN HIPSC MODELS AS A FUNCTIONAL READOUT OF NONCODING GWAS RISK VARIANTS OF SCHIZOPHRENIA
    Duan, Jubao
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2019, 29 : 1060 - 1061
  • [22] Chromatin, Non-Coding RNAs, and the Expression of HIV
    Groen, Jessica N.
    Morris, Kevin V.
    VIRUSES-BASEL, 2013, 5 (07): : 1633 - 1645
  • [23] Long non-coding RNAs and chromatin regulation
    De Lucia, Filomena
    Dean, Caroline
    CURRENT OPINION IN PLANT BIOLOGY, 2011, 14 (02) : 168 - 173
  • [24] Long non-coding RNA and chromatin remodeling
    Han, Pei
    Chang, Ching-Pin
    RNA BIOLOGY, 2015, 12 (10) : 1094 - 1098
  • [25] Long non-coding RNA modifies chromatin Epigenetic silencing by long non-coding RNAs
    Saxena, Alka
    Carninci, Piero
    BIOESSAYS, 2011, 33 (11) : 830 - 839
  • [26] A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
    Eliseos J. Mucaki
    Natasha G. Caminsky
    Ami M. Perri
    Ruipeng Lu
    Alain Laederach
    Matthew Halvorsen
    Joan H. M. Knoll
    Peter K. Rogan
    BMC Medical Genomics, 9
  • [28] A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
    Mucaki, Eliseos J.
    Caminsky, Natasha G.
    Perri, Ami M.
    Lu, Ruipeng
    Laederach, Alain
    Halvorsen, Matthew
    Knoll, Joan H. M.
    Rogan, Peter K.
    BMC MEDICAL GENOMICS, 2016, 9
  • [29] Looking beyond the genes: the role of non-coding variants in human disease
    Spielmann, Malte
    Mundlos, Stefan
    HUMAN MOLECULAR GENETICS, 2016, 25 (R2) : R157 - R165
  • [30] Linking non-coding variants to function in microglia in Alzheimer's disease
    Li, Yun
    Shen, Yin
    NATURE GENETICS, 2023, 55 (10) : 1615 - 1616