Disease-specific prioritization of non-coding GWAS variants based on chromatin accessibility

被引:0
|
作者
Liang, Qianqian [1 ,2 ,3 ]
Abraham, Abin [4 ]
Capra, John A. [5 ,6 ]
Kostka, Dennis [1 ,2 ]
机构
[1] Univ Pittsburgh, Sch Med, Dept Computat & Syst Biol, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Sch Med, Ctr Evolutionary Biol & Med, Pittsburgh, PA 15213 USA
[3] Univ Pittsburgh, Sch Publ Hlth, Dept Human Genet, Pittsburgh, PA USA
[4] Childrens Hosp Philadelphia, Philadelphia, PA USA
[5] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA USA
[6] Univ Calif San Francisco, Bakar Computat Hlth Sci Inst, San Francisco, CA USA
来源
关键词
CELLS; ASSOCIATION; ANNOTATION; FRAMEWORK; INNATE;
D O I
10.1016/j.xhgg.2024.100310
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Non-protein-coding genetic variants are a major driver of the genetic risk for human disease; however, identifying which non-coding variants contribute to diseases and their mechanisms remains challenging. In silico variant prioritization methods quantify a variant's severity, but for most methods, the specific phenotype and disease context of the prediction remain poorly defined. For example, many commonly used methods provide a single, organism-wide score for each variant, while other methods summarize a variant's impact in certain tissues and/or cell types. Here, we propose a complementary disease-specific variant prioritization scheme, which is motivated by the observation that variants contributing to disease often operate through specific biological mechanisms. We combine tissue/cell-type-specific variant scores (e.g., GenoSkyline, FitCons2, DNA accessibility) into disease-specific scores with a logistic regression approach and apply it to - 25,000 non-coding variants spanning 111 diseases. We show that this disease-specific aggregation significantly improves the association of common non-coding genetic variants with disease (average precision: 0.151, baseline = 0.09), compared with organism-wide scores (GenoCanyon, LINSIGHT, GWAVA, Eigen, CADD; average precision: 0.129, baseline = 0.09). Further on, disease similarities based on data-driven aggregation weights highlight meaningful disease groups, and it provides information about tissues and cell types that drive these similarities. We also show that so-learned similarities are complementary to genetic similarities as quantified by genetic correlation. Overall, our approach demonstrates the strengths of disease-specific variant prioritization, leads to improvement in non-coding variant prioritization, and enables interpretable models that link variants to disease via specific tissues and/or cell types.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Role of non-coding sequence variants in cancer
    Khurana, Ekta
    Fu, Yao
    Chakravarty, Dimple
    Demichelis, Francesca
    Rubin, Marka.
    Gerstein, Mark
    NATURE REVIEWS GENETICS, 2016, 17 (02) : 93 - 108
  • [42] Non-coding DNA variants for risk in lupus☆
    Zhang, Yutong
    Hou, Guojun
    Shen, Nan
    BEST PRACTICE & RESEARCH IN CLINICAL RHEUMATOLOGY, 2024, 38 (02):
  • [43] Coding RNAs with a non-coding function Maintenance of open chromatin structure
    Caudron-Herger, Maiwen
    Mueller-Ott, Katharina
    Mallm, Jan-Philipp
    Marth, Caroline
    Schmidt, Ute
    Fejes-Toth, Katalin
    Rippe, Karsten
    NUCLEUS, 2011, 2 (05) : 410 - 424
  • [44] Huntington's Disease as Neurodevelopmental Disorder: Altered Chromatin Regulation, Coding, and Non-Coding RNA Transcription
    Kerschbamer, Emanuela
    Biagioli, Marta
    FRONTIERS IN NEUROSCIENCE, 2016, 9
  • [45] The Landscape of Liver Chromatin Accessibility and Conserved Non-coding Elements in Larimichthys crocea, Nibea albiflora, and Lateolabrax maculatus
    Chaowei Song
    Wanbo Li
    Zhiyong Wang
    Marine Biotechnology, 2022, 24 : 763 - 775
  • [46] The Landscape of Liver Chromatin Accessibility and Conserved Non-coding Elements in Larimichthys crocea, Nibea albiflora, and Lateolabrax maculatus
    Song, Chaowei
    Li, Wanbo
    Wang, Zhiyong
    MARINE BIOTECHNOLOGY, 2022, 24 (04) : 763 - 775
  • [47] CHROMATIN ACCESSIBILITY MAPPING IN HIPSC IMPLICATES FUNCTIONAL GWAS RISK VARIANTS AND TARGET GENES FOR NEUROPSYCHIATRIC DISORDERS AND ALZHEIMER'S DISEASE
    Zhang, Siwei
    Zhang, Hanwen
    Kozlova, Alena
    Forrest, Marc
    Zhou, Yifan
    Bagchi, Vikram
    Dos Santos, Marc
    Wood, Whitney
    Li, Lexi
    Sanders, Alan
    Pang, Zhiping
    He, Xin
    Penzes, Peter
    Duan, Jubao
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2023, 75 : S90 - S90
  • [48] GREEN-DB: a framework for the annotation and prioritization of non-coding regulatory variants in whole-genome sequencing
    Giacopuzzi, Edoardo
    Popitsch, Niko
    Taylor, Jenny C.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2022, 30 (SUPPL 1) : 496 - 496
  • [49] Non-coding RNAs in chromatin folding and nuclear organization
    Sergey V. Razin
    Alexey A. Gavrilov
    Cellular and Molecular Life Sciences, 2021, 78 : 5489 - 5504
  • [50] UpSETing chromatin during non-coding RNA production
    Venkatesh, Swaminathan
    Workman, Jerry L.
    Smolle, Michaela
    EPIGENETICS & CHROMATIN, 2013, 6