TVAR: assessing tissue-specific functional effects of non-coding variants with deep learning

被引:2
|
作者
Yang, Hai [1 ,2 ]
Chen, Rui [2 ,3 ]
Wang, Quan [2 ,3 ]
Wei, Qiang [2 ,3 ]
Ji, Ying [2 ,3 ]
Zhong, Xue [3 ,4 ]
Li, Bingshan [2 ,3 ]
机构
[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
[2] Vanderbilt Univ, Dept Mol Physiol & Biophys, Nashville, TN 37232 USA
[3] Vanderbilt Univ, Vanderbilt Genet Inst, Nashville, TN 37232 USA
[4] Vanderbilt Univ, Med Ctr, Dept Med, Nashville, TN 37232 USA
基金
美国国家卫生研究院;
关键词
INTEGRATIVE ANALYSIS; REGULATORY VARIANTS; PATHOGENICITY; FRAMEWORK; ELEMENTS; MODEL;
D O I
10.1093/bioinformatics/btac608
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Analysis of whole-genome sequencing (WGS) for genetics is still a challenge due to the lack of accurate functional annotation of non-coding variants, especially the rare ones. As eQTLs have been extensively implicated in the genetics of human diseases, we hypothesize that rare non-coding variants discovered in WGS play a regulatory role in predisposing disease risk. Results: With thousands of tissue- and cell-type-specific epigenomic features, we propose TVAR. This multi-label learning-based deep neural network predicts the functionality of non-coding variants in the genome based on eQTLs across 49 human tissues in the GTEx project. TVAR learns the relationships between high-dimensional epigenomics and eQTLs across tissues, taking the correlation among tissues into account to understand shared and tissue-specific eQTL effects. As a result, TVAR outputs tissue-specific annotations, with an average AUROC of 0.77 across these tissues. We evaluate TVAR's performance on four complex diseases (coronary artery disease, breast cancer, Type 2 diabetes and Schizophrenia), using TVAR's tissue-specific annotations, and observe its superior performance in predicting functional variants for both common and rare variants, compared with five existing state-of-the-art tools. We further evaluate TVAR's G-score, a scoring scheme across all tissues, on ClinVar, fine-mapped GWAS loci, Massive Parallel Reporter Assay (MPRA) validated variants and observe the consistently better performance of TVAR compared with other competing tools. Availability and implementation: The TVAR source code and its scores on the ClinVar catalog, fine mapped GWAS Loci, high confidence eQTLs from GTEx dataset, and MPRA validated functional variants are available at https:// github.com/haiyang1986/TVAR.
引用
收藏
页码:4697 / 4704
页数:8
相关论文
共 50 条
  • [21] Systematic analysis of the effects of genetic variants on chromatin accessibility to decipher functional variants in non-coding regions
    Wang, Dongyang
    Wu, Xiaohong
    Jiang, Guanghui
    Yang, Jianye
    Yu, Zhanhui
    Yang, Yanbo
    Yang, Wenqian
    Niu, Xiaohui
    Tang, Ke
    Gong, Jing
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [22] Beyond the genome: unveiling tissue-specific non-coding RNAs in clove (Syzygium aromaticum L.)
    Sharma, Nitesh Kumar
    Mishra, Dwijesh Chandra
    Kumar, Baibhav
    Srivastava, Sudhir
    Chaturvedi, Krishna Kumar
    Singh, Awani Kumar
    Madival, Sharanbasappa D.
    Budhlakoti, Neeraj
    Jha, Girish Kumar
    3 BIOTECH, 2025, 15 (04)
  • [23] Systematic assays and resources for the functional annotation of non-coding variants
    Kircher, Martin
    Ludwig, Kerstin U.
    MEDIZINISCHE GENETIK, 2022, 34 (04) : 275 - 286
  • [24] Comparative analysis of distinct non-coding characteristics potentially contributing to the divergence of human tissue-specific genes
    Huang, Qianli
    Guo, Jiaming
    Ge, Qq
    Li-Ling, Jesse
    Chen, Xueping
    Ma, Fei
    GENETICA, 2009, 136 (01) : 127 - 134
  • [25] EZH2 RIP-seq Identifies Tissue-specific Long Non-coding RNAs
    Wang, Yan
    Xie, Yinping
    Li, Lili
    He, Yuan
    Zheng, Di
    Yu, Pengcheng
    Yu, Ling
    Tang, Lixu
    Wang, Yibin
    Wang, Zhihua
    CURRENT GENE THERAPY, 2018, 18 (05) : 275 - 285
  • [26] Comparative analysis of distinct non-coding characteristics potentially contributing to the divergence of human tissue-specific genes
    Qianli Huang
    Jiaming Guo
    Qq Ge
    Jesse Li-Ling
    Xueping Chen
    Fei Ma
    Genetica, 2009, 136 : 127 - 134
  • [27] Evaluation of deep learning in non-coding RNA classification
    Amin, Noorul
    McGrath, Annette
    Chen, Yi-Ping Phoebe
    NATURE MACHINE INTELLIGENCE, 2019, 1 (05) : 246 - 256
  • [28] Evaluation of deep learning in non-coding RNA classification
    Noorul Amin
    Annette McGrath
    Yi-Ping Phoebe Chen
    Nature Machine Intelligence, 2019, 1 : 246 - 256
  • [29] Molecular Characterisation Reveals Tissue-Specific, Antisense Intronic Non-Coding RNA Transcription in the GHSR Locus.
    Seim, I.
    Carter, S. L.
    Herington, A. C.
    Chopin, L. K.
    ENDOCRINE REVIEWS, 2010, 31 (03)
  • [30] Genome-wide identification of tissue-specific long non-coding RNA in three farm animal species
    Colin Kern
    Ying Wang
    James Chitwood
    Ian Korf
    Mary Delany
    Hans Cheng
    Juan F. Medrano
    Alison L. Van Eenennaam
    Catherine Ernst
    Pablo Ross
    Huaijun Zhou
    BMC Genomics, 19