TVAR: assessing tissue-specific functional effects of non-coding variants with deep learning

被引:2
|
作者
Yang, Hai [1 ,2 ]
Chen, Rui [2 ,3 ]
Wang, Quan [2 ,3 ]
Wei, Qiang [2 ,3 ]
Ji, Ying [2 ,3 ]
Zhong, Xue [3 ,4 ]
Li, Bingshan [2 ,3 ]
机构
[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
[2] Vanderbilt Univ, Dept Mol Physiol & Biophys, Nashville, TN 37232 USA
[3] Vanderbilt Univ, Vanderbilt Genet Inst, Nashville, TN 37232 USA
[4] Vanderbilt Univ, Med Ctr, Dept Med, Nashville, TN 37232 USA
基金
美国国家卫生研究院;
关键词
INTEGRATIVE ANALYSIS; REGULATORY VARIANTS; PATHOGENICITY; FRAMEWORK; ELEMENTS; MODEL;
D O I
10.1093/bioinformatics/btac608
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Analysis of whole-genome sequencing (WGS) for genetics is still a challenge due to the lack of accurate functional annotation of non-coding variants, especially the rare ones. As eQTLs have been extensively implicated in the genetics of human diseases, we hypothesize that rare non-coding variants discovered in WGS play a regulatory role in predisposing disease risk. Results: With thousands of tissue- and cell-type-specific epigenomic features, we propose TVAR. This multi-label learning-based deep neural network predicts the functionality of non-coding variants in the genome based on eQTLs across 49 human tissues in the GTEx project. TVAR learns the relationships between high-dimensional epigenomics and eQTLs across tissues, taking the correlation among tissues into account to understand shared and tissue-specific eQTL effects. As a result, TVAR outputs tissue-specific annotations, with an average AUROC of 0.77 across these tissues. We evaluate TVAR's performance on four complex diseases (coronary artery disease, breast cancer, Type 2 diabetes and Schizophrenia), using TVAR's tissue-specific annotations, and observe its superior performance in predicting functional variants for both common and rare variants, compared with five existing state-of-the-art tools. We further evaluate TVAR's G-score, a scoring scheme across all tissues, on ClinVar, fine-mapped GWAS loci, Massive Parallel Reporter Assay (MPRA) validated variants and observe the consistently better performance of TVAR compared with other competing tools. Availability and implementation: The TVAR source code and its scores on the ClinVar catalog, fine mapped GWAS Loci, high confidence eQTLs from GTEx dataset, and MPRA validated functional variants are available at https:// github.com/haiyang1986/TVAR.
引用
收藏
页码:4697 / 4704
页数:8
相关论文
共 50 条
  • [41] Shortcuts to a functional adipose tissue: The role of small non-coding RNAs
    Brandao, Bruna B.
    Guerra, Beatriz A.
    Mori, Marcelo A.
    REDOX BIOLOGY, 2017, 12 : 82 - 102
  • [42] An integrative approach to predicting the functional effects of non-coding and coding sequence variation
    Shihab, Hashem A.
    Rogers, Mark F.
    Gough, Julian
    Mort, Matthew
    Cooper, David N.
    Day, Ian N. M.
    Gaunt, Tom R.
    Campbell, Colin
    BIOINFORMATICS, 2015, 31 (10) : 1536 - 1543
  • [43] DeepSATA: A Deep Learning-Based Sequence Analyzer Incorporating the Transcription Factor Binding Affinity to Dissect the Effects of Non-Coding Genetic Variants
    Ma, Wenlong
    Fu, Yang
    Bao, Yongzhou
    Wang, Zhen
    Lei, Bowen
    Zheng, Weigang
    Wang, Chao
    Liu, Yuwen
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (15)
  • [44] Tissue-specific differential expression of novel genes and long intergenic non-coding RNAs in humans with extreme response to evoked endotoxemia
    Yuanfeng Gao
    中国循环杂志, 2018, (S1) : 125 - 125
  • [45] Uncovering tissue-specific binding features from differential deep learning
    Phuycharoen, Mike
    Zarrineh, Peyman
    Bridoux, Laure
    Amin, Shilu
    Losa, Marta
    Chen, Ke
    Bobola, Nicoletta
    Rattray, Magnus
    NUCLEIC ACIDS RESEARCH, 2020, 48 (05) : E27
  • [46] lncDML: Identification of long non-coding RNAs by Deep Metric Learning
    Zhao, Pengfei
    Peng, Qinke
    Zhu, Zhibo
    Han, Tian
    Dong, Rida
    Huang, Huijun
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1808 - 1812
  • [47] Genome-wide identification and characterization of tissue-specific non-coding RNAs in black pepper (Piper nigrum L.)
    Kumar, Baibhav
    Saha, Bibek
    Jaiswal, Sarika
    Angadi, U. B.
    Rai, Anil
    Iquebal, Mir Asif
    FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [48] Author Correction: Evaluation of deep learning in non-coding RNA classification
    Noorul Amin
    Annette McGrath
    Yi-Ping Phoebe Chen
    Nature Machine Intelligence, 2020, 2 : 236 - 236
  • [49] Comparison and benchmark of deep learning methods for non-coding RNA classification
    Creux, Constance
    Zehraoui, Farida
    Radvanyi, Francois
    Tahi, Fariza
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (09)
  • [50] Functional coding and non-coding variants in human BRCA1 gene and their use in genetic screening
    Gulten Tuncel
    Mahmut Çerkez Ergören
    Medical Oncology, 2019, 36