deepSimDEF: deep neural embeddings of gene products and gene ontology terms for functional analysis of genes

被引:4
|
作者
Pesaranghader, Ahmad [1 ,2 ,3 ,4 ]
Matwin, Stan [5 ,6 ,7 ]
Sokolova, Marina [6 ,8 ,9 ]
Grenier, Jean-Christophe [1 ]
Beiko, Robert G. [5 ,6 ]
Hussin, Julie [1 ,2 ]
机构
[1] Montreal Heart Inst, Montreal, PQ H1T 1C8, Canada
[2] Univ Montreal, Fac Med, Montreal, PQ H3T 1J4, Canada
[3] Mila Quebec Artificial Intelligence Inst, Montreal, PQ H2S 3H1, Canada
[4] Univ Montreal, Dept Comp Sci & Operat Res, Montreal, PQ H3T 1J4, Canada
[5] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
[6] Dalhousie Univ, Inst Big Data Analyt, Halifax, NS B3H 4R2, Canada
[7] Polish Acad Sci, Inst Comp Sci, Warsaw, Poland
[8] Univ Ottawa, Fac Med, Ottawa, ON K1H 8M5, Canada
[9] Univ Ottawa, Fac Engn, Ottawa, ON K1H 8M5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
SEMANTIC SIMILARITY; SHARED INFORMATION; NETWORK; PREDICTION; FEATURES;
D O I
10.1093/bioinformatics/btac304
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: There is a plethora of measures to evaluate functional similarity (FS) of genes based on their co-expression, protein-protein interactions and sequence similarity. These measures are typically derived from hand-engineered and application-specific metrics to quantify the degree of shared information between two genes using their Gene Ontology (GO) annotations. Results: We introduce deepSimDEF, a deep learning method to automatically learn FS estimation of gene pairs given a set of genes and their GO annotations. deepSimDEF's key novelty is its ability to learn low-dimensional embedding vector representations of GO terms and gene products and then calculate FS using these learned vectors. We show that deepSimDEF can predict the FS of new genes using their annotations: it outperformed all other FS measures by >5-10% on yeast and human reference datasets on protein-protein interactions, gene co-expression and sequence homology tasks. Thus, deepSimDEF offers a powerful and adaptable deep neural architecture that can benefit a wide range of problems in genomics and proteomics, and its architecture is flexible enough to support its extension to any organism.
引用
收藏
页码:3051 / 3061
页数:11
相关论文
共 50 条
  • [41] Ontology design patterns to disambiguate relations between genes and gene products in GENIA
    Hoehndorf R.
    Ngonga Ngomo A.-C.
    Pyysalo S.
    Ohta T.
    Oellrich A.
    Rebholz-Schuhmann D.
    Journal of Biomedical Semantics, 2 (Suppl 5)
  • [42] Gene Ontology rewritten for computing gene functional similarity
    Visconti, Alessia
    Cordero, Francesca
    Botta, Marco
    Calogero, Raffaele A.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2010), 2010, : 694 - 699
  • [43] Identifying disease genes using machine learning and gene functional similarities, assessed through Gene Ontology
    Asif, Muhammad
    Martiniano, Hugo F. M. C. M.
    Vicente, Astrid M.
    Couto, Francisco M.
    PLOS ONE, 2018, 13 (12):
  • [44] Extending the interpretation of gene profiling microarray experiments to pathway analysis through the use of gene ontology terms
    Chatziioannou, Aristotelis
    Moulos, Panagiotis
    ICCS 2007, 2007, : 55 - +
  • [45] Functional-Network-Based Gene Set Analysis Using Gene-Ontology
    Chang, Billy
    Kustra, Rafal
    Tian, Weidong
    PLOS ONE, 2013, 8 (02):
  • [46] A multicolor panel of novel lentiviral " gene ontology" (LeGO) vectors for functional gene analysis
    Weber, Kristoffer
    Bartsch, Udo
    Stocking, Carol
    Fehse, Boris
    MOLECULAR THERAPY, 2008, 16 (04) : 698 - 706
  • [47] Validating gene clusterings by selecting informative gene ontology terms with mutual information
    Costa, Ivan G.
    de Souto, Marcilio C. P.
    Schliep, Alexander
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2007, 4643 : 81 - +
  • [48] Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
    Schlicker, Andreas
    Lengauer, Thomas
    Albrecht, Mario
    BIOINFORMATICS, 2010, 26 (18) : i561 - i567
  • [49] Analysis of Important Gene Ontology Terms and Biological Pathways Related to Pancreatic Cancer
    Yin, Hang
    Wang, ShaoPeng
    Zhang, Yu-Hang
    Cai, Yu-Dong
    Liu, Hailin
    BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [50] A Hybrid Measure for the Semantic Similarity of Gene Ontology Terms
    Zhang, Shu-Bo
    Lai, Jian-Huang
    2014 2ND INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2014, : 911 - 916