TreeHugger: A New Test for Enrichment of Gene Ontology Terms

被引:4
|
作者
Jupiter, Daniel [1 ]
Sahutoglu, Jessica [2 ]
VanBuren, Vincent [1 ]
机构
[1] Texas A&M Hlth Sci Ctr, Dept Syst Biol & Translat Med, Coll Med, Temple, TX 76504 USA
[2] Washtenaw Community Hlth Org, Ypsilanti, MI 48198 USA
关键词
statistics; data analysis; probability; genomics; microarray; EXPRESSION PROFILES; ART; TOOL; ANNOTATION; CATEGORIES; GRAPH; SETS;
D O I
10.1287/ijoc.1090.0356
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
T he Gene Ontology (GO) project provides a structured vocabulary of biological terms used by biological researchers as a tool for standardization of references to biological entities. Genes may be annotated with GO terms to indicate their roles or localizations in the cell. GO has been used in conjunction with high-throughput experimental methods, such as microarrays. In this setting, the interest is to determine whether sets of genes identified by the high-throughput experiment are enriched for GO terms: Do certain terms annotate more genes in the identified set than one might expect? Enriched terms are taken as a potential summary of the cellular function for the identified set of genes and may provide clues leading to new directions for investigation. Current methods for determining whether sets of genes are GO-enriched have certain well-known shortcomings. Many methods do not take the hierarchical structure of the ontology into account in determining enrichment. We address this drawback by introducing a new statistical test (TreeHugger) based on a novel per-gene scoring scheme for GO terms. Given a set of genes and a specified subset of those genes, our method determines enrichment of GO terms in the subset, taking into account the structure of the ontology and ascribing a lower weight to those terms that do not themselves directly annotate the given genes. Tests on simulated and real data indicate that our method is a conservative test for enrichment. Testing TreeHugger on a biological example reveals that it also reduces the redundancy caused by giving high scores to indirect annotations as provided by standard enrichment tests.
引用
收藏
页码:210 / 221
页数:12
相关论文
共 50 条
  • [41] Association Rule Mining of Gene Ontology Annotation Terms for SGD
    Nagar, Anurag
    Hahsler, Michael
    Al-Mubaid, Hisham
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2015, : 458 - 464
  • [42] An experimental study of information content measurement of gene ontology terms
    Marianna Milano
    Giuseppe Agapito
    Pietro H. Guzzi
    Mario Cannataro
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 427 - 439
  • [43] REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms
    Supek, Fran
    Bosnjak, Matko
    Skunca, Nives
    Smuc, Tomislav
    PLOS ONE, 2011, 6 (07):
  • [44] Validating gene clusterings by selecting informative gene ontology terms with mutual information
    Costa, Ivan G.
    de Souto, Marcilio C. P.
    Schliep, Alexander
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2007, 4643 : 81 - +
  • [45] Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
    Schlicker, Andreas
    Lengauer, Thomas
    Albrecht, Mario
    BIOINFORMATICS, 2010, 26 (18) : i561 - i567
  • [46] Automatic annotation of protein motif function with Gene Ontology terms
    Xinghua Lu
    Chengxiang Zhai
    Vanathi Gopalakrishnan
    Bruce G Buchanan
    BMC Bioinformatics, 5
  • [47] Spectral clustering gene ontology terms to group genes by function
    Speer, N
    Spieth, C
    Zell, A
    ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS, 2005, 3692 : 1 - 12
  • [48] Using reasoning to guide annotation with gene, ontology terms in GOAT
    Bada, N
    Turi, D
    McEntire, R
    Stevens, R
    SIGMOD RECORD, 2004, 33 (02) : 27 - 32
  • [49] Automatic extension of Gene Ontology with flexible identification of candidate terms
    Lee, JB
    Kim, J
    Park, JC
    BIOINFORMATICS, 2006, 22 (06) : 665 - 670
  • [50] Enrichment Disequilibrium: A novel approach for measuring the degree of enrichment after gene enrichment test
    Jiang, Yongshuai
    Zhang, Mingming
    Guo, Xiaodan
    Zhang, Ruijie
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2012, 424 (03) : 563 - 567