TreeHugger: A New Test for Enrichment of Gene Ontology Terms

被引:4
|
作者
Jupiter, Daniel [1 ]
Sahutoglu, Jessica [2 ]
VanBuren, Vincent [1 ]
机构
[1] Texas A&M Hlth Sci Ctr, Dept Syst Biol & Translat Med, Coll Med, Temple, TX 76504 USA
[2] Washtenaw Community Hlth Org, Ypsilanti, MI 48198 USA
关键词
statistics; data analysis; probability; genomics; microarray; EXPRESSION PROFILES; ART; TOOL; ANNOTATION; CATEGORIES; GRAPH; SETS;
D O I
10.1287/ijoc.1090.0356
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
T he Gene Ontology (GO) project provides a structured vocabulary of biological terms used by biological researchers as a tool for standardization of references to biological entities. Genes may be annotated with GO terms to indicate their roles or localizations in the cell. GO has been used in conjunction with high-throughput experimental methods, such as microarrays. In this setting, the interest is to determine whether sets of genes identified by the high-throughput experiment are enriched for GO terms: Do certain terms annotate more genes in the identified set than one might expect? Enriched terms are taken as a potential summary of the cellular function for the identified set of genes and may provide clues leading to new directions for investigation. Current methods for determining whether sets of genes are GO-enriched have certain well-known shortcomings. Many methods do not take the hierarchical structure of the ontology into account in determining enrichment. We address this drawback by introducing a new statistical test (TreeHugger) based on a novel per-gene scoring scheme for GO terms. Given a set of genes and a specified subset of those genes, our method determines enrichment of GO terms in the subset, taking into account the structure of the ontology and ascribing a lower weight to those terms that do not themselves directly annotate the given genes. Tests on simulated and real data indicate that our method is a conservative test for enrichment. Testing TreeHugger on a biological example reveals that it also reduces the redundancy caused by giving high scores to indirect annotations as provided by standard enrichment tests.
引用
收藏
页码:210 / 221
页数:12
相关论文
共 50 条
  • [1] Grouping Gene Ontology terms to improve the assessment of gene set enrichment in microarray data
    Alex Lewin
    Ian C Grieve
    BMC Bioinformatics, 7
  • [2] Grouping Gene Ontology terms to improve the assessment of gene set enrichment in microarray data
    Lewin, Alex
    Grieve, Ian C.
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [3] PROTEOMIC PROFILE AND FUNCTIONAL ENRICHMENT OF GENE ONTOLOGY TERMS IN MEN WITH TESTICULAR CANCER
    Tibaldi, D. S.
    Sposito, C.
    Del Giudice, P. T.
    Fariello, R. M.
    Spaine, D.
    Fraietta, R.
    FERTILITY AND STERILITY, 2011, 96 (03) : S206 - S206
  • [4] Analysis of the chemical toxicity effects using the enrichment of Gene Ontology terms and KEGG pathways
    Chen, Lei
    Zhang, Yu-Hang
    Zou, Quan
    Chu, Chen
    Ji, Zhiliang
    BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2016, 1860 (11): : 2619 - 2626
  • [5] The compositional structure of gene ontology terms
    Ogren, PV
    Cohen, KB
    Acquaah-Mensah, GK
    Eberlein, J
    Hunter, L
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2004, 2003, : 214 - 225
  • [6] Zebrafish Expression Ontology of Gene Sets (ZEOGS): A Tool to Analyze Enrichment of Zebrafish Anatomical Terms in Large Gene Sets
    Prykhozhij, Sergey V.
    Marsico, Annalisa
    Meijsing, Sebastiaan H.
    ZEBRAFISH, 2013, 10 (03) : 303 - 315
  • [7] Clustering of gene ontology terms in genomes
    Tiirikka, Timo
    Siermala, Markku
    Vihinen, Mauno
    GENE, 2014, 550 (02) : 155 - 164
  • [8] Identifying Gene Ontology Areas for Automated Enrichment
    Pesquita, Catia
    Grego, Tiago
    Couto, Francisco
    DISTRIBUTED COMPUTING, ARTIFICIAL INTELLIGENCE, BIOINFORMATICS, SOFT COMPUTING, AND AMBIENT ASSISTED LIVING, PT II, PROCEEDINGS, 2009, 5518 : 934 - 941
  • [9] Gene ontology enrichment analysis of parkin interactants
    Zanon, A.
    Pichler, I.
    Rakovic, A.
    Schwienbacher, C.
    Hicks, A. A.
    Alexa, A.
    Domingues, F. S.
    Klein, C.
    Pramstaller, P. P.
    MOVEMENT DISORDERS, 2011, 26 : S349 - S349
  • [10] Investigating the concordance of Gene Ontology terms reveals the intra- and inter-platform reproducibility of enrichment analysis
    Lifang Zhang
    Juan Zhang
    Gang Yang
    Di Wu
    Lina Jiang
    Zhining Wen
    Menglong Li
    BMC Bioinformatics, 14