Cluster analysis of protein array results via similarity of Gene Ontology annotation

Cited by: 20
Authors
Wolting, Cheryl
McGlade, C. Jane
Tritchler, David
Affiliations
[1] Univ Toronto, Dept Med Biophys, Toronto, ON, Canada
[2] Hosp Sick Children, Dept Cell Biol, Arthur & Sonia Labatt Brain Tumour Res Ctr, Toronto, ON M5G 1X8, Canada
[3] Princess Margaret Hosp, Ontario Canc Inst, Toronto, ON M5G 2M9, Canada
DOI
10.1186/1471-2105-7-338
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: With the advent of high-throughput proteomic experiments such as arrays of purified proteins comes the need to analyse sets of proteins as an ensemble, as opposed to the traditional one-protein-at-a-time approach. Although several publicly available tools facilitate the analysis of protein sets, they either do not display integrated results in an easily interpreted image or do not allow the user to specify the proteins to be analysed.
Results: We developed a novel computational approach to analyse the annotation of sets of molecules. As proof of principle, we analysed two sets of proteins identified in published protein array screens. The distance between any two proteins was measured as the graph similarity between their Gene Ontology (GO) annotations. These distances were then clustered to highlight subsets of proteins sharing related GO annotation. In the first set, proteins found to bind small molecule inhibitors of rapamycin, we identified three subsets containing four or five proteins each that may help to elucidate how rapamycin affects cell growth, whereas the original authors chose only one novel protein from the array results for further study. In a set of phosphoinositide-binding proteins, we identified subsets of proteins associated with different intracellular structures that were not highlighted by the analysis performed in the original publication.
Conclusion: By determining the distances between annotations, our methodology reveals trends and enrichment of proteins of particular functions within high-throughput datasets at a higher sensitivity than perusal of end-point annotations alone. In an era of increasingly complex datasets, such tools will help in the formulation of new, testable hypotheses from high-throughput experimental data.
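The two steps named in the Results (a graph-based distance between GO annotations, followed by clustering of those distances) can be illustrated with a minimal sketch. The abstract does not specify the exact graph similarity measure or clustering algorithm, so everything here is an assumption: the toy ontology in `PARENTS`, the protein labels in `ANNOTATIONS`, the use of Jaccard distance over each protein's annotated terms plus their ancestors, and average-linkage agglomerative clustering with a distance cut-off. It is not the authors' implementation.

```python
from itertools import combinations

# Hypothetical toy ontology (child term -> parent terms); not real GO identifiers.
PARENTS = {
    "root": [],
    "binding": ["root"],
    "catalysis": ["root"],
    "lipid_binding": ["binding"],
    "pi_binding": ["lipid_binding"],
    "kinase": ["catalysis"],
    "lipid_kinase": ["kinase", "lipid_binding"],
}

# Hypothetical proteins and their direct annotations.
ANNOTATIONS = {
    "P1": {"pi_binding"},
    "P2": {"lipid_binding"},
    "P3": {"kinase"},
    "P4": {"lipid_kinase"},
}


def induced_terms(terms):
    """Return the annotated terms plus all of their ancestors in the DAG."""
    seen = set()
    stack = list(terms)
    while stack:
        term = stack.pop()
        if term not in seen:
            seen.add(term)
            stack.extend(PARENTS[term])
    return seen


def graph_distance(terms_a, terms_b):
    """Jaccard distance between induced term sets (an assumed stand-in measure)."""
    ga, gb = induced_terms(terms_a), induced_terms(terms_b)
    return 1.0 - len(ga & gb) / len(ga | gb)


def cluster(annotations, threshold):
    """Average-linkage agglomerative clustering, stopping above `threshold`."""
    clusters = [frozenset([p]) for p in sorted(annotations)]

    def linkage(c1, c2):
        # Mean pairwise distance between members of the two clusters.
        pairs = [(x, y) for x in c1 for y in c2]
        total = sum(graph_distance(annotations[x], annotations[y]) for x, y in pairs)
        return total / len(pairs)

    while len(clusters) > 1:
        dist, c1, c2 = min(
            ((linkage(a, b), a, b) for a, b in combinations(clusters, 2)),
            key=lambda item: item[0],
        )
        if dist > threshold:
            break
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
    return clusters


if __name__ == "__main__":
    for group in cluster(ANNOTATIONS, threshold=0.4):
        print(sorted(group))
```

With these toy inputs and a cut-off of 0.4, P1 and P2 are grouped because their induced term sets share three of four terms, while P3 and P4 remain separate. A real analysis would load the GO graph and curated annotations instead of the hand-written dictionaries.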
Pages: 13