Gene Function Prediction Based on the Gene Ontology Hierarchical Structure

Cited by: 25
Authors
Cheng, Liangxi [1]
Lin, Hongfei [2]
Hu, Yuncui [2]
Wang, Jian [2]
Yang, Zhihao [2]
Affiliations
[1] Dalian Univ Technol, Dept Biomed Engn, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Liaoning, Peoples R China
Source
PLOS ONE | 2014 / Volume 9 / Issue 09
DOI
10.1371/journal.pone.0107187
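Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09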
Abstract
Gene Ontology annotations help explain life science phenomena and provide strong support for biomedical research. The use of the Gene Ontology is gradually changing the way people store and understand bioinformatic data. To facilitate the prediction of gene functions with text mining methods and existing resources, we cast the task as a multi-label top-down classification problem and develop a method that uses the hierarchical relationships in the Gene Ontology structure to relieve the quantitative imbalance between positive and negative training samples. The method also enhances the discriminating ability of the classifiers by retaining and highlighting the key training samples. Additionally, the top-down classifier, which is based on a tree structure, takes the relationships among target classes into consideration and thus resolves the incompatibility between the classification results and the Gene Ontology structure. Our experiment on the Gene Ontology annotation corpus achieves an F-value of 50.7% (precision: 52.7%, recall: 48.9%). The experimental results demonstrate that when the training set is small, it can be expanded via topological propagation of associated documents between the parent and child nodes in the tree structure. The top-down classification model applies to any set of texts organized in an ontology structure or with a hierarchical relationship.
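The ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy term identifiers, the `propagate_up`, `predict_top_down`, and `f_value` helpers, and the `accepts` callable standing in for a per-node classifier are all hypothetical. The abstract does not state the direction of propagation; the sketch assumes documents annotated to a child term are also counted as positives for its ancestors. The last helper simply checks that the reported precision and recall are consistent with the reported F-value, assuming the F-value is the balanced harmonic mean.

```python
from collections import defaultdict

# Hypothetical toy hierarchy: child term -> parent term (a tree).
PARENT = {"GO:B": "GO:A", "GO:C": "GO:A", "GO:D": "GO:B"}


def children_of(parent_map):
    """Invert the child -> parent map into parent -> list of children."""
    kids = defaultdict(list)
    for child, parent in parent_map.items():
        kids[parent].append(child)
    return kids


def propagate_up(direct_docs, parent_map):
    """Assumption: every ancestor inherits the documents of its descendants."""
    expanded = defaultdict(set)
    for term, docs in direct_docs.items():
        node = term
        while node is not None:
            expanded[node] |= set(docs)
            node = parent_map.get(node)  # None once the root is passed
    return dict(expanded)


def predict_top_down(root, kids, accepts):
    """Descend from the root; a child is tested only if its parent was accepted."""
    predicted, frontier = [], [root]
    while frontier:
        node = frontier.pop()
        if accepts(node):  # stand-in for a trained per-node classifier
            predicted.append(node)
            frontier.extend(kids.get(node, []))
    return predicted


def f_value(precision, recall):
    """Balanced F-measure: harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    direct = {"GO:D": {"doc1"}, "GO:C": {"doc2"}}
    print(propagate_up(direct, PARENT))
    print(predict_top_down("GO:A", children_of(PARENT), lambda t: t != "GO:B"))
    print(round(f_value(0.527, 0.489), 3))
```

Because a rejected node blocks its whole subtree, the predicted label set is always closed under the parent relation, which is the sense in which a top-down scheme keeps results compatible with the hierarchy.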
Pages: 7