Gene Function Prediction Based on the Gene Ontology Hierarchical Structure

被引:25
|
作者
Cheng, Liangxi [1 ]
Lin, Hongfei [2 ]
Hu, Yuncui [2 ]
Wang, Jian [2 ]
Yang, Zhihao [2 ]
机构
[1] Dalian Univ Technol, Dept Biomed Engn, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Liaoning, Peoples R China
来源
PLOS ONE | 2014年 / 9卷 / 09期
关键词
D O I
10.1371/journal.pone.0107187
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The information of the Gene Ontology annotation is helpful in the explanation of life science phenomena, and can provide great support for the research of the biomedical field. The use of the Gene Ontology is gradually affecting the way people store and understand bioinformatic data. To facilitate the prediction of gene functions with the aid of text mining methods and existing resources, we transform it into a multi-label top-down classification problem and develop a method that uses the hierarchical relationships in the Gene Ontology structure to relieve the quantitative imbalance of positive and negative training samples. Meanwhile the method enhances the discriminating ability of classifiers by retaining and highlighting the key training samples. Additionally, the top-down classifier based on a tree structure takes the relationship of target classes into consideration and thus solves the incompatibility between the classification results and the Gene Ontology structure. Our experiment on the Gene Ontology annotation corpus achieves an F-value performance of 50.7% (precision: 52.7% recall: 48.9%). The experimental results demonstrate that when the size of training set is small, it can be expanded via topological propagation of associated documents between the parent and child nodes in the tree structure. The top-down classification model applies to the set of texts in an ontology structure or with a hierarchical relationship.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Prediction of human protein function according to Gene Ontology categories
    Jensen, LJ
    Gupta, R
    Stærfeldt, HH
    Brunak, S
    BIOINFORMATICS, 2003, 19 (05) : 635 - 642
  • [22] Protein Function Prediction With Functional and Topological Knowledge of Gene Ontology
    Zhao, Yingwen
    Yang, Zhihao
    Hong, Yongkai
    Yang, Yumeng
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2023, 22 (04) : 755 - 762
  • [23] FFPred 3: feature-based function prediction for all Gene Ontology domains
    Cozzetto, Domenico
    Minneci, Federico
    Currant, Hannah
    Jones, David T.
    SCIENTIFIC REPORTS, 2016, 6
  • [24] An experimental analysis of graph representation learning for Gene Ontology based protein function prediction
    Vu, Thi Thuy Duong
    Kim, Jeongho
    Jung, Jaehee
    PEERJ, 2024, 12
  • [25] Gene Ontology-Based Protein Function Prediction by Using Sequence Composition Information
    Dong, Qiwen
    Zhou, Shuigeng
    Deng, Lei
    Guan, Jihong
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (06): : 789 - 795
  • [26] FFPred 3: feature-based function prediction for all Gene Ontology domains
    Domenico Cozzetto
    Federico Minneci
    Hannah Currant
    David T. Jones
    Scientific Reports, 6
  • [27] Gene-based and semantic structure of the Gene Ontology as a complex network
    Coronnello, Claudia
    Tumminello, Michele
    Micciche, Salvatore
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2016, 458 : 313 - 328
  • [28] Prediction of submitochondrial proteins localization based on Gene Ontology
    Wang, Jingyu
    Zhou, Haihang
    Wang, Yuxiang
    Xu, Mengdie
    Yu, Yun
    Wang, Junjie
    Liu, Yun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
  • [29] Prediction of key gene function in spinal muscular atrophy using guilt by association method based on network and gene ontology
    Yang, Wenjiu
    Han, Jing
    Ma, Jinfeng
    Feng, Yujie
    Hou, Qingxian
    Wang, Zhijie
    Yu, Tengbo
    EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2019, 17 (04) : 2561 - 2566
  • [30] Gene function prediction based on combining gene ontology hierarchy with multi-instance multi-label learning
    Li, Zejun
    Liao, Bo
    Li, Yun
    Liu, Wenhua
    Chen, Min
    Cai, Lijun
    RSC ADVANCES, 2018, 8 (50) : 28503 - 28509