Gene Function Prediction Based on the Gene Ontology Hierarchical Structure

被引:25
|
作者
Cheng, Liangxi [1 ]
Lin, Hongfei [2 ]
Hu, Yuncui [2 ]
Wang, Jian [2 ]
Yang, Zhihao [2 ]
机构
[1] Dalian Univ Technol, Dept Biomed Engn, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Liaoning, Peoples R China
来源
PLOS ONE | 2014年 / 9卷 / 09期
关键词
D O I
10.1371/journal.pone.0107187
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The information of the Gene Ontology annotation is helpful in the explanation of life science phenomena, and can provide great support for the research of the biomedical field. The use of the Gene Ontology is gradually affecting the way people store and understand bioinformatic data. To facilitate the prediction of gene functions with the aid of text mining methods and existing resources, we transform it into a multi-label top-down classification problem and develop a method that uses the hierarchical relationships in the Gene Ontology structure to relieve the quantitative imbalance of positive and negative training samples. Meanwhile the method enhances the discriminating ability of classifiers by retaining and highlighting the key training samples. Additionally, the top-down classifier based on a tree structure takes the relationship of target classes into consideration and thus solves the incompatibility between the classification results and the Gene Ontology structure. Our experiment on the Gene Ontology annotation corpus achieves an F-value performance of 50.7% (precision: 52.7% recall: 48.9%). The experimental results demonstrate that when the size of training set is small, it can be expanded via topological propagation of associated documents between the parent and child nodes in the tree structure. The top-down classification model applies to the set of texts in an ontology structure or with a hierarchical relationship.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Gene structure conservation aids similarity based gene prediction
    Meyer, IM
    Durbin, R
    NUCLEIC ACIDS RESEARCH, 2004, 32 (02) : 776 - 783
  • [42] From Ontology-Based Gene Function to Physiological Model
    Sharma, Ajay Shiv
    Gupta, Hari Om
    Mitrasinovic, Petar M.
    CURRENT BIOINFORMATICS, 2012, 7 (04) : 436 - 446
  • [43] Ontology based text mining of gene-phenotype associations: application to candidate gene prediction
    Kafkas, Senay
    Hoehndorf, Robert
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2019,
  • [44] A hierarchical multi-label classification method based on neural networks for gene function prediction
    Feng, Shou
    Fu, Ping
    Zheng, Wenbin
    BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2018, 32 (06) : 1613 - 1621
  • [45] Protein function prediction with gene ontology: from traditional to deep learning models
    Thi Thuy Duong Vu
    Jung, Jaehee
    PEERJ, 2021, 9
  • [46] Correlating expression data with gene function using gene ontology
    Liu Qi
    Deng Yong
    Wang Chuan
    Shi Tie-Liu
    Li Yi-Xue
    CHINESE JOURNAL OF CHEMISTRY, 2006, 24 (09) : 1247 - 1254
  • [47] Hierarchical Classification of Gene Ontology-based Protein Functions with Neural Networks
    Cerri, Ricardo
    Barros, Rodrigo C.
    de Carvalho, Andre C. P. L. F.
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [48] HEMDAG: a family of modular and scalable hierarchical ensemble methods to improve Gene Ontology term prediction
    Notaro, Marco
    Frasca, Marco
    Petrini, Alessandro
    Gliozzo, Jessica
    Casiraghi, Elena
    Robinson, Peter N.
    Valentini, Giorgio
    BIOINFORMATICS, 2021, 37 (23) : 4526 - 4533
  • [49] Hierarchical Classification of Gene Ontology with Learning Classifier Systems
    Romao, Luiz Melo
    Nievola, Julio Cesar
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2012, 2012, 7637 : 120 - 129
  • [50] Prediction of midbody, centrosome and kinetochore proteins based on gene ontology information
    Chen, Wei
    Lin, Hao
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2010, 401 (03) : 382 - 384