Gene Function Prediction Based on the Gene Ontology Hierarchical Structure

Cited by: 25
Authors
Cheng, Liangxi [1]
Lin, Hongfei [2]
Hu, Yuncui [2]
Wang, Jian [2]
Yang, Zhihao [2]
Affiliations
[1] Dalian Univ Technol, Dept Biomed Engn, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Liaoning, Peoples R China
Source
PLOS ONE | 2014 / Vol. 9 / No. 09
DOI
10.1371/journal.pone.0107187
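Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09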
Abstract
Gene Ontology annotations help explain life science phenomena and provide strong support for biomedical research. The use of the Gene Ontology is gradually changing the way people store and understand bioinformatic data. To facilitate the prediction of gene functions with the aid of text mining methods and existing resources, we cast the task as a multi-label top-down classification problem and develop a method that uses the hierarchical relationships in the Gene Ontology structure to relieve the quantitative imbalance between positive and negative training samples. The method also enhances the discriminating ability of the classifiers by retaining and highlighting the key training samples. Additionally, the top-down classifier, which is based on a tree structure, takes the relationships among target classes into consideration and thus resolves the incompatibility between the classification results and the Gene Ontology structure. Our experiment on the Gene Ontology annotation corpus achieves an F-value of 50.7% (precision: 52.7%; recall: 48.9%). The experimental results demonstrate that when the training set is small, it can be expanded via topological propagation of associated documents between the parent and child nodes in the tree structure. The top-down classification model applies to any set of texts organized in an ontology structure or with a hierarchical relationship.
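The ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy term identifiers, the `PARENT` map, the `accepts` callable standing in for a per-node classifier, and the choice to propagate documents upward from child to ancestor are all assumptions made for illustration, since the abstract does not specify these details.

```python
from collections import defaultdict

# Hypothetical toy hierarchy (child -> parent); a tree, as the abstract describes.
PARENT = {
    "GO:B": "GO:A",
    "GO:C": "GO:A",
    "GO:D": "GO:B",
}


def children_of(parent_map):
    """Invert a child -> parent map into parent -> list of children."""
    children = defaultdict(list)
    for child, parent in parent_map.items():
        children[parent].append(child)
    return children


def propagate_up(direct_docs, parent_map):
    """Expand each node's positive documents with those of its descendants.

    Assumption: documents annotated to a term also count as positives
    for every ancestor of that term.
    """
    expanded = {node: set(docs) for node, docs in direct_docs.items()}
    for node in list(direct_docs):
        ancestor = parent_map.get(node)
        while ancestor is not None:
            expanded.setdefault(ancestor, set()).update(direct_docs[node])
            ancestor = parent_map.get(ancestor)
    return expanded


def predict_top_down(doc, root, children, accepts):
    """Descend from the root, visiting a child only if it accepts the document.

    `accepts(node, doc)` is a hypothetical stand-in for a trained binary
    classifier at each node. A term is never predicted unless its parent
    was predicted, so the output stays consistent with the hierarchy.
    """
    predicted, frontier = [], [root]
    while frontier:
        node = frontier.pop()
        for child in children.get(node, []):
            if accepts(child, doc):
                predicted.append(child)
                frontier.append(child)
    return sorted(predicted)


def f_value(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)
```

Under these assumptions, `propagate_up` enlarges the positive set of sparsely annotated parent terms, and `predict_top_down` cannot return a child term without its parent. As a consistency check on the reported figures, `f_value(0.527, 0.489)` evaluates to roughly 0.507, matching the 50.7% F-value stated in the abstract.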
Pages: 7