Collective prediction of protein functions from protein-protein interaction networks

被引:4
|
作者
Wu, Qingyao [1 ,2 ]
Ye, Yunming [1 ,2 ]
Ng, Michael K. [3 ]
Ho, Shen-Shyang [4 ]
Shi, Ruichao [1 ,2 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Dept Comp Sci, Shenzhen, Peoples R China
[2] Internet Informat Collaborat, Shenzhen Key Lab, Shenzhen, Peoples R China
[3] Hong Kong Baptist Univ, Dept Math, Hong Kong, Hong Kong, Peoples R China
[4] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
来源
BMC BIOINFORMATICS | 2014年 / 15卷
关键词
protein function prediction; protein-protein interaction network; collective classification;
D O I
10.1186/1471-2105-15-S2-S9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Automated assignment of functions to unknown proteins is one of the most important task in computational biology. The development of experimental methods for genome scale analysis of molecular interaction networks offers new ways to infer protein function from protein-protein interaction (PPI) network data. Existing techniques for collective classification (CC) usually increase accuracy for network data, wherein instances are interlinked with each other, using a large amount of labeled data for training. However, the labeled data are time-consuming and expensive to obtain. On the other hand, one can easily obtain large amount of unlabeled data. Thus, more sophisticated methods are needed to exploit the unlabeled data to increase prediction accuracy for protein function prediction. Results: In this paper, we propose an effective Markov chain based CC algorithm (ICAM) to tackle the label deficiency problem in CC for interrelated proteins from PPI networks. Our idea is to model the problem using two distinct Markov chain classifiers to make separate predictions with regard to attribute features from protein data and relational features from relational information. The ICAM learning algorithm combines the results of the two classifiers to compute the ranks of labels to indicate the importance of a set of labels to an instance, and uses an ICA framework to iteratively refine the learning models for improving performance of protein function prediction from PPI networks in the paucity of labeled data. Conclusion: Experimental results on the real-world Yeast protein-protein interaction datasets show that our proposed ICAM method is better than the other ICA-type methods given limited labeled training data. This approach can serve as a valuable tool for the study of protein function prediction from PPI networks.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Collective prediction of protein functions from protein-protein interaction networks
    Qingyao Wu
    Yunming Ye
    Michael K Ng
    Shen-Shyang Ho
    Ruichao Shi
    [J]. BMC Bioinformatics, 15
  • [2] Global protein function prediction from protein-protein interaction networks
    Vazquez, A
    Flammini, A
    Maritan, A
    Vespignani, A
    [J]. NATURE BIOTECHNOLOGY, 2003, 21 (06) : 697 - 700
  • [3] Global protein function prediction from protein-protein interaction networks
    Alexei Vazquez
    Alessandro Flammini
    Amos Maritan
    Alessandro Vespignani
    [J]. Nature Biotechnology, 2003, 21 : 697 - 700
  • [4] Protein function prediction by collective classification with explicit and implicit edges in protein-protein interaction networks
    Xiong, Wei
    Liu, Hui
    Guan, Jihong
    Zhou, Shuigeng
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [5] Protein function prediction by collective classification with explicit and implicit edges in protein-protein interaction networks
    Wei Xiong
    Hui Liu
    Jihong Guan
    Shuigeng Zhou
    [J]. BMC Bioinformatics, 14
  • [6] Computational Methods to Predict Protein Functions from Protein-Protein Interaction Networks
    Zhao, Bihai
    Wang, Jianxin
    Wu, Fang-Xiang
    [J]. CURRENT PROTEIN & PEPTIDE SCIENCE, 2017, 18 (11) : 1120 - 1131
  • [7] Protein-protein Interaction Prediction via Collective Matrix Factorization
    Xu, Qian
    Xiang, Evan Wei
    Yang, Qiang
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 62 - 67
  • [8] Better Link Prediction for Protein-Protein Interaction Networks
    Yuen, Ho Yin
    Jansson, Jesper
    [J]. 2020 IEEE 20TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE 2020), 2020, : 53 - 60
  • [9] The Intrinsic Geometric Structure of Protein-Protein Interaction Networks for Protein Interaction Prediction
    Fang, Yi
    Sun, Mengtian
    Dai, Guoxian
    Ramani, Karthik
    [J]. INTELLIGENT COMPUTING IN BIOINFORMATICS, 2014, 8590 : 487 - 493
  • [10] The Intrinsic Geometric Structure of Protein-Protein Interaction Networks for Protein Interaction Prediction
    Fang, Yi
    Sun, Mengtian
    Dai, Guoxian
    Ramain, Karthik
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (01) : 76 - 85