Improving protein function prediction using domain and protein complexes in PPI networks

Cited by: 40
Authors
Peng, Wei [1, 2]
Wang, Jianxin [1 ]
Cai, Juan [1 ]
Chen, Lu [1 ]
Li, Min [1 ]
Wu, Fang-Xiang [1, 3, 4]
Affiliations
[1] Cent South Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650093, Yunnan, Peoples R China
[3] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK S7N 5A9, Canada
[4] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK S7N 5A9, Canada
Funding
National Natural Science Foundation of China;
Keywords
GENE ONTOLOGY; DATABASE; YEAST; GENERATION; ANNOTATION; ALGORITHM; MODULES;
DOI
10.1186/1752-0509-8-35
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: Characterizing unknown proteins through computational approaches is one of the most challenging problems in in silico biology and has attracted worldwide interest and considerable effort. The computational methods proposed so far are based either on homology mapping or on the context of protein interaction networks. Results: In this paper, two algorithms are proposed that integrate the protein-protein interaction (PPI) network, proteins' domain information and protein complexes. The first is domain combination similarity (DCS), which combines the domain compositions of both proteins and their neighbors. The second is domain combination similarity in context of protein complexes (DSCP), which extends the protein functional similarity definition of DCS by combining the domain compositions of both proteins and the complexes that include them. The new algorithms are tested on networks of the model species Saccharomyces cerevisiae, predicting the functions of unknown proteins under cross-validation. Compared with several existing algorithms, the results demonstrate the effectiveness of the proposed methods in protein function prediction. Furthermore, DSCP using experimentally determined complex data is robust when a large percentage of the proteins in the network are unknown, and it outperforms DCS and several other existing algorithms. Conclusions: The accuracy of protein function prediction can be improved by integrating the protein-protein interaction (PPI) network, proteins' domain information and protein complexes.
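The idea behind DCS, as the abstract describes it, is to compare proteins by the domains of each protein together with those of its neighbors. The sketch below illustrates that idea only; the abstract does not give the scoring formula, so the Jaccard index, the similarity-weighted vote over annotated neighbors, and every name and toy value here are assumptions, not the authors' definitions.

```python
from collections import defaultdict

# Hypothetical toy data: protein -> domain identifiers, protein -> PPI
# neighbors, protein -> known function terms. "U" is the unknown protein.
DOMAINS = {"U": {"d1", "d2"}, "A": {"d1", "d2"}, "B": {"d1"},
           "C": {"d3"}, "D": {"d4", "d5"}}
NEIGHBORS = {"U": {"A", "C"}, "A": {"U", "B"}, "B": {"A"},
             "C": {"U", "D"}, "D": {"C"}}
ANNOTATIONS = {"A": {"GO:1"}, "B": {"GO:1"}, "C": {"GO:2"}}


def context_domains(protein, domains, neighbors):
    """Pool a protein's own domains with those of its direct neighbors."""
    pooled = set(domains.get(protein, ()))
    for nb in neighbors.get(protein, ()):
        pooled |= set(domains.get(nb, ()))
    return pooled


def similarity(p, q, domains, neighbors):
    """Jaccard index of the pooled domain sets (an assumed stand-in score)."""
    a = context_domains(p, domains, neighbors)
    b = context_domains(q, domains, neighbors)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def predict_functions(target, domains, neighbors, annotations, top_k=1):
    """Rank function terms by similarity-weighted votes from neighbors."""
    scores = defaultdict(float)
    for nb in neighbors.get(target, ()):
        s = similarity(target, nb, domains, neighbors)
        for term in annotations.get(nb, ()):
            scores[term] += s
    # Sort by descending score, then by term name for a stable order.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:top_k]


if __name__ == "__main__":
    print(predict_functions("U", DOMAINS, NEIGHBORS, ANNOTATIONS))
```

A DSCP-style variant would presumably pool domains over the members of the complexes that contain each protein rather than over its network neighbors, which is a one-function change to `context_domains` in this sketch.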
Pages: 13