Improving protein function prediction using domain and protein complexes in PPI networks

被引:40
|
作者
Peng, Wei [1 ,2 ]
Wang, Jianxin [1 ]
Cai, Juan [1 ]
Chen, Lu [1 ]
Li, Min [1 ]
Wu, Fang-Xiang [1 ,3 ,4 ]
机构
[1] Cent South Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650093, Yunnan, Peoples R China
[3] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK S7N 5A9, Canada
[4] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK S7N 5A9, Canada
基金
中国国家自然科学基金;
关键词
GENE ONTOLOGY; DATABASE; YEAST; GENERATION; ANNOTATION; ALGORITHM; MODULES;
D O I
10.1186/1752-0509-8-35
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Characterization of unknown proteins through computational approaches is one of the most challenging problems in silico biology, which has attracted world-wide interests and great efforts. There have been some computational methods proposed to address this problem, which are either based on homology mapping or in the context of protein interaction networks. Results: In this paper, two algorithms are proposed by integrating the protein-protein interaction (PPI) network, proteins' domain information and protein complexes. The one is domain combination similarity (DCS), which combines the domain compositions of both proteins and their neighbors. The other is domain combination similarity in context of protein complexes (DSCP), which extends the protein functional similarity definition of DCS by combining the domain compositions of both proteins and the complexes including them. The new algorithms are tested on networks of the model species of Saccharomyces cerevisiae to predict functions of unknown proteins using cross validations. Comparing with other several existing algorithms, the results have demonstrated the effectiveness of our proposed methods in protein function prediction. Furthermore, the algorithm DSCP using experimental determined complex data is robust when a large percentage of the proteins in the network is unknown, and it outperforms DCS and other several existing algorithms. Conclusions: The accuracy of predicting protein function can be improved by integrating the protein-protein interaction (PPI) network, proteins' domain information and protein complexes.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Semantic and layered protein function prediction from PPI networks
    Zhu, Wei
    Hou, Jingyu
    Chen, Yi-Ping Phoebe
    JOURNAL OF THEORETICAL BIOLOGY, 2010, 267 (02) : 129 - 136
  • [2] Protein Complexes Prediction via Positive and Unlabeled Learning of the PPI networks
    Zhao, Jichao
    Liang, Xun
    Wang, Yi
    Xu, Zhiming
    Liu, Yu
    2016 13TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, 2016,
  • [3] Protein function prediction using domain families
    Rentzsch, Robert
    Orengo, Christine A.
    BMC BIOINFORMATICS, 2013, 14
  • [4] Protein function prediction using domain families
    Robert Rentzsch
    Christine A Orengo
    BMC Bioinformatics, 14
  • [5] Exploiting Local and Global Context In PPI networks For Efficient Protein Function Prediction
    Kumar, D. Satheesh
    Goyal, Siddharth
    Reddy, V. Prashant
    Logane, Ramesh
    PROCEEDINGS OF THE THIRD ACM IKDD CONFERENCE ON DATA SCIENCES (CODS), 2016,
  • [6] Spectral clustering for detecting protein complexes in protein-protein interaction (PPI) networks
    Qin, Guimin
    Gao, Lin
    MATHEMATICAL AND COMPUTER MODELLING, 2010, 52 (11-12) : 2066 - 2074
  • [7] Spectral Clustering for Detecting Protein Complexes in PPI Networks
    Qin, Guimin
    Gao, Lin
    2009 FOURTH INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PROCEEDINGS, 2009, : 175 - 182
  • [8] WCOACH: Protein complex prediction in weighted PPI networks
    Kouhsar, Morteza
    Zare-Mirakabad, Fatemeh
    Jamali, Yousef
    GENES & GENETIC SYSTEMS, 2015, 90 (05) : 317 - 324
  • [9] Improving the prediction of yeast protein function using weighted protein-protein interactions
    Ahmed, Khaled S.
    Saloma, Nahed H.
    Kadah, Yasser M.
    THEORETICAL BIOLOGY AND MEDICAL MODELLING, 2011, 8
  • [10] Mining Protein Complexes from PPI Networks Using the Minimum Vertex Cut
    Xiaojun Ding 1
    2
    1. School of Information Science and Engineering
    2. Department of Computer Science
    Tsinghua Science and Technology, 2012, 17 (06) : 674 - 681