CRF-based models of protein surfaces improve protein-protein interaction site predictions

被引:18
|
作者
Dong, Zhijie [1 ]
Wang, Keyu [1 ]
Truong Khanh Linh Dang [1 ]
Gueltas, Mehmet [2 ]
Welter, Marlon [1 ]
Wierschin, Torsten [3 ]
Stanke, Mario [3 ]
Waack, Stephan [1 ]
机构
[1] Univ Gottingen, Inst Comp Sci, D-37077 Gottingen, Germany
[2] Univ Gottingen, Inst Bioinformat, D-37077 Gottingen, Germany
[3] Inst Math & Informat, D-17487 Greifswald, Germany
来源
BMC BIOINFORMATICS | 2014年 / 15卷
关键词
RESIDUE CONSERVATION; SEQUENCE PROFILE; HOT-SPOTS; INTERFACES; COMPLEXES; IDENTIFY;
D O I
10.1186/1471-2105-15-277
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The identification of protein-protein interaction sites is a computationally challenging task and important for understanding the biology of protein complexes. There is a rich literature in this field. A broad class of approaches assign to each candidate residue a real-valued score that measures how likely it is that the residue belongs to the interface. The prediction is obtained by thresholding this score. Some probabilistic models classify the residues on the basis of the posterior probabilities. In this paper, we introduce pairwise conditional random fields (pCRFs) in which edges are not restricted to the backbone as in the case of linear-chain CRFs utilized by Li et al. (2007). In fact, any 3D-neighborhood relation can be modeled. On grounds of a generalized Viterbi inference algorithm and a piecewise training process for pCRFs, we demonstrate how to utilize pCRFs to enhance a given residue-wise score-based protein-protein interface predictor on the surface of the protein under study. The features of the pCRF are solely based on the interface predictions scores of the predictor the performance of which shall be improved. Results: We performed three sets of experiments with synthetic scores assigned to the surface residues of proteins taken from the data set PlaneDimers compiled by Zellner et al. (2011), from the list published by Keskin et al. (2004) and from the very recent data set due to Cukuroglu et al. (2014). That way we demonstrated that our pCRF-based enhancer is effective given the interface residue score distribution and the non-interface residue score are unimodal. Moreover, the pCRF-based enhancer is also successfully applicable, if the distributions are only unimodal over a certain sub-domain. The improvement is then restricted to that domain. Thus we were able to improve the prediction of the PresCont server devised by Zellner et al. (2011) on PlaneDimers. Conclusions: Our results strongly suggest that pCRFs form a methodological framework to improve residue-wise score-based protein-protein interface predictors given the scores are appropriately distributed. A prototypical implementation of our method is accessible at http://ppicrf.informatik.uni-goettingen.de/index.html.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Clustering protein-protein docking predictions
    Tong, W
    Weng, Z
    [J]. PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2004, 26 : 2999 - 3002
  • [22] Adapting to Complexity: Deep Learnable Architecture for Protein-protein Interaction Predictions
    Wu, Junzheng
    Paquet, Eric
    Viktor, Herna L.
    Michalowski, Wojtek
    [J]. MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2022, PT I, 2023, 13810 : 528 - 542
  • [23] Protein-protein docking predictions with RosettaDock
    Gray, JJ
    Baker, D
    [J]. BIOPHYSICAL JOURNAL, 2004, 86 (01) : 306A - 306A
  • [24] De novo signaling pathway predictions based on protein-protein interaction, targeted therapy and protein microarray analysis
    Ruths, Derek
    Tseng, Jen-Te
    Nakhleh, Luay
    Ram, Prahlad T.
    [J]. SYSTEMS BIOLOGY AND COMPUTATIONAL PROTEOMICS, 2007, 4532 : 108 - +
  • [25] Use of immunomatrix methods to improve protein-protein interaction detection
    Qoronfleh, MW
    Ren, L
    Emery, D
    Perr, M
    Kaboord, B
    [J]. JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2003, (05): : 291 - 298
  • [26] Protein-protein interaction based on pairwise similarity
    Nazar Zaki
    Sanja Lazarova-Molnar
    Wassim El-Hajj
    Piers Campbell
    [J]. BMC Bioinformatics, 10
  • [27] Protein-protein interaction based on pairwise similarity
    Zaki, Nazar
    Lazarova-Molnar, Sanja
    El-Hajj, Wassim
    Campbell, Piers
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [28] Protein-protein interaction and site prediction using transfer learning
    Liu, Tuoyu
    Gao, Han
    Ren, Xiaopu
    Xu, Guoshun
    Liu, Bo
    Wu, Ningfeng
    Luo, Huiying
    Wang, Yuan
    Tu, Tao
    Yao, Bin
    Guan, Feifei
    Teng, Yue
    Huang, Huoqing
    Tian, Jian
    [J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [29] Predicting Protein Phenotypes Based on Protein-Protein Interaction Network
    Hu, Lele
    Huang, Tao
    Liu, Xiao-Jun
    Cai, Yu-Dong
    [J]. PLOS ONE, 2011, 6 (03):
  • [30] An enhanced protein-protein interaction based on enzymatic complex through replacement of the recognition site
    Jeon, Sang Duck
    Kim, Su Jung
    Park, Sung Hyun
    Choi, Gi-Wook
    Han, Sung Ok
    [J]. INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2015, 75 : 1 - 6