CRF-based models of protein surfaces improve protein-protein interaction site predictions

被引:18
|
作者
Dong, Zhijie [1 ]
Wang, Keyu [1 ]
Truong Khanh Linh Dang [1 ]
Gueltas, Mehmet [2 ]
Welter, Marlon [1 ]
Wierschin, Torsten [3 ]
Stanke, Mario [3 ]
Waack, Stephan [1 ]
机构
[1] Univ Gottingen, Inst Comp Sci, D-37077 Gottingen, Germany
[2] Univ Gottingen, Inst Bioinformat, D-37077 Gottingen, Germany
[3] Inst Math & Informat, D-17487 Greifswald, Germany
来源
BMC BIOINFORMATICS | 2014年 / 15卷
关键词
RESIDUE CONSERVATION; SEQUENCE PROFILE; HOT-SPOTS; INTERFACES; COMPLEXES; IDENTIFY;
D O I
10.1186/1471-2105-15-277
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The identification of protein-protein interaction sites is a computationally challenging task and important for understanding the biology of protein complexes. There is a rich literature in this field. A broad class of approaches assign to each candidate residue a real-valued score that measures how likely it is that the residue belongs to the interface. The prediction is obtained by thresholding this score. Some probabilistic models classify the residues on the basis of the posterior probabilities. In this paper, we introduce pairwise conditional random fields (pCRFs) in which edges are not restricted to the backbone as in the case of linear-chain CRFs utilized by Li et al. (2007). In fact, any 3D-neighborhood relation can be modeled. On grounds of a generalized Viterbi inference algorithm and a piecewise training process for pCRFs, we demonstrate how to utilize pCRFs to enhance a given residue-wise score-based protein-protein interface predictor on the surface of the protein under study. The features of the pCRF are solely based on the interface predictions scores of the predictor the performance of which shall be improved. Results: We performed three sets of experiments with synthetic scores assigned to the surface residues of proteins taken from the data set PlaneDimers compiled by Zellner et al. (2011), from the list published by Keskin et al. (2004) and from the very recent data set due to Cukuroglu et al. (2014). That way we demonstrated that our pCRF-based enhancer is effective given the interface residue score distribution and the non-interface residue score are unimodal. Moreover, the pCRF-based enhancer is also successfully applicable, if the distributions are only unimodal over a certain sub-domain. The improvement is then restricted to that domain. Thus we were able to improve the prediction of the PresCont server devised by Zellner et al. (2011) on PlaneDimers. Conclusions: Our results strongly suggest that pCRFs form a methodological framework to improve residue-wise score-based protein-protein interface predictors given the scores are appropriately distributed. A prototypical implementation of our method is accessible at http://ppicrf.informatik.uni-goettingen.de/index.html.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] CRF-based models of protein surfaces improve protein-protein interaction site predictions
    Zhijie Dong
    Keyu Wang
    Truong Khanh Linh Dang
    Mehmet Gültas
    Marlon Welter
    Torsten Wierschin
    Mario Stanke
    Stephan Waack
    [J]. BMC Bioinformatics, 15
  • [2] Protein-Protein Interaction Site Predictions with Three-Dimensional Probability Distributions of Interacting Atoms on Protein Surfaces
    Chen, Ching-Tai
    Peng, Hung-Pin
    Jian, Jhih-Wei
    Tsai, Keng-Chang
    Chang, Jeng-Yih
    Yang, Ei-Wen
    Chen, Jun-Bo
    Ho, Shinn-Ying
    Hsu, Wen-Lian
    Yang, An-Suei
    [J]. PLOS ONE, 2012, 7 (06):
  • [3] YAPPIS-Finder: A novel method for protein-protein interaction site predictions
    Kumar, Vicky
    Sood, Ashita
    Munshi, Anjana
    Gautam, Tarkeshwar
    Kulharia, Mahesh
    [J]. BIOINFORMATION, 2022, 18 (07) : 604 - 612
  • [4] Protein-protein interaction site predictions with minimum covariance determinant and Mahalanobis distance
    Qiu, Zhijun
    Zhou, Bo
    Yuan, Jiangfeng
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2017, 433 : 57 - 63
  • [5] Assessing predictions of protein-protein interaction: The CAPRI experiment
    Janin, J
    [J]. PROTEIN SCIENCE, 2005, 14 (02) : 278 - 283
  • [6] Geometric Deep Learning for Protein-Protein Interaction Predictions
    Lemieux, Gabriel St-Pierre
    Paquet, Eric
    Viktor, Herna L.
    Michalowski, Wojtek
    [J]. IEEE ACCESS, 2022, 10 : 90045 - 90055
  • [7] Transient pockets on protein surfaces involved in protein-protein interaction
    Eyrisch, Susanne
    Helms, Volkhard
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2007, 50 (15) : 3457 - 3464
  • [8] Protein-protein interaction site prediction based on conditional random fields
    Li, Ming-Hui
    Lin, Lei
    Wang, Xiao-Long
    Liu, Tao
    [J]. BIOINFORMATICS, 2007, 23 (05) : 597 - 604
  • [9] Effect of sampling on topology predictions of protein-protein interaction networks
    Jing-Dong J Han
    Denis Dupuy
    Nicolas Bertin
    Michael E Cusick
    Marc Vidal
    [J]. Nature Biotechnology, 2005, 23 : 839 - 844
  • [10] Effect of sampling on topology predictions of protein-protein interaction networks
    Han, JDJ
    Dupuy, D
    Bertin, N
    Cusick, ME
    Vidal, M
    [J]. NATURE BIOTECHNOLOGY, 2005, 23 (07) : 839 - 844