Protein-Protein Interaction Site Predictions with Three-Dimensional Probability Distributions of Interacting Atoms on Protein Surfaces

被引:25
|
作者
Chen, Ching-Tai [1 ,4 ,5 ]
Peng, Hung-Pin [1 ,2 ,5 ]
Jian, Jhih-Wei [1 ,2 ,5 ]
Tsai, Keng-Chang [1 ]
Chang, Jeng-Yih [1 ]
Yang, Ei-Wen [1 ]
Chen, Jun-Bo [1 ,3 ]
Ho, Shinn-Ying [4 ]
Hsu, Wen-Lian
Yang, An-Suei [1 ]
机构
[1] Acad Sinica, Genom Res Ctr, Taipei 115, Taiwan
[2] Natl Yang Ming Univ, Inst Biomed Informat, Taipei 112, Taiwan
[3] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30043, Taiwan
[4] Natl Chiao Tung Univ, Inst Bioinformat & Syst Biol, Hsinchu, Taiwan
[5] Acad Sinica, Inst Informat Sci, Taiwan Int Grad Program, Bioinformat Program, Taipei, Taiwan
来源
PLOS ONE | 2012年 / 7卷 / 06期
关键词
INTERACTION PARTNERS; HOT-SPOTS; RECOGNITION; INTERFACES; PREFERENCES; CLASSIFIER; PRINCIPLES; IDENTIFY; REGIONS;
D O I
10.1371/journal.pone.0037706
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein-protein interactions are key to many biological processes. Computational methodologies devised to predict protein-protein interaction (PPI) sites on protein surfaces are important tools in providing insights into the biological functions of proteins and in developing therapeutics targeting the protein-protein interaction sites. One of the general features of PPI sites is that the core regions from the two interacting protein surfaces are complementary to each other, similar to the interior of proteins in packing density and in the physicochemical nature of the amino acid composition. In this work, we simulated the physicochemical complementarities by constructing three-dimensional probability density maps of non-covalent interacting atoms on the protein surfaces. The interacting probabilities were derived from the interior of known structures. Machine learning algorithms were applied to learn the characteristic patterns of the probability density maps specific to the PPI sites. The trained predictors for PPI sites were cross-validated with the training cases (consisting of 432 proteins) and were tested on an independent dataset (consisting of 142 proteins). The residue-based Matthews correlation coefficient for the independent test set was 0.423; the accuracy, precision, sensitivity, specificity were 0.753, 0.519, 0.677, and 0.779 respectively. The benchmark results indicate that the optimized machine learning models are among the best predictors in identifying PPI sites on protein surfaces. In particular, the PPI site prediction accuracy increases with increasing size of the PPI site and with increasing hydrophobicity in amino acid composition of the PPI interface; the core interface regions are more likely to be recognized with high prediction confidence. The results indicate that the physicochemical complementarity patterns on protein surfaces are important determinants in PPIs, and a substantial portion of the PPI sites can be predicted correctly with the physicochemical complementarity features based on the non-covalent interaction data derived from protein interiors.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Prediction of FMN-binding residues with three-dimensional probability distributions of interacting atoms on protein surfaces
    Mahalingam, Rajasekaran
    Peng, Hung-Pin
    Yang, An-Suei
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2014, 343 : 154 - 161
  • [2] Prediction of fatty acid-binding residues on protein surfaces with three-dimensional probability distributions of interacting atoms
    Mahalingam, Rajasekaran
    Peng, Hung-Pin
    Yang, An-Suei
    [J]. BIOPHYSICAL CHEMISTRY, 2014, 192 : 10 - 19
  • [3] CRF-based models of protein surfaces improve protein-protein interaction site predictions
    Zhijie Dong
    Keyu Wang
    Truong Khanh Linh Dang
    Mehmet Gültas
    Marlon Welter
    Torsten Wierschin
    Mario Stanke
    Stephan Waack
    [J]. BMC Bioinformatics, 15
  • [4] CRF-based models of protein surfaces improve protein-protein interaction site predictions
    Dong, Zhijie
    Wang, Keyu
    Truong Khanh Linh Dang
    Gueltas, Mehmet
    Welter, Marlon
    Wierschin, Torsten
    Stanke, Mario
    Waack, Stephan
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [5] SpatialPPI: Three-dimensional space protein-protein interaction prediction with AlphaFold Multimer
    Hu, Wenxing
    Ohue, Masahito
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 1214 - 1225
  • [6] Predicting Ligand Binding Sites on Protein Surfaces by 3-Dimensional Probability Density Distributions of Interacting Atoms
    Jian, Jhih-Wei
    Elumalai, Pavadai
    Pitti, Thejkiran
    Wu, Chih Yuan
    Tsai, Keng-Chang
    Chang, Jeng-Yih
    Peng, Hung-Pin
    Yang, An-Suei
    [J]. PLOS ONE, 2016, 11 (08):
  • [7] Prediction of Carbohydrate Binding Sites on Protein Surfaces with 3-Dimensional Probability Density Distributions of Interacting Atoms
    Tsai, Keng-Chang
    Jian, Jhih-Wei
    Yang, Ei-Wen
    Hsu, Po-Chiang
    Peng, Hung-Pin
    Chen, Ching-Tai
    Chen, Jun-Bo
    Chang, Jeng-Yih
    Hsu, Wen-Lian
    Yang, An-Suei
    [J]. PLOS ONE, 2012, 7 (07):
  • [8] YAPPIS-Finder: A novel method for protein-protein interaction site predictions
    Kumar, Vicky
    Sood, Ashita
    Munshi, Anjana
    Gautam, Tarkeshwar
    Kulharia, Mahesh
    [J]. BIOINFORMATION, 2022, 18 (07) : 604 - 612
  • [9] Protein-protein interaction site predictions with minimum covariance determinant and Mahalanobis distance
    Qiu, Zhijun
    Zhou, Bo
    Yuan, Jiangfeng
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2017, 433 : 57 - 63
  • [10] Assessing predictions of protein-protein interaction: The CAPRI experiment
    Janin, J
    [J]. PROTEIN SCIENCE, 2005, 14 (02) : 278 - 283