Prediction of protein-protein binding affinity using diverse protein-protein interface features

被引:7
|
作者
Ma, Duo [1 ]
Guo, Yanzhi [1 ]
Luo, Jiesi [1 ]
Pu, Xuemei [1 ]
Li, Menglong [1 ]
机构
[1] Sichuan Univ, Coll Chem, Chengdu 610064, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein-protein interaction; Binding; Affinity prediction; Random forest; Feature importance evaluation; FREE-ENERGY CALCULATIONS; EVOLUTIONARY CONSERVATION; COMPUTATIONAL DESIGN; MEAN FORCE; HOT-SPOTS; LIGAND; FLEXIBILITY; TRANSIENT;
D O I
10.1016/j.chemolab.2014.07.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Protein-protein interactions play fundamental roles in almost all biological processes. Determining the protein-protein binding affinity has been recognized not only as an important step but also as a challenging task for further understanding of the molecular mechanism and the modeling of the biological systems. Unlike the traditional methods like empirical scoring algorithms and molecular dynamic which are time consuming, we developed a fast and reliable machine learning method for the prediction of protein-protein binding affinity. Based on diverse protein-protein interface features calculated using commonly used available tools, 432 features were obtained to represent hydrogen bond, Van der Waals force, hydrophobic interaction, electrostatic force, interface shape and configuration and allosteric effect. Considering the limited number of the available structures and affinity-known protein complexes, in order to avoid overfitting and remove noises in the feature set, feature importance evaluation was implemented and 154 optimal features were selected, then the prediction model based on random forest (RF) was constructed. We demonstrate that the RE model yields promising results and the predictive power of our method is better than other existing methods. Using leave-one-out cross-validation, our model gives a correlation coefficient (r) of 0.708 on the whole benchmark dataset of 133 complexes and a high r of 0.806 on the validated set of 53 samples. When performing the same two independent datasets, our method outperforms other two methods and achieves a high r of 0.793 and 0.907 respectively. All results indicate that our method can be a useful implement in determining protein-protein binding affinity. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:7 / 13
页数:7
相关论文
共 50 条
  • [2] Prediction of protein-protein binding affinity using diverse protein-protein interface features
    Ma, Duo
    Guo, Yanzhi
    Luo, Jiesi
    Pu, Xuemei
    Li, Menglong
    Chemometrics and Intelligent Laboratory Systems, 2014, 138 : 7 - 13
  • [3] Protein-protein binding affinity prediction on a diverse set of structures
    Moal, Iain H.
    Agius, Rudi
    Bates, Paul A.
    BIOINFORMATICS, 2011, 27 (21) : 3002 - 3009
  • [4] Prediction of structure and binding affinity of protein-protein complexes
    Bonvin, Alexandre M. J. J.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 245
  • [5] A functional feature analysis on diverse protein-protein interactions: application for the prediction of binding affinity
    Luo, Jiesi
    Guo, Yanzhi
    Zhong, Yun
    Ma, Duo
    Li, Wenling
    Li, Menglong
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2014, 28 (06) : 619 - 629
  • [6] Protein docking prediction using predicted protein-protein interface
    Bin Li
    Daisuke Kihara
    BMC Bioinformatics, 13
  • [7] Protein docking prediction using predicted protein-protein interface
    Li, Bin
    Kihara, Daisuke
    BMC BIOINFORMATICS, 2012, 13
  • [8] BeAtMuSiC: prediction of changes in protein-protein binding affinity on mutations
    Dehouck, Yves
    Kwasigroch, Jean Marc
    Rooman, Marianne
    Gilis, Dimitri
    NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) : W333 - W339
  • [9] Machine learning methods for protein-protein binding affinity prediction in protein design
    Guo, Zhongliang
    Yamaguchi, Rui
    FRONTIERS IN BIOINFORMATICS, 2022, 2
  • [10] Discrimination and Prediction of Protein-Protein Binding Affinity Using Deep Learning Approach
    Nikam, Rahul
    Yugandhar, K.
    Gromiha, M. Michael
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II, 2018, 10955 : 809 - 815