Statistical Potential for Modeling and Ranking of Protein-Ligand Interactions

被引:54
|
作者
Fan, Hao [1 ,2 ,3 ]
Schneidman-Duhovny, Dina [2 ,3 ]
Irwin, John J. [1 ,2 ]
Dong, Guangqiang [2 ,3 ]
Shoichet, Brian K. [1 ,2 ]
Sail, Andrej [2 ,3 ]
机构
[1] Univ Calif San Francisco, Dept Pharmaceut Chem, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Calif Inst Quantitat Biosci, San Francisco, CA 94143 USA
[3] Univ Calif San Francisco, Dept Bioengn & Therapeut Sci, Dept Pharmaceut Chem, San Francisco, CA 94143 USA
基金
美国国家卫生研究院;
关键词
STRUCTURE-DERIVED POTENTIALS; EMPIRICAL SCORING FUNCTIONS; KNOWLEDGE-BASED POTENTIALS; FREE-ENERGY CALCULATIONS; FORCE-FIELD; MEAN-FORCE; MOLECULAR RECOGNITION; BINDING-AFFINITY; AUTOMATED DOCKING; FLEXIBLE DOCKING;
D O I
10.1021/ci200377u
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Applications in structural biology and medicinal chemistry require protein-ligand scoring functions for two distinct tasks: (i) ranking different poses of a small molecule in a protein binding site and (ii) ranking different small molecules by their complementarity to a protein site. Using probability theory, we developed two atomic distance-dependent statistical scoring functions; PoseScore was optimized for recognizing native binding geometries of ligands from other poses and RankScore was optimized for distinguishing ligands from nonbinding molecules. Both scores are based on a set of 8,885 crystallographic structures of protein-ligand complexes but differ in the values of three key parameters. Factors influencing the accuracy of scoring were investigated, including the maximal atomic distance and non-native ligand geometries used for scoring, as well as the use of protein models instead of crystallographic structures for training and testing the scoring function. For the test set of 19 targets, RankScore improved the ligand enrichment (logAUC) and early enrichment (EF1) scores computed by DOCK 3.6 for 13 and 14 targets, respectively. In addition, RankScore performed better at rescoring than each of seven other scoring functions tested. Accepting both the crystal structure and decoy geometries with all-atom root-mean-square errors of up to 2 angstrom from the crystal structure as correct binding poses, PoseScore gave the best score to a correct binding pose among 100 decoys for 88% of all cases in a benchmark set containing 100 protein-ligand complexes. PoseScore accuracy is comparable to that of DrugScore(CSD) and ITScore/SE and superior to 12 other tested scoring functions. Therefore, RankScore can facilitate ligand discovery, by ranking complexes of the target with different small molecules; PoseScore can be used for protein-ligand complex structure prediction, by ranking different conformations of a given protein-ligand pair. The statistical potentials are available through the Integrative Modeling Platform (IMP) software package (http://salilab.org/imp/) and the LigScore Web server (http://salilab.org/ligscore/).
引用
收藏
页码:3078 / 3092
页数:15
相关论文
共 50 条