The Poisson Index: a new probabilistic model for protein-ligand binding site similarity

Cited by: 15
Authors
Davies, J. R.
Jackson, R. M. [1]
Mardia, K. V.
Taylor, C. C.
Affiliations
[1] Univ Leeds, Sch Math, Leeds LS2 9JT, W Yorkshire, England
[2] Univ Leeds, Inst Mol & Cellular Biol, Leeds LS2 9JT, W Yorkshire, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
DOI
10.1093/bioinformatics/btm470
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: The large-scale comparison of protein-ligand binding sites is problematic, in that measures of structural similarity are difficult to quantify and are not easily understood in terms of statistical similarity that can ultimately be related to structure and function. We present a binding site matching score, the Poisson Index (PI), based upon a well-defined statistical model. PI requires only the number of matching atoms between two sites and the size of the two sites, the same information used by the Tanimoto Index (TI), a comparable and widely used measure for molecular similarity. We apply PI and TI to a previously automatically extracted set of binding sites to determine the robustness and usefulness of both scores.
Results: We found that PI outperforms TI; moreover, site similarity is poorly defined for TI at values around the 99.5% confidence level, for which PI is well defined. A difference map at this confidence level shows that PI gives much more meaningful information than TI. We show individual examples where TI fails to distinguish either a false or a true site pairing, in contrast to PI, which performs much better. TI cannot handle large or small sites very well, or the comparison of large and small sites, in contrast to PI, which is shown to be much more robust. Despite the difficulty of determining a biological ground truth for binding site similarity, we conclude that PI is a suitable measure of binding site similarity and could form the basis for a binding site classification scheme comparable to existing protein domain classification schema.
Availability: PI is implemented in SitesBase, www.modelling.leeds.ac.uk/sb/
Contact: r.m.jackson@leeds.ac.uk
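The abstract notes that TI uses only the number of matching atoms and the two site sizes, and that it struggles when a large site is compared with a small one. The sketch below illustrates that point using the standard Tanimoto definition, c / (a + b - c), which is an assumption here because the record itself does not state the formula. The function and parameter names are hypothetical, and the PI formula is not reproduced because it is not given in this record.

```python
def tanimoto_index(n_matched: int, size_a: int, size_b: int) -> float:
    """Tanimoto Index from a matched-atom count and two site sizes.

    Assumes the standard definition c / (a + b - c); this formula is
    not quoted from the record above.
    """
    if size_a <= 0 or size_b <= 0:
        raise ValueError("site sizes must be positive")
    if not 0 <= n_matched <= min(size_a, size_b):
        raise ValueError("matched atoms must lie between 0 and the smaller site size")
    return n_matched / (size_a + size_b - n_matched)


def max_tanimoto(size_a: int, size_b: int) -> float:
    """Upper bound on TI for two sites: reached when every atom of the
    smaller site is matched, giving min(a, b) / max(a, b)."""
    return min(size_a, size_b) / max(size_a, size_b)


if __name__ == "__main__":
    # A 10-atom site fully matched inside a 40-atom site.
    print(tanimoto_index(10, 10, 40))
    # The same 10 matched atoms between two 20-atom sites.
    print(tanimoto_index(10, 20, 20))
    # Ceiling on TI for the 10-atom versus 40-atom comparison.
    print(max_tanimoto(10, 40))
```

Under this definition a complete match of a small site within a much larger one is capped at the ratio of the two sizes, so the score mixes match quality with size mismatch. That is consistent with the abstract's remark about comparing large and small sites, although the paper's own analysis may rest on different arguments.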
Pages: 3001-3008
Page count: 8