Development of a Fingerprint Reduction Approach for Bayesian Similarity Searching Based on Kullback-Leibler Divergence Analysis

被引:23
|
作者
Nisius, Britta [1 ]
Vogt, Martin [1 ]
Bajorath, Juergen [1 ]
机构
[1] Rhein Freidrich Wilhelms Univ Bonn, Dept Life Sci Informat, B IT, LIMES Program Unit Chem Biol & Med Chem, D-53113 Bonn, Germany
关键词
DIMENSIONAL DESCRIPTOR SPACES; ACTIVE COMPOUNDS; PERFORMANCE; MOLECULES; DATABASE; FUSION; 2D;
D O I
10.1021/ci900087y
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The contribution of individual fingerprint bit positions to similarity search performance is systematically evaluated. A method is introduced to determine bit significance on the basis of Kullback-Leibler divergence analysis of bit distributions in active and database compounds. Bit divergence analysis and Bayesian compound screening share a common methodological foundation. Hence, given the significance ranking of all individual bit positions comprising a fingerprint, subsets of bits are evaluated in the context of Bayesian screening, and minimal fingerprint representations are determined that meet or exceed the search performance of unmodified fingerprints. For fingerprints of different design evaluated on many compound activity classes, we consistently find that subsets of fingerprint bit positions are responsible for search performance. In part, these subsets are very small and contain in some cases only a few fingerprint bit positions. Structural or pharmacophore patterns captured by preferred bit positions can often be directly associated with characteristic features of active compounds. In some cases, reduced fingerprint representations clearly exceed the search performance of the original fingerprints. Thus, fingerprint reduction likely represents a promising approach for practical applications.
引用
收藏
页码:1347 / 1358
页数:12
相关论文
共 50 条
  • [31] Kullback-Leibler Divergence (KLD) Based Anomaly Detection and Monotonic Sequence Analysis
    Anderson, Alan
    Haas, Harald
    2011 IEEE VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2011,
  • [32] Texture similarity measure using Kullback-Leibler divergence between gamma distributions
    Mathiassen, JR
    Skavhaug, A
    Bo, K
    COMPUTER VISION - ECCV 2002 PT III, 2002, 2352 : 133 - 147
  • [33] Biological Data Outlier Detection Based on Kullback-Leibler Divergence
    Oh, Jung Hun
    Gao, Jean
    Rosenblatt, Kevin
    2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2008, : 249 - +
  • [34] HIF detection in distribution networks based on Kullback-Leibler divergence
    Nezamzadeh-Ejieh, Shiva
    Sadeghkhani, Iman
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2020, 14 (01) : 29 - 36
  • [35] A Robust Cooperative Spectrum Sensing Based on Kullback-Leibler Divergence
    Hiep Vu-Van
    Koo, Insoo
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2012, E95B (04) : 1286 - 1290
  • [36] Comparing Score-Based Methods for Estimating Bayesian Networks Using the Kullback-Leibler Divergence
    Kasza, Jessica
    Solomon, Patty
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2015, 44 (01) : 135 - 152
  • [37] A Kullback-Leibler Divergence Variant of the Bayesian Cram?r-Rao Bound
    Fauss, Michael
    Dytso, Alex
    Poor, H. Vincent
    SIGNAL PROCESSING, 2023, 207
  • [38] Deterministic sampling based on Kullback-Leibler divergence and its applications
    Wang, Sumin
    Sun, Fasheng
    STATISTICAL PAPERS, 2024, 65 (03) : 1411 - 1436
  • [39] Constrained ensemble Kalman filter based on Kullback-Leibler divergence
    Li, Ruoxia
    Jan, Nabil Magbool
    Huang, Biao
    Prasad, Vinay
    JOURNAL OF PROCESS CONTROL, 2019, 81 : 150 - 161
  • [40] Entropy and the Kullback-Leibler Divergence for Bayesian Networks: Computational Complexity and Efficient Implementation
    Scutari, Marco
    ALGORITHMS, 2024, 17 (01)