Advanced exact structure searching in large databases of chemical compounds

被引:19
|
作者
Trepalin, SV [1 ]
Skorenko, AV [1 ]
Balakin, KV [1 ]
Nasonov, AF [1 ]
Lang, SA [1 ]
Ivashchenko, AA [1 ]
Savchuk, NP [1 ]
机构
[1] Chem Divers Labs Inc, San Diego, CA 92121 USA
关键词
D O I
10.1021/ci025582d
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Efficient recognition of tautomeric compound forms in large corporate or commercially available compound databases is a difficult and labor intensive task. Our data indicate that up to 0.5% of commercially available compound collections for bioscreening contain tautomers. Though in the large registry databases, such as Beilstein and CAS, the tautomers are found in an automated fashion using high-performance computational technologies, their real-time recognition in the nonregistry corporate databases, as a rule, remains problematic. We have developed an effective algorithm for tautomer searching based on the proprietary chemoinformatics platform. This algorithm reduces the compound to a canonical structure. This feature enables rapid, automated computer searching of most of the known tautomeric transformations that occur in databases of organic compounds. Another useful extension of this methodology is related to the ability to effectively search for different forms of compounds that contain ionic and semipolar bonds. The computations are performed in the Windows environment on a standard personal computer, a very useful feature. The practical application of the proposed methodology is illustrated by several examples of successful recovery of tautomers and different forms of ionic compounds from real commercially available nonregistry databases.
引用
收藏
页码:852 / 860
页数:9
相关论文
共 50 条
  • [31] Computer-aided discovery of compounds with combined mechanism of pharmacological action in large chemical databases
    Lagunin, AA
    Gomazkov, OA
    Filimonov, DA
    Solovyeva, NI
    Poroikov, VV
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2003, 225 : U560 - U560
  • [32] Variable selection for linear regression in large databases: exact methods
    Pacheco, Joaquin
    Casado, Silvia
    [J]. APPLIED INTELLIGENCE, 2021, 51 (06) : 3736 - 3756
  • [33] Variable selection for linear regression in large databases: exact methods
    Joaquín Pacheco
    Silvia Casado
    [J]. Applied Intelligence, 2021, 51 : 3736 - 3756
  • [34] SEARCHING PROTEIN-STRUCTURE DATABASES HAS COME OF AGE
    HOLM, L
    SANDER, C
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 19 (03): : 165 - 173
  • [35] Chemical structures and reactions in CAS databases - Searching for prior art.
    Toussant, MJ
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2002, 224 : U348 - U348
  • [36] Searching protein structure databases with DaliLite v.3
    Holm, L.
    Kaariainen, S.
    Rosenstrom, P.
    Schenkel, A.
    [J]. BIOINFORMATICS, 2008, 24 (23) : 2780 - 2781
  • [37] SEARCHING FOR INFORMATION ON CHEMICAL-SUBSTANCES IN SELECTED BIOMEDICAL BIBLIOGRAPHIC DATABASES
    LUDL, H
    SCHOPE, K
    MANGELSDORF, I
    [J]. CHEMOSPHERE, 1995, 31 (02) : 2611 - 2628
  • [38] Sorted Dominant Local Color for Searching Large and Heterogeneous Image Databases
    Vidal, Marcio Luiz A.
    Cavalcanti, Joao M. B.
    de Moura, Edleno Silva
    da Silva, Altigran Soares
    Torres, Ricardo da Silva
    [J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1960 - 1963
  • [39] Searching for pharmacophoric patterns in databases of three-dimensional chemical structures
    Willett, P
    [J]. JOURNAL OF MOLECULAR RECOGNITION, 1995, 8 (05) : 290 - 303
  • [40] A framework for the evaluation of chemical structure databases
    Cooke, F
    Schofield, H
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (05): : 1131 - 1140