Advanced exact structure searching in large databases of chemical compounds

被引:19
|
作者
Trepalin, SV [1 ]
Skorenko, AV [1 ]
Balakin, KV [1 ]
Nasonov, AF [1 ]
Lang, SA [1 ]
Ivashchenko, AA [1 ]
Savchuk, NP [1 ]
机构
[1] Chem Divers Labs Inc, San Diego, CA 92121 USA
关键词
D O I
10.1021/ci025582d
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Efficient recognition of tautomeric compound forms in large corporate or commercially available compound databases is a difficult and labor intensive task. Our data indicate that up to 0.5% of commercially available compound collections for bioscreening contain tautomers. Though in the large registry databases, such as Beilstein and CAS, the tautomers are found in an automated fashion using high-performance computational technologies, their real-time recognition in the nonregistry corporate databases, as a rule, remains problematic. We have developed an effective algorithm for tautomer searching based on the proprietary chemoinformatics platform. This algorithm reduces the compound to a canonical structure. This feature enables rapid, automated computer searching of most of the known tautomeric transformations that occur in databases of organic compounds. Another useful extension of this methodology is related to the ability to effectively search for different forms of compounds that contain ionic and semipolar bonds. The computations are performed in the Windows environment on a standard personal computer, a very useful feature. The practical application of the proposed methodology is illustrated by several examples of successful recovery of tautomers and different forms of ionic compounds from real commercially available nonregistry databases.
引用
收藏
页码:852 / 860
页数:9
相关论文
共 50 条
  • [41] RScan: fast searching structural similarities for structured RNAs in large databases
    Chenghai Xue
    Guo-Ping Liu
    [J]. BMC Genomics, 8
  • [42] RScan: fast searching structural similarities for structured RNAs in large databases
    Xue, Chenghai
    Liu, Guo-Ping
    [J]. BMC GENOMICS, 2007, 8 (1)
  • [43] STRUCTURAL MOLECULAR FORMULA FOR FLEXIBLE AND EFFICIENT SUBSTRUCTURE SEARCHING OF LARGE DATABASES
    DROMEY, RG
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1978, 18 (03): : 163 - 168
  • [44] CORRELATIVE ANALYSIS FOR EXPLORATION OF LARGE CHEMICAL DATABASES
    FISANICK, W
    LIPKUS, AH
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1994, 208 : 125 - COMP
  • [45] Experience with data handling in large chemical databases
    Langerman, Neal
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2015, 250
  • [46] Visualizing the structure of large relational databases
    Antis, JM
    Eick, SG
    Pyrce, JD
    [J]. IEEE SOFTWARE, 1996, 13 (01) : 72 - 79
  • [47] Mining and visualizing the chemical content of large databases
    Villar, Hugo O.
    Hansen, Mark R.
    Hodges, Jason
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2007, 233
  • [48] Mining and visualizing the chemical content of large databases
    Villar, Hugo O.
    Hansen, Mark R.
    [J]. CURRENT OPINION IN DRUG DISCOVERY & DEVELOPMENT, 2009, 12 (03) : 367 - 375
  • [49] Precision structure searching for chemical entities
    Cheeseman, Elaine
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 252
  • [50] INORGANIC CHEMICAL-STRUCTURE SEARCHING
    RUSCH, PF
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1986, 192 : 29 - CNIF