Advanced exact structure searching in large databases of chemical compounds

被引:19
|
作者
Trepalin, SV [1 ]
Skorenko, AV [1 ]
Balakin, KV [1 ]
Nasonov, AF [1 ]
Lang, SA [1 ]
Ivashchenko, AA [1 ]
Savchuk, NP [1 ]
机构
[1] Chem Divers Labs Inc, San Diego, CA 92121 USA
关键词
D O I
10.1021/ci025582d
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Efficient recognition of tautomeric compound forms in large corporate or commercially available compound databases is a difficult and labor intensive task. Our data indicate that up to 0.5% of commercially available compound collections for bioscreening contain tautomers. Though in the large registry databases, such as Beilstein and CAS, the tautomers are found in an automated fashion using high-performance computational technologies, their real-time recognition in the nonregistry corporate databases, as a rule, remains problematic. We have developed an effective algorithm for tautomer searching based on the proprietary chemoinformatics platform. This algorithm reduces the compound to a canonical structure. This feature enables rapid, automated computer searching of most of the known tautomeric transformations that occur in databases of organic compounds. Another useful extension of this methodology is related to the ability to effectively search for different forms of compounds that contain ionic and semipolar bonds. The computations are performed in the Windows environment on a standard personal computer, a very useful feature. The practical application of the proposed methodology is illustrated by several examples of successful recovery of tautomers and different forms of ionic compounds from real commercially available nonregistry databases.
引用
收藏
页码:852 / 860
页数:9
相关论文
共 50 条
  • [1] SEARCHING OF LARGE DATABASES OF CHEMICAL-REACTIONS
    MILNE, GWA
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1995, 210 : 65 - CINF
  • [2] Mixed text and structure searching of chemical databases.
    Delany, J
    Bradshaw, J
    Ford, M
    Lipkin, M
    Lippi, F
    Salt, D
    Sayle, R
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1999, 217 : U558 - U558
  • [3] STRUCTURE SEARCHING IN CHEMICAL DATABASES BY DIRECT LOOKUP METHODS
    CHRISTIE, BD
    LELAND, BA
    NOURSE, JG
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1993, 33 (04): : 545 - 547
  • [4] An overview of large chemical structure databases
    Stephen Heller
    [J]. Chemistry Central Journal, 2 (Suppl 1)
  • [5] Current strategies for searching through structure and chemical compound databases
    Fic, Grzegorz
    Skomra, Mariusz
    Debska, Barbara
    [J]. CHEMIK, 2016, 70 (08): : 415 - 418
  • [6] NUMERIC SEARCHING IN CHEMICAL DATABASES
    FAMINI, GR
    CROSIER, RB
    COON, PA
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1990, 199 : 36 - CINF
  • [7] Efficient maximum common subgraph (MCS) searching of large chemical databases
    Roger A Sayle
    Jose Batista
    J Andrew Grant
    [J]. Journal of Cheminformatics, 5 (Suppl 1)
  • [8] CROSSFILE SEARCHING IN COMPLEMENTARY CHEMICAL DATABASES
    WECKEND, BR
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1986, 192 : 55 - CINF
  • [9] A sequential approach for identifying lead compounds in large chemical databases
    Abt, M
    Lim, Y
    Sacks, J
    Xie, M
    Young, SS
    [J]. STATISTICAL SCIENCE, 2001, 16 (02) : 154 - 168
  • [10] CINF 102-Comparing chemical structure searching in multiple structural databases
    Walter, Donald
    Stewart, Bob
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2008, 235