The Synthesizability of Molecules Proposed by Generative Models

被引:157
|
作者
Gao, Wenhao [1 ,2 ]
Coley, Connor W. [1 ,3 ]
机构
[1] MIT, Dept Chem Engn, Cambridge, MA 02139 USA
[2] Johns Hopkins Univ, Dept Chem & Biomol Engn, Baltimore, MD 21218 USA
[3] Broad Inst Harvard & MIT, Cambridge, MA 02142 USA
关键词
SYNTHETIC ACCESSIBILITY; DRUG; DESIGN; COMPLEXITY; DISCOVERY; CHEMISTS; ZINC; TOOL;
D O I
10.1021/acs.jcim.0c00174
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The discovery of functional molecules is an expensive and time-consuming process, exemplified by the rising costs of small molecule therapeutic discovery. One class of techniques of growing interest for early stage drug discovery is de novo molecular generation and optimization, catalyzed by the development of new deep learning approaches. These techniques can suggest novel molecular structures intended to maximize a multiobjective function, e.g., suitability as a therapeutic against a particular target, without relying on brute-force exploration of a chemical space. However, the utility of these approaches is stymied by ignorance of synthesizability. To highlight the severity of this issue, we use a data-driven computer-aided synthesis planning program to quantify how often molecules proposed by state-of-the-art generative models cannot be readily synthesized. Our analysis demonstrates that there are several tasks for which these models generate unrealistic molecular structures despite performing well on popular quantitative benchmarks. Synthetic complexity heuristics can successfully bias generation toward synthetically tractable chemical space, although doing so necessarily detracts from the primary objective. This analysis suggests that to improve the utility of these models in real discovery workflows, new algorithm development is warranted.
引用
收藏
页码:5714 / 5723
页数:10
相关论文
共 50 条
  • [1] Probabilistic generative transformer language models for generative design of molecules
    Wei, Lai
    Fu, Nihang
    Song, Yuqi
    Wang, Qian
    Hu, Jianjun
    [J]. JOURNAL OF CHEMINFORMATICS, 2023, 15 (01)
  • [2] Probabilistic generative transformer language models for generative design of molecules
    Lai Wei
    Nihang Fu
    Yuqi Song
    Qian Wang
    Jianjun Hu
    [J]. Journal of Cheminformatics, 15
  • [3] Computational Discovery of TTF Molecules with Deep Generative Models
    Yakubovich, Alexander
    Odinokov, Alexey
    Nikolenko, Sergey
    Jung, Yongsik
    Choi, Hyeonho
    [J]. FRONTIERS IN CHEMISTRY, 2021, 9
  • [4] Frechet ChemNet Distance: A Metric for Generative Models for Molecules in Drug Discovery
    Preuer, Kristina
    Renz, Philipp
    Unterthiner, Thomas
    Hochreiter, Sepp
    Klambauer, Guenter
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2018, 58 (09) : 1736 - 1741
  • [5] Target-specific novel molecules with their recipe: Incorporating synthesizability in the design process
    Krishnan, Sowmya Ramaswamy
    Bung, Navneet
    Srinivasan, Rajgopal
    Roy, Arijit
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2024, 129
  • [6] Can androids dream of electric molecules? Generative models for chemistry and molecular properties
    Aspuru-Guzik, Alan
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [7] Generative Models
    Sim-Hui Tee
    [J]. Erkenntnis, 2023, 88 : 23 - 41
  • [8] Generative Models
    Tee, Sim-Hui
    [J]. ERKENNTNIS, 2023, 88 (01) : 23 - 41
  • [9] Generative Models Should at Least Be Able to Design Molecules That Dock Well: A New Benchmark
    Cieplinski, Tobiasz
    Danel, Tomasz
    Podlewska, Sabina
    Jastrzebski, Stanislaw
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (11) : 3238 - 3247
  • [10] Dependability analysis using a fault injection tool based on synthesizability of HDL models
    Zarandi, HR
    Miremadi, SG
    Ejlali, A
    [J]. 18TH IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI SYSTEMS, PROCEEDINGS, 2003, : 485 - 492