Constrained Bayesian optimization for automatic chemical design using variational autoencoders

被引:187
|
作者
Griffiths, Ryan-Rhys [1 ]
Hernandez-Lobato, Jose Miguel [2 ,3 ,4 ]
机构
[1] Univ Cambridge, Dept Phys, Cavendish Lab, Cambridge, England
[2] Univ Cambridge, Dept Engn, Cambridge, England
[3] Alan Turing Inst, London, England
[4] Microsoft Res, Cambridge, England
关键词
ORGANIC PHOTOVOLTAICS; SCREENING LIBRARIES; DRUG DISCOVERY; CHEMISTRY; MODEL;
D O I
10.1039/c9sc04026a
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Automatic Chemical Design is a framework for generating novel molecules with optimized properties. The original scheme, featuring Bayesian optimization over the latent space of a variational autoencoder, suffers from the pathology that it tends to produce invalid molecular structures. First, we demonstrate empirically that this pathology arises when the Bayesian optimization scheme queries latent space points far away from the data on which the variational autoencoder has been trained. Secondly, by reformulating the search procedure as a constrained Bayesian optimization problem, we show that the effects of this pathology can be mitigated, yielding marked improvements in the validity of the generated molecules. We posit that constrained Bayesian optimization is a good approach for solving this kind of training set mismatch in many generative tasks involving Bayesian optimization over the latent space of a variational autoencoder.
引用
收藏
页码:577 / 586
页数:10
相关论文
共 50 条
  • [1] Constrained Graph Variational Autoencoders for Molecule Design
    Liu, Qi
    Allamanis, Miltiadis
    Brockschmidt, Marc
    Gaunt, Alexander L.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [2] Conditional Constrained Graph Variational Autoencoders for Molecule Design
    Rigoni, Davide
    Navarin, Nicola
    Sperduti, Alessandro
    [J]. 2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 729 - 736
  • [3] Constrained Bayesian Optimization for Automatic Underwater Vehicle Hull Design
    Vardhan, Harsh
    Volgyesi, Peter
    Hedgecock, Will
    Sztipanovits, Janos
    [J]. 2023 CYBER-PHYSICAL SYSTEMS AND INTERNET-OF-THINGS WEEK, CPS-IOT WEEK WORKSHOPS, 2023, : 116 - 121
  • [4] A Bayesian Nonlinear Reduced Order Modeling Using Variational AutoEncoders
    Akkari, Nissrine
    Casenave, Fabien
    Hachem, Elie
    Ryckelynck, David
    [J]. FLUIDS, 2022, 7 (10)
  • [5] 3D SHAPE SYNTHESIS FOR CONCEPTUAL DESIGN AND OPTIMIZATION USING VARIATIONAL AUTOENCODERS
    Zhang, Wentai
    Yang, Zhangsihao
    Jiang, Haoliang
    Nigam, Suyash
    Yamakawa, Soji
    Furuhata, Tomotake
    Shimada, Kenji
    Kara, Levent Burak
    [J]. PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 2A, 2020,
  • [6] Optimizing training trajectories in variational autoencoders via latent Bayesian optimization approach *
    Biswas, Arpan
    Vasudevan, Rama
    Ziatdinov, Maxim
    Kalinin, Sergei, V
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (01):
  • [7] Statistical Guarantees for Variational Autoencoders using PAC-Bayesian Theory
    Mbacke, Sokhna Diarra
    Clerc, Florence
    Germain, Pascal
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders
    Stanton, Samuel
    Maddox, Wesley J.
    Gruver, Nate
    Maffettone, Phillip
    Delaney, Emily
    Greenside, Peyton
    Wilson, Andrew Gordon
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [9] Variational Bayesian Methods for Stochastically Constrained System Design Problems
    Jaiswal, Prateek
    Honnappa, Harsh
    Rao, Vinayak A.
    [J]. SYMPOSIUM ON ADVANCES IN APPROXIMATE BAYESIAN INFERENCE, VOL 118, 2019, 118
  • [10] Design of metalloproteins and novel protein folds using variational autoencoders
    Joe G. Greener
    Lewis Moffat
    David T Jones
    [J]. Scientific Reports, 8