On the design of optimal computer experiments to model solvent effects on reaction kinetics

Cited by: 1
Authors
Gui, Lingfeng [1 ,2 ]
Armstrong, Alan [3 ,4 ]
Galindo, Amparo [1 ,2 ]
Sayyed, Fareed Bhasha [5 ]
Kolis, Stanley P. [6 ]
Adjiman, Claire S. [1 ,2 ]
Affiliations
[1] Imperial Coll London, Sargent Ctr Proc Syst Engn, Dept Chem Engn, London SW7 2AZ, England
[2] Imperial Coll London, Inst Mol Sci & Engn, London SW7 2AZ, England
[3] Imperial Coll London, Dept Chem, White City Campus, London W12 0BZ, England
[4] Imperial Coll London, Inst Mol Sci & Engn, Mol Sci Res Hub, White City Campus, London W12 0BZ, England
[5] Eli Lilly Serv India Pvt Ltd, Synthet Mol Design & Dev, Bengaluru 560103, India
[6] Eli Lilly & Co, Lilly Corp Ctr, Synthet Mol Design & Dev, Indianapolis, IN 46285 USA
Source
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
SOLVATION ENERGY RELATIONSHIPS; SOLVATOCHROMIC PARAMETERS; CONSTANTS; SELECTION;
DOI
10.1039/d4me00074a
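Chinese Library Classification number
O64 [Physical chemistry (theoretical chemistry), chemical physics];
Discipline classification code
070304; 081704;
Abstract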
Developing an accurate predictive model of solvent effects on reaction kinetics is a challenging task, yet it can play an important role in process development. While first-principles or machine learning models are often compute- or data-intensive, simple surrogate models, such as multivariate linear or quadratic regression models, are useful when computational resources and data are scarce. The judicious choice of a small set of training data, i.e., a set of solvents in which quantum mechanical (QM) calculations of liquid-phase rate constants are to be performed, is critical to obtaining a reliable model. This is, however, made especially challenging by the highly irregular shape of the discrete space of possible experiments (solvent choices). In this work, we demonstrate that when choosing a set of computer experiments to generate training data, the D-optimality criterion value of the chosen set correlates well with the likelihood of achieving good model performance. With the Menshutkin reaction of pyridine and phenacyl bromide as a case study, this finding is further verified via the evaluation of the surrogate models regressed using D-optimal solvent sets generated from four distinct selection spaces. We also find that incorporating quadratic terms in the surrogate model and choosing the D-optimal solvent set from a selection space similar to the test set can significantly improve the accuracy of reaction rate constant predictions while using a small training dataset. Our approach holds promise for the use of statistical optimality criteria for other types of computer experiments, supporting the construction of surrogate models with reduced resource and data requirements. Model-based design of experiments using the D-optimality criterion can help select computer experiments to generate more information-rich training sets and lead to more reliable surrogate models that can be used for efficient molecular design.
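The abstract does not state how the D-optimal solvent sets are computed, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes the usual definition of the D-optimality criterion as det(XᵀX) for the regression matrix X of the chosen experiments, and it uses a simple point-exchange heuristic over a discrete candidate set. The descriptor values are random placeholders standing in for solvent properties, and all function names (`model_matrix`, `log_d_criterion`, `exchange_d_optimal`) are hypothetical.

```python
import numpy as np


def model_matrix(descriptors, quadratic=False):
    """Regression matrix: intercept, linear terms, optional squared terms."""
    cols = [np.ones(len(descriptors)), *descriptors.T]
    if quadratic:
        cols.extend((descriptors ** 2).T)
    return np.column_stack(cols)


def log_d_criterion(X):
    """log det(X^T X); -inf when the information matrix is singular."""
    sign, logdet = np.linalg.slogdet(X.T @ X)
    return logdet if sign > 0 else -np.inf


def exchange_d_optimal(X_all, n_select, seed=0, max_sweeps=50):
    """Point-exchange heuristic (an assumption, not the paper's algorithm).

    Starts from a random subset of candidate rows and swaps one chosen row
    for one unchosen row whenever the swap increases log det(X^T X).
    Returns a locally optimal subset, not a guaranteed global optimum.
    """
    rng = np.random.default_rng(seed)
    n = X_all.shape[0]
    chosen = [int(i) for i in rng.choice(n, size=n_select, replace=False)]
    best = log_d_criterion(X_all[chosen])
    for _ in range(max_sweeps):
        improved = False
        for pos in range(n_select):
            for cand in range(n):
                if cand in chosen:
                    continue
                trial = chosen.copy()
                trial[pos] = cand
                value = log_d_criterion(X_all[trial])
                if value > best + 1e-12:
                    chosen, best, improved = trial, value, True
        if not improved:
            break
    return sorted(chosen), best


# Placeholder data: 30 hypothetical candidate solvents, 3 made-up descriptors.
descriptors = np.random.default_rng(1).normal(size=(30, 3))
X_candidates = model_matrix(descriptors, quadratic=True)
selected, log_det = exchange_d_optimal(X_candidates, n_select=10)
print("selected candidate indices:", selected)
print("log det(X^T X):", log_det)
```

Working with the log-determinant avoids overflow for larger designs. In practice the candidate rows would come from real solvent descriptors, and several random restarts are commonly used because exchange heuristics can stall in local optima.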
Pages: 1254-1274
Number of pages: 21