Accurate prediction of chemical reactions in solution is challenging for current state-of-the-art approaches based on transition state modelling with density functional theory. Models based on machine learning have emerged as a promising alternative to address these problems, but these models currently lack the precision to give crucial information on the magnitude of barrier heights, influence of solvents and catalysts and extent of regio- and chemoselectivity. Here, we construct hybrid models which combine the traditional transition state modelling and machine learning to accurately predict reaction barriers. We train a Gaussian Process Regression model to reproduce high-quality experimental kinetic data for the nucleophilic aromatic substitution reaction and use it to predict barriers with a mean absolute error of 0.77 kcal mol(-1) for an external test set. The model was further validated on regio- and chemoselectivity prediction on patent reaction data and achieved a competitive top-1 accuracy of 86%, despite not being trained explicitly for this task. Importantly, the model gives error bars for its predictions that can be used for risk assessment by the end user. Hybrid models emerge as the preferred alternative for accurate reaction prediction in the very common low-data situation where only 100-150 rate constants are available for a reaction class. With recent advances in deep learning for quickly predicting barriers and transition state geometries from density functional theory, we envision that hybrid models will soon become a standard alternative to complement current machine learning approaches based on ground-state physical organic descriptors or structural information such as molecular graphs or fingerprints.
机构:
Argonne Natl Lab, Data Sci & Learning Div, Lemont, IL 60439 USA
Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USAArgonne Natl Lab, Data Sci & Learning Div, Lemont, IL 60439 USA
Ward, Logan
论文数: 引用数:
h-index:
机构:
Blaiszik, Ben
论文数: 引用数:
h-index:
机构:
Foster, Ian
Assary, Rajeev S.
论文数: 0引用数: 0
h-index: 0
机构:
Argonne Natl Lab, JCESR, Lemont, IL USA
Argonne Natl Lab, Mat Sci Div, Lemont, IL USAArgonne Natl Lab, Data Sci & Learning Div, Lemont, IL 60439 USA
Assary, Rajeev S.
Narayanan, Badri
论文数: 0引用数: 0
h-index: 0
机构:
Argonne Natl Lab, Mat Sci Div, Lemont, IL USA
Univ Louisville, Dept Mech Engn, Louisville, KY 40292 USAArgonne Natl Lab, Data Sci & Learning Div, Lemont, IL 60439 USA
Narayanan, Badri
Curtiss, Larry
论文数: 0引用数: 0
h-index: 0
机构:
Argonne Natl Lab, JCESR, Lemont, IL USA
Argonne Natl Lab, Mat Sci Div, Lemont, IL USAArgonne Natl Lab, Data Sci & Learning Div, Lemont, IL 60439 USA
机构:
Harvard Univ, Dept Phys, Cambridge, MA 02138 USAHarvard Univ, Dept Phys, Cambridge, MA 02138 USA
Hoyt, Robert A.
Montemore, Matthew M.
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Chem & Chem Biol, Cambridge, MA 02138 USA
Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USAHarvard Univ, Dept Phys, Cambridge, MA 02138 USA
Montemore, Matthew M.
Fampiou, Ioanna
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Chem & Chem Biol, Cambridge, MA 02138 USAHarvard Univ, Dept Phys, Cambridge, MA 02138 USA
Fampiou, Ioanna
Chen, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Phys, Cambridge, MA 02138 USA
Harvard Univ, Dept Chem & Chem Biol, Cambridge, MA 02138 USAHarvard Univ, Dept Phys, Cambridge, MA 02138 USA
Chen, Wei
论文数: 引用数:
h-index:
机构:
Tritsaris, Georgios
Kaxiras, Efthimios
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Phys, Cambridge, MA 02138 USA
Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USAHarvard Univ, Dept Phys, Cambridge, MA 02138 USA
机构:
Indian Assoc Cultivat Sci, Sch Math & Computat Sci, Kolkata 700032, W Bengal, IndiaIndian Assoc Cultivat Sci, Sch Math & Computat Sci, Kolkata 700032, W Bengal, India
Bose, Samik
Dhawan, Diksha
论文数: 0引用数: 0
h-index: 0
机构:
Indian Assoc Cultivat Sci, Sch Math & Computat Sci, Kolkata 700032, W Bengal, IndiaIndian Assoc Cultivat Sci, Sch Math & Computat Sci, Kolkata 700032, W Bengal, India
Dhawan, Diksha
论文数: 引用数:
h-index:
机构:
Nandi, Sutanu
Sarkar, Ram Rup
论文数: 0引用数: 0
h-index: 0
机构:
CSIR Natl Chem Lab, CEPD, Pune 411008, Maharashtra, India
Acad Sci & Innovat Res AcSIR, CSIR NCL Campus, Pune, Maharashtra, IndiaIndian Assoc Cultivat Sci, Sch Math & Computat Sci, Kolkata 700032, W Bengal, India
Sarkar, Ram Rup
Ghosh, Debashree
论文数: 0引用数: 0
h-index: 0
机构:
Indian Assoc Cultivat Sci, Sch Math & Computat Sci, Kolkata 700032, W Bengal, IndiaIndian Assoc Cultivat Sci, Sch Math & Computat Sci, Kolkata 700032, W Bengal, India