Machine learning meets mechanistic modelling for accurate prediction of experimental activation energies

被引:132
|
作者
Jorner, Kjell [1 ]
Brinck, Tore [2 ]
Norrby, Per-Ola [3 ]
Buttar, David [1 ]
机构
[1] AstraZeneca, Early Chem Dev, Pharmaceut Sci, R&D, Macclesfield, Cheshire, England
[2] KTH Royal Inst Technol, Dept Chem, Appl Phys Chem, CBH, Stockholm, Sweden
[3] AstraZeneca, Data Sci & Modelling, Pharmaceut Sci, R&D, Gothenburg, Sweden
基金
瑞典研究理事会;
关键词
NUCLEOPHILIC-SUBSTITUTION; ELECTROSTATIC POTENTIALS; REACTIVITY; REGIOSELECTIVITY; CLASSIFICATION; EFFICIENT;
D O I
10.1039/d0sc04896h
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Accurate prediction of chemical reactions in solution is challenging for current state-of-the-art approaches based on transition state modelling with density functional theory. Models based on machine learning have emerged as a promising alternative to address these problems, but these models currently lack the precision to give crucial information on the magnitude of barrier heights, influence of solvents and catalysts and extent of regio- and chemoselectivity. Here, we construct hybrid models which combine the traditional transition state modelling and machine learning to accurately predict reaction barriers. We train a Gaussian Process Regression model to reproduce high-quality experimental kinetic data for the nucleophilic aromatic substitution reaction and use it to predict barriers with a mean absolute error of 0.77 kcal mol(-1) for an external test set. The model was further validated on regio- and chemoselectivity prediction on patent reaction data and achieved a competitive top-1 accuracy of 86%, despite not being trained explicitly for this task. Importantly, the model gives error bars for its predictions that can be used for risk assessment by the end user. Hybrid models emerge as the preferred alternative for accurate reaction prediction in the very common low-data situation where only 100-150 rate constants are available for a reaction class. With recent advances in deep learning for quickly predicting barriers and transition state geometries from density functional theory, we envision that hybrid models will soon become a standard alternative to complement current machine learning approaches based on ground-state physical organic descriptors or structural information such as molecular graphs or fingerprints.
引用
收藏
页码:1163 / 1175
页数:13
相关论文
共 50 条
  • [21] Interactive-quantum-chemical-descriptors enabling accurate prediction of an activation energy through machine learning
    Mikami, Koichiro
    POLYMER, 2020, 203
  • [22] Machine learning prediction of accurate atomization energies of organic molecules from low-fidelity quantum chemical calculations
    Ward, Logan
    Blaiszik, Ben
    Foster, Ian
    Assary, Rajeev S.
    Narayanan, Badri
    Curtiss, Larry
    MRS COMMUNICATIONS, 2019, 9 (03) : 891 - 899
  • [23] Machine learning prediction of accurate atomization energies of organic molecules from low-fidelity quantum chemical calculations
    Logan Ward
    Ben Blaiszik
    Ian Foster
    Rajeev S. Assary
    Badri Narayanan
    Larry Curtiss
    MRS Communications, 2019, 9 : 891 - 899
  • [24] Machine Learning Prediction of H Adsorption Energies on Ag Alloys
    Hoyt, Robert A.
    Montemore, Matthew M.
    Fampiou, Ioanna
    Chen, Wei
    Tritsaris, Georgios
    Kaxiras, Efthimios
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (04) : 1357 - 1365
  • [25] Explainable artificial intelligence for machine learning prediction of bandgap energies
    Masuda, Taichi
    Tanabe, Katsuaki
    JOURNAL OF APPLIED PHYSICS, 2024, 136 (17)
  • [26] Machine learning prediction of interaction energies in rigid water clusters
    Bose, Samik
    Dhawan, Diksha
    Nandi, Sutanu
    Sarkar, Ram Rup
    Ghosh, Debashree
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2018, 20 (35) : 22987 - 22996
  • [27] Beyond the AJR: Radiomics Meets Machine Learning to Improve Outcome Prediction
    De Cecco, Carlo N.
    Monti, Caterina B.
    AMERICAN JOURNAL OF ROENTGENOLOGY, 2022, 219 (05) : 844 - 844
  • [29] A machine learning approach to the accurate prediction of monitor units for a compact proton machine
    Sun, Baozhou
    Lam, Dao
    Yang, Deshan
    Grantham, Kevin
    Zhang, Tiezhi
    Mutic, Sasa
    Zhao, Tianyu
    MEDICAL PHYSICS, 2018, 45 (05) : 2243 - 2251
  • [30] Comment on "Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning" Reply
    Rupp, Matthias
    Tkatchenko, Alexandre
    Mueller, Klaus-Robert
    von Lilienfeld, O. Anatole
    PHYSICAL REVIEW LETTERS, 2012, 109 (05)