Machine Learning for Chemical Reactivity: The Importance of Failed Experiments

被引:82
|
作者
Strieth-Kalthoff, Felix [1 ]
Sandfort, Frederik [1 ]
Kuhnemund, Marius [2 ]
Schaefer, Felix R. [1 ]
Kuchen, Herbert [2 ]
Glorius, Frank [1 ]
机构
[1] Westfalische Wilhelms Univ Munster, Organ Chem Inst, Corrensstr 40, D-48149 Munster, Germany
[2] Westfalische Wilhelms Univ Munster, Dept Informat Syst, Leonardo Campus 3, D-48149 Munster, Germany
关键词
Cross-Coupling; Data Bias; Machine Learning; Reaction Data; Yield Prediction; NEURAL-NETWORKS; PREDICTION; BIAS;
D O I
10.1002/anie.202204647
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Assessing the outcomes of chemical reactions in a quantitative fashion has been a cornerstone across all synthetic disciplines. Classically approached through empirical optimization, data-driven modelling bears an enormous potential to streamline this process. However, such predictive models require significant quantities of high-quality data, the availability of which is limited: Main reasons for this include experimental errors and, importantly, human biases regarding experiment selection and result reporting. In a series of case studies, we investigate the impact of these biases for drawing general conclusions from chemical reaction data, revealing the utmost importance of "negative" examples. Eventually, case studies into data expansion approaches showcase directions to circumvent these limitations-and demonstrate perspectives towards a long-term data quality enhancement in chemistry.
引用
下载
收藏
页数:7
相关论文
共 50 条
  • [1] Machine-learning-assisted materials discovery using failed experiments
    Paul Raccuglia
    Katherine C. Elbert
    Philip D. F. Adler
    Casey Falk
    Malia B. Wenny
    Aurelio Mollo
    Matthias Zeller
    Sorelle A. Friedler
    Joshua Schrier
    Alexander J. Norquist
    Nature, 2016, 533 : 73 - 76
  • [2] Machine-learning-assisted materials discovery using failed experiments
    Raccuglia, Paul
    Elbert, Katherine C.
    Adler, Philip D. F.
    Falk, Casey
    Wenny, Malia B.
    Mollo, Aurelio
    Zeller, Matthias
    Friedler, Sorelle A.
    Schrier, Joshua
    Norquist, Alexander J.
    NATURE, 2016, 533 (7601) : 73 - +
  • [3] Putting Chemical Knowledge to Work in Machine Learning for Reactivity
    Jorner, Kjell
    CHIMIA, 2023, 77 (1-2) : 22 - 30
  • [4] Machine learning of chemical reactivity from databases of organic reactions
    Carrera, Goncalo V. S. M.
    Gupta, Sunil
    Aires-de-Sousa, Joao
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2009, 23 (07) : 419 - 429
  • [5] Machine learning of chemical reactivity from databases of organic reactions
    Gonçalo V. S. M. Carrera
    Sunil Gupta
    João Aires-de-Sousa
    Journal of Computer-Aided Molecular Design, 2009, 23 : 419 - 429
  • [6] Dark reactions project: Machine learning-assisted materials discovery using failed experiments
    Schrier, Joshua
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [7] Dark Reactions Project: Machine learning-assisted materials discovery using failed experiments
    Schrier, Joshua
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 252
  • [8] Machine learning and design of experiments with an application to product innovation in the chemical industry
    Arboretti, Rosa
    Ceccato, Riccardo
    Pegoraro, Luca
    Salmaso, Luigi
    Housmekerides, Chris
    Spadoni, Luca
    Pierangelo, Elisabetta
    Quaggia, Sara
    Tveit, Catherine
    Vianello, Sebastiano
    JOURNAL OF APPLIED STATISTICS, 2022, 49 (10) : 2674 - 2699
  • [9] Predicting the chemical reactivity of organic materials using a machine-learning approach
    Lee, Byungju
    Yoo, Jaekyun
    Kang, Kisuk
    CHEMICAL SCIENCE, 2020, 11 (30) : 7813 - 7822
  • [10] The Importance of Reaction Energy in Predicting Chemical Reaction Barriers with Machine Learning Models
    Lalith, Nithin
    Singh, Aayush R.
    Gauthier, Joseph A.
    CHEMPHYSCHEM, 2024, 25 (13)