Machine Learning for Chemical Reactivity: The Importance of Failed Experiments

Cited by: 82
Authors
Strieth-Kalthoff, Felix [1]
Sandfort, Frederik [1]
Kuhnemund, Marius [2]
Schaefer, Felix R. [1]
Kuchen, Herbert [2]
Glorius, Frank [1]
Affiliations
[1] Westfalische Wilhelms Univ Munster, Organ Chem Inst, Corrensstr 40, D-48149 Munster, Germany
[2] Westfalische Wilhelms Univ Munster, Dept Informat Syst, Leonardo Campus 3, D-48149 Munster, Germany
Keywords
Cross-Coupling; Data Bias; Machine Learning; Reaction Data; Yield Prediction; NEURAL-NETWORKS; PREDICTION; BIAS
DOI
10.1002/anie.202204647
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Assessing the outcomes of chemical reactions in a quantitative fashion has been a cornerstone across all synthetic disciplines. Classically approached through empirical optimization, data-driven modelling bears an enormous potential to streamline this process. However, such predictive models require significant quantities of high-quality data, the availability of which is limited: Main reasons for this include experimental errors and, importantly, human biases regarding experiment selection and result reporting. In a series of case studies, we investigate the impact of these biases for drawing general conclusions from chemical reaction data, revealing the utmost importance of "negative" examples. Eventually, case studies into data expansion approaches showcase directions to circumvent these limitations and demonstrate perspectives towards a long-term data quality enhancement in chemistry.
Pages: 7