Machine Learning for Chemical Reactivity: The Importance of Failed Experiments

Cited by: 82
Authors
Strieth-Kalthoff, Felix [1]
Sandfort, Frederik [1]
Kuhnemund, Marius [2]
Schaefer, Felix R. [1]
Kuchen, Herbert [2]
Glorius, Frank [1]
Affiliations
[1] Westfalische Wilhelms Univ Munster, Organ Chem Inst, Corrensstr 40, D-48149 Munster, Germany
[2] Westfalische Wilhelms Univ Munster, Dept Informat Syst, Leonardo Campus 3, D-48149 Munster, Germany
Keywords
Cross-Coupling; Data Bias; Machine Learning; Reaction Data; Yield Prediction; NEURAL-NETWORKS; PREDICTION; BIAS;
DOI
10.1002/anie.202204647
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Assessing the outcomes of chemical reactions quantitatively has been a cornerstone of all synthetic disciplines. This task has classically been approached through empirical optimization, and data-driven modelling holds enormous potential to streamline it. However, such predictive models require large quantities of high-quality data, the availability of which is limited. The main reasons for this include experimental errors and, importantly, human biases in experiment selection and result reporting. In a series of case studies, we investigate the impact of these biases on drawing general conclusions from chemical reaction data, revealing the utmost importance of "negative" examples. Finally, case studies of data expansion approaches showcase directions for circumventing these limitations and demonstrate perspectives towards a long-term enhancement of data quality in chemistry.
Pages: 7
Related papers
50 in total
  • [31] Organic reactivity from mechanism to machine learning
    Kjell Jorner
    Anna Tomberg
    Christoph Bauer
    Christian Sköld
    Per-Ola Norrby
    Nature Reviews Chemistry, 2021, 5 : 240 - 255
  • [32] Tunability: Importance of Hyperparameters of Machine Learning Algorithms
    Probst, Philipp
    Boulesteix, Anne-Laure
    Bischl, Bernd
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [33] An Interoperable Service for the Provenance of Machine Learning Experiments
    Duarte, Julio Cesar
    Reis Cavalcanti, Maria Claudia
    Costa, Igor de Souza
    Esteves, Diego
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 132 - 138
  • [34] Tunability: Importance of hyperparameters of machine learning algorithms
    Probst, Philipp
    Boulesteix, Anne-Laure
    Bischl, Bernd
    Journal of Machine Learning Research, 2019, 20
  • [35] Quantum Machine Learning Implementations: Proposals and Experiments
    Lamata, Lucas
    ADVANCED QUANTUM TECHNOLOGIES, 2023, 6 (07)
  • [36] Learning Machine Diagnostics Through Laboratory Experiments
    Zamorano, M.
    Gomez, M. J.
    Castejon, C.
    Garcia-Prada, J. C.
    NEW TRENDS IN EDUCATIONAL ACTIVITY IN THE FIELD OF MECHANISM AND MACHINE THEORY 2014-2017, 2019, 64 : 57 - 63
  • [37] Flambe: A Customizable Framework for Machine Learning Experiments
    Wohlwend, Jeremy
    Matthews, Nicholas
    Itzcovich, Ivan
    PROCEEDINGS OF THE 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, (ACL 2019), 2019, : 181 - 188
  • [38] Leveraging Business Transformation with Machine Learning Experiments
    Mattos, David Issa
    Bosch, Jan
    Olsson, Helena Holmstrom
    SOFTWARE BUSINESS (ICSOB 2019), 2019, 370 : 183 - 191
  • [39] Preliminary Experiments on the Performance of Machine Learning Models
    Banda, Misheck
    Ngassam, Ernest Ketcha
    Mnkandla, Ernest
    2022 IST-AFRICA CONFERENCE, 2022,
  • [40] Experiments on Code Clone Detection and Machine Learning
    Schaefer, Andre
    Amme, Wolfram
    Heinze, Thomas S.
    2022 IEEE 16TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC 2022), 2022, : 46 - 52