Machine-learned metrics for predicting the likelihood of success in materials discovery

被引:0
|
作者
Yoolhee Kim
Edward Kim
Erin Antono
Bryce Meredig
Julia Ling
机构
[1] Citrine Informatics,
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Materials discovery is often compared to the challenge of finding a needle in a haystack. While much work has focused on accurately predicting the properties of candidate materials with machine learning (ML), which amounts to evaluating whether a given candidate is a piece of straw or a needle, less attention has been paid to a critical question: are we searching in the right haystack? We refer to the haystack as the design space for a particular materials discovery problem (i.e., the set of possible candidate materials to synthesize), and thus frame this question as one of design space selection. In this paper, we introduce two metrics, the predicted fraction of improved candidates (PFIC), and the cumulative maximum likelihood of improvement (CMLI), which we demonstrate can identify discovery-rich and discovery-poor design spaces, respectively. A combined classification system, composed of the CMLI and PFIC metrics, is then used to identify optimal design spaces with high precision, and thus show the potential to significantly accelerate ML-driven materials discovery.
引用
收藏
相关论文
共 50 条
  • [1] Machine-learned metrics for predicting the likelihood of success in materials discovery
    Kim, Yoolhee
    Kim, Edward
    Antono, Erin
    Meredig, Bryce
    Ling, Julia
    [J]. NPJ COMPUTATIONAL MATERIALS, 2020, 6 (01)
  • [2] INSIGHTS FROM MACHINE-LEARNED DIET SUCCESS PREDICTION
    Weber, Ingmar
    Achananuparp, Palakorn
    [J]. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2016, 2016, : 540 - 551
  • [3] Predicting trajectory behaviour via machine-learned invariant manifolds
    Krajnak, Vladimir
    Naik, Shibabrat
    Wiggins, Stephen
    [J]. CHEMICAL PHYSICS LETTERS, 2022, 789
  • [4] Machine-learned and codified synthesis parameters of oxide materials
    Edward Kim
    Kevin Huang
    Alex Tomala
    Sara Matthews
    Emma Strubell
    Adam Saunders
    Andrew McCallum
    Elsa Olivetti
    [J]. Scientific Data, 4
  • [5] Predicting subcellular localization of proteins using machine-learned classifiers
    Lu, Z
    Szafron, D
    Greiner, R
    Lu, P
    Wishart, DS
    Poulin, B
    Anvik, J
    Macdonell, C
    Eisner, R
    [J]. BIOINFORMATICS, 2004, 20 (04) : 547 - 556
  • [6] Topogivity: A Machine-Learned Chemical Rule for Discovering Topological Materials
    Ma, Andrew
    Zhang, Yang
    Christensen, Thomas
    Po, Hoi Chu
    Jing, Li
    Fu, Liang
    Soljacic, Marin
    [J]. NANO LETTERS, 2023, 23 (03) : 772 - 778
  • [7] The machine-learned radii of atoms
    Nikolaienko, Tymofii Yu.
    Bulavin, Leonid A.
    [J]. COMPUTATIONAL AND THEORETICAL CHEMISTRY, 2021, 1204
  • [8] Machine-learned multi-system surrogate models for materials prediction
    Nyshadham, Chandramouli
    Rupp, Matthias
    Bekker, Brayden
    Shapeev, Alexander, V
    Mueller, Tim
    Rosenbrock, Conrad W.
    Csanyi, Gabor
    Wingate, David W.
    Hart, Gus L. W.
    [J]. NPJ COMPUTATIONAL MATERIALS, 2019, 5 (1)
  • [9] Data Descriptor: Machine-learned and codified synthesis parameters of oxide materials
    Kim, Edward
    Huang, Kevin
    Tomala, Alex
    Matthews, Sara
    Strubell, Emma
    Saunders, Adam
    McCallum, Andrew
    Olivetti, Elsa
    [J]. SCIENTIFIC DATA, 2017, 4 : 170127
  • [10] Machine-learned multi-system surrogate models for materials prediction
    Chandramouli Nyshadham
    Matthias Rupp
    Brayden Bekker
    Alexander V. Shapeev
    Tim Mueller
    Conrad W. Rosenbrock
    Gábor Csányi
    David W. Wingate
    Gus L. W. Hart
    [J]. npj Computational Materials, 5