Analyzing machine learning models to accelerate generation of fundamental materials insights

被引:65
|
作者
Umehara, Mitsutaro [1 ,2 ]
Stein, Helge S. [1 ]
Guevarra, Dan [1 ]
Newhouse, Paul F. [1 ]
Boyd, David A. [1 ]
Gregoire, John M. [1 ]
机构
[1] CALTECH, Joint Ctr Artificial Photosynth, Pasadena, CA 91125 USA
[2] Toyota Res Inst North Amer, Future Mobil Res Dept, Ann Arbor, MI 48105 USA
关键词
DEEP NEURAL-NETWORKS; DISCOVERY; IDENTIFICATION; BEHAVIOR;
D O I
10.1038/s41524-019-0172-5
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Machine learning for materials science envisions the acceleration of basic science research through automated identification of key data relationships to augment human interpretation and gain scientific understanding. A primary role of scientists is extraction of fundamental knowledge from data, and we demonstrate that this extraction can be accelerated using neural networks via analysis of the trained data model itself rather than its application as a prediction tool. Convolutional neural networks excel at modeling complex data relationships in multi-dimensional parameter spaces, such as that mapped by a combinatorial materials science experiment. Measuring a performance metric in a given materials space provides direct information about (locally) optimal materials but not the underlying materials science that gives rise to the variation in performance. By building a model that predicts performance (in this case photoelectrochemical power generation of a solar fuels photoanode) from materials parameters (in this case composition and Raman signal), subsequent analysis of gradients in the trained model reveals key data relationships that are not readily identified by human inspection or traditional statistical analyses. Human interpretation of these key relationships produces the desired fundamental understanding, demonstrating a framework in which machine learning accelerates data interpretation by leveraging the expertize of the human scientist. We also demonstrate the use of neural network gradient analysis to automate prediction of the directions in parameter space, such as the addition of specific alloying elements, that may increase performance by moving beyond the confines of existing data.
引用
收藏
页数:9
相关论文
共 50 条
  • [11] Machine learning and statistical models for analyzing multilevel patent data
    Qi, Sunyun
    Zhang, Yu
    Gu, Hua
    Zhu, Fei
    Gao, Meiying
    Liang, Hongxiao
    Zhang, Qifeng
    Gao, Yanchao
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [12] Machine learning and statistical models for analyzing multilevel patent data
    Sunyun Qi
    Yu Zhang
    Hua Gu
    Fei Zhu
    Meiying Gao
    Hongxiao Liang
    Qifeng Zhang
    Yanchao Gao
    Scientific Reports, 13
  • [13] Machine-learning models for analyzing TSOM images of nanostructures
    Qu, Yufu
    Hao, Jialin
    Peng, Renju
    OPTICS EXPRESS, 2019, 27 (23) : 33979 - 33999
  • [14] Analyzing Effective Factors of Online Learning Performance by Interpreting Machine Learning Models
    Xiao, Wen
    Hu, Juan
    IEEE ACCESS, 2023, 11 : 132435 - 132447
  • [15] Health Equity Insights from Machine Learning Models
    Dmitry Tumin
    Journal of General Internal Medicine, 2021, 36 : 2475 - 2475
  • [16] Health Equity Insights from Machine Learning Models
    Tumin, Dmitry
    JOURNAL OF GENERAL INTERNAL MEDICINE, 2021, 36 (08) : 2475 - 2475
  • [17] Coreset selection can accelerate quantum machine learning models with provable generalization
    Huang, Yiming
    Yuan, Xiao
    Wang, Huiyuan
    Du, Yuxuan
    PHYSICAL REVIEW APPLIED, 2024, 22 (01):
  • [18] Machine learning models to accelerate the design of polymeric long-acting injectables
    Bannigan, Pauric
    Bao, Zeqing
    Hickman, Riley J.
    Aldeghi, Matteo
    Hase, Florian
    Aspuru-Guzik, Alan
    Allen, Christine
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [19] Machine learning models to accelerate the design of polymeric long-acting injectables
    Pauric Bannigan
    Zeqing Bao
    Riley J. Hickman
    Matteo Aldeghi
    Florian Häse
    Alán Aspuru-Guzik
    Christine Allen
    Nature Communications, 14 (1)
  • [20] A Machine-Learning-Assisted Crystalline Structure Prediction Framework To Accelerate Materials Discovery
    An, Ran
    Xie, Congwei
    Chu, Dongdong
    Li, Fuming
    Pan, Shilie
    Yang, Zhihua
    ACS APPLIED MATERIALS & INTERFACES, 2024, 16 (28) : 36658 - 36666