Analyzing machine learning models to accelerate generation of fundamental materials insights

被引:65
|
作者
Umehara, Mitsutaro [1 ,2 ]
Stein, Helge S. [1 ]
Guevarra, Dan [1 ]
Newhouse, Paul F. [1 ]
Boyd, David A. [1 ]
Gregoire, John M. [1 ]
机构
[1] CALTECH, Joint Ctr Artificial Photosynth, Pasadena, CA 91125 USA
[2] Toyota Res Inst North Amer, Future Mobil Res Dept, Ann Arbor, MI 48105 USA
关键词
DEEP NEURAL-NETWORKS; DISCOVERY; IDENTIFICATION; BEHAVIOR;
D O I
10.1038/s41524-019-0172-5
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Machine learning for materials science envisions the acceleration of basic science research through automated identification of key data relationships to augment human interpretation and gain scientific understanding. A primary role of scientists is extraction of fundamental knowledge from data, and we demonstrate that this extraction can be accelerated using neural networks via analysis of the trained data model itself rather than its application as a prediction tool. Convolutional neural networks excel at modeling complex data relationships in multi-dimensional parameter spaces, such as that mapped by a combinatorial materials science experiment. Measuring a performance metric in a given materials space provides direct information about (locally) optimal materials but not the underlying materials science that gives rise to the variation in performance. By building a model that predicts performance (in this case photoelectrochemical power generation of a solar fuels photoanode) from materials parameters (in this case composition and Raman signal), subsequent analysis of gradients in the trained model reveals key data relationships that are not readily identified by human inspection or traditional statistical analyses. Human interpretation of these key relationships produces the desired fundamental understanding, demonstrating a framework in which machine learning accelerates data interpretation by leveraging the expertize of the human scientist. We also demonstrate the use of neural network gradient analysis to automate prediction of the directions in parameter space, such as the addition of specific alloying elements, that may increase performance by moving beyond the confines of existing data.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Analyzing machine learning models to accelerate generation of fundamental materials insights
    Mitsutaro Umehara
    Helge S. Stein
    Dan Guevarra
    Paul F. Newhouse
    David A. Boyd
    John M. Gregoire
    npj Computational Materials, 5
  • [2] Applying machine learning to accelerate new materials development
    Wu Wei
    Sun Qiang
    SCIENTIA SINICA-PHYSICA MECHANICA & ASTRONOMICA, 2018, 48 (10)
  • [3] Structural Embedding Methods for Machine Learning Models Accelerate Research on Stacked 2D Materials
    Chen, Xinyu
    Ru, Guoliang
    Qi, Weihong
    JOURNAL OF PHYSICAL CHEMISTRY C, 2024, 128 (37): : 15512 - 15521
  • [4] Machine learning models for predictive materials science from fundamental physics: An application to titanium and zirconium
    Nitol, Mashroor S.
    Dickel, Doyl E.
    Barrett, Christopher D.
    ACTA MATERIALIA, 2022, 224
  • [5] Predicting and analyzing stability in perovskite solar cells: Insights from machine learning models and SHAP analysis
    Chen, Jiacheng
    Zhan, Yaohui
    Yang, Zhenhai
    Zang, Yue
    Yan, Wensheng
    Li, Xiaofeng
    MATERIALS TODAY ENERGY, 2025, 48
  • [6] Machine Learning Based Approaches to Accelerate Energy Materials Discovery and Optimization
    Krishnamurthy, Dilip
    Weiland, Hasso
    Farimani, Amir Barati
    Antono, Erin
    Green, Josh
    Viswanathan, Venkatasubramanian
    ACS ENERGY LETTERS, 2019, 4 (01) : 187 - 191
  • [7] Learning and Analyzing Generation Order for Undirected Sequence Models
    Jiang, Yichen
    Bansal, Mohit
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3513 - 3523
  • [8] Machine learning applied to accelerate energy consumption models in computing simulators
    Castane, Gabriel G.
    Mateos, Alejandro Calderon
    SIMULATION MODELLING PRACTICE AND THEORY, 2020, 102 (102)
  • [9] Dual-event machine learning models to accelerate drug discovery
    Ekins, Sean
    Reynolds, Robert C.
    Kim, Hiyun
    Koo, Mi-Sun
    Ekonomidis, Marilyn
    Talaue, Meliza
    Paget, Steve
    Woolhiser, Lisa
    Lenaerts, Anne J.
    Bunin, Barry A.
    Connell, Nancy
    Freundlich, Joel S.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 245
  • [10] Generation of human computational models with machine learning
    Campuzano, Francisco
    Garcia-Valverde, Teresa
    Botia, Juan A.
    Serrano, Emilio
    INFORMATION SCIENCES, 2015, 293 : 97 - 114