Uncertainty-Quantified Hybrid Machine Learning/Density Functional Theory High Throughput Screening Method for Crystals

被引:36
|
作者
Noh, Juhwan [1 ]
Gu, Geun Ho [1 ]
Kim, Sungwon [1 ]
Jung, Yousung [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol KAIST, Dept Chem & Biomol Engn, Daejeon 34141, South Korea
[2] Korea Adv Inst Sci & Technol KAIST, Saudi Aramco KAIST CO2 Management Ctr, Daejeon 34141, South Korea
关键词
NEURAL-NETWORKS; PREDICTION; STABILITY;
D O I
10.1021/acs.jcim.0c00003
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Computational high throughput screening (HTS) has emerged as a significant tool in material science to accelerate the discovery of new materials with target properties in recent years. However, despite many successful cases in which HTS led to the novel discovery, currently, the major bottleneck in HTS is a large computational cost of density functional theory (DFT) calculations that scale cubically with system size, limiting the chemical space that can be explored. The present work aims at addressing this computational burden of HTS by presenting a machine learning (ML) framework that can efficiently explore the chemical space. Our model is built upon an existing crystal graph convolutional neural network (CGCNN) to obtain formation energy of a crystal structure but is modified to allow uncertainty quantification for each prediction using the hyperbolic tangent activation function and dropout algorithm (CGCNN-HD). The uncertainty quantification is particularly important since typical usage of CGCNN (due to the lack of gradient implementation) does not involve structural relaxation which could cause substantial prediction errors. The proposed method is benchmarked against an existing application that identified promising photoanode material among the >7,000 hypothetical Mg-Mn-O ternary compounds using all DFT-HTS. In our approach, we perform the approximate HTS using CGCNN-HD and refine the results using full DFT for those selected (denoted as ML/DFT-HTS). The proposed hybrid model reduces the required DFT calculations by a factor of >50 compared to the previous DFT-HTS in making the same discovery of Mg2MnO4, experimentally validated new photoanode material. Further analysis demonstrates that the addition of HD components with uncertainty measures in the CGCNN-HD model increased the discoverability of promising materials relative to all DFT-HTS from 30% (CGCNN) to 68% (CGCNN-HD). The present ML/DFT-HTS with uncertainty quantification can thus be a fast alternative to DFT-HTS for efficient exploration of the vast chemical space.
引用
收藏
页码:1996 / 2003
页数:8
相关论文
共 50 条
  • [21] Machine-Learning-Driven High-Throughput Screening for High-Energy Density and Stable NASICON Cathodes
    Jeong, Jinyoung
    Kim, Juo
    Sun, Jiwon
    Min, Kyoungmin
    ACS APPLIED MATERIALS & INTERFACES, 2024, 16 (19) : 24431 - 24441
  • [22] High-throughput screening of bimetallic catalysts enabled by machine learning
    Li, Zheng
    Wang, Siwen
    Chin, Wei Shan
    Achenie, Luke E.
    Xin, Hongliang
    JOURNAL OF MATERIALS CHEMISTRY A, 2017, 5 (46) : 24131 - 24138
  • [23] Application of machine learning for high-throughput tumor marker screening
    Fu, Xingxing
    Ma, Wanting
    Zuo, Qi
    Qi, Yanfei
    Zhang, Shubiao
    Zhao, Yinan
    LIFE SCIENCES, 2024, 348
  • [24] Machine learning the derivative discontinuity of density-functional theory
    Gedeon, Johannes
    Schmidt, Jonathan
    Hodgson, Matthew J. P.
    Wetherell, Jack
    Benavides-Riveros, Carlos L.
    Marques, Miguel A. L.
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (01):
  • [25] Density functional theory and material databases in the era of machine learning
    Kashyap, Arti
    APPLIED PHYSICS LETTERS, 2024, 125 (22)
  • [26] Hole Localization in Molecular Crystals from Hybrid Density Functional Theory
    Sai, Na
    Barbara, Paul F.
    Leung, Kevin
    PHYSICAL REVIEW LETTERS, 2011, 106 (22)
  • [27] Accelerated screening of functional atomic impurities in halide perovskites using high-throughput computations and machine learning
    Arun Mannodi-Kanakkithodi
    Maria K. Y. Chan
    Journal of Materials Science, 2022, 57 : 10736 - 10754
  • [28] Accelerated screening of functional atomic impurities in halide perovskites using high-throughput computations and machine learning
    Mannodi-Kanakkithodi, Arun
    Chan, Maria K. Y.
    JOURNAL OF MATERIALS SCIENCE, 2022, 57 (23) : 10736 - 10754
  • [29] Discovery of multi-functional polyimides through high-throughput screening using explainable machine learning
    Tao, Lei
    He, Jinlong
    Munyaneza, Nuwayo Eric
    Varshney, Vikas
    Chen, Wei
    Liu, Guoliang
    Li, Ying
    CHEMICAL ENGINEERING JOURNAL, 2023, 465
  • [30] High-throughput hybrid-functional DFT calculations of bandgaps and formation energies and multifidelity learning with uncertainty quantification
    Liu, Mohan
    Gopakumar, Abhijith
    Hegde, Vinay Ishwar
    He, Jiangang
    Wolverton, Chris
    PHYSICAL REVIEW MATERIALS, 2024, 8 (04):