Efficient structure-informed featurization and property prediction of ordered, dilute, and random atomic structures

被引:0
|
作者
Krajewski, Adam M. [1 ]
Siegel, Jonathan W. [2 ]
Liu, Zi-Kui [1 ]
机构
[1] Penn State Univ, Dept Mat Sci & Engn, State Coll, PA 16802 USA
[2] Texas A&M Univ, Dept Math, College Stn, TX USA
基金
美国国家科学基金会;
关键词
Atomistic; Machine learning; Structure-informed; Featurization; Throughput; Symmetry-equivalent; Voronoi; Efficient; Tessellation; Dilute; Defect; Solid solution; !text type='Python']Python[!/text; CRYSTALLOGRAPHY OPEN DATABASE; OPEN-ACCESS COLLECTION; CRYSTAL; FRAMEWORK; LIBRARY; CALPHAD; TOOL;
D O I
10.1016/j.commatsci.2024.113495
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Structure-informed materials informatics is a rapidly evolving discipline of materials science relying on the featurization of atomic structures or configurations to construct vector, voxel, graph, graphlet, and other representations useful for machine learning prediction of properties, fingerprinting, and generative design. This work discusses how current featurizers typically perform redundant calculations and how their efficiency could be improved by considering (1) fundamentals of crystallographic (orbits) equivalency to optimize ordered structures and (2) representation-dependent equivalency to optimize dilute, doped, and defect structures with broken symmetry. It also discusses and contrasts ways of (3) approximating random solid solutions occupying arbitrary lattices under such representations. Efficiency improvements discussed in this work were implemented within pySIPFENN or python toolset for Structure-Informed Property and Feature Engineering with Neural Networks developed by authors since 2019 and shown to increase performance from 2 to 10 times for typical inputs. Throughout this work, the authors explicitly discuss how these advances can be applied to different kinds of similar tools in the community.
引用
收藏
页数:11
相关论文
共 18 条
  • [1] Extended study on atomic featurization in graph neural networks for molecular property prediction
    Wojtuch, Agnieszka
    Danel, Tomasz
    Podlewska, Sabina
    Maziarka, Lukasz
    JOURNAL OF CHEMINFORMATICS, 2023, 15 (01)
  • [2] Extended study on atomic featurization in graph neural networks for molecular property prediction
    Agnieszka Wojtuch
    Tomasz Danel
    Sabina Podlewska
    Łukasz Maziarka
    Journal of Cheminformatics, 15
  • [3] Variant effect prediction using structure-informed protein language models
    Sun, Yuanfei
    Shen, Yang
    BIOPHYSICAL JOURNAL, 2023, 122 (03) : 473A - 473A
  • [4] Extensible Structure-Informed Prediction of Formation Energy with improved accuracy and usability employing neural networks
    Krajewski, Adam M.
    Siegel, Jonathan W.
    Xu, Jinchao
    Liu, Zi-Kui
    COMPUTATIONAL MATERIALS SCIENCE, 2022, 208
  • [5] Structure-Informed Graph Learning of Networked Dependencies for Online Prediction of Power System Transient Dynamics
    Zhao, Tianqiao
    Yue, Meng
    Wang, Jianhui
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2022, 37 (06) : 4885 - 4895
  • [7] Prediction-based random replacement method: an application for efficient polynomial model building in quantitative structure activity/property relationship studies
    Bahl, Jyotsna
    Das, Shyam S.
    Koneti, Geervani
    Narayanan, Ramamurthi
    DRUG METABOLISM REVIEWS, 2016, 48 : 66 - 67
  • [8] LM-GVP: an extensible sequence and structure informed deep learning framework for protein property prediction
    Zichen Wang
    Steven A. Combs
    Ryan Brand
    Miguel Romero Calvo
    Panpan Xu
    George Price
    Nataliya Golovach
    Emmanuel O. Salawu
    Colby J. Wise
    Sri Priya Ponnapalli
    Peter M. Clark
    Scientific Reports, 12
  • [9] LM-GVP: an extensible sequence and structure informed deep learning framework for protein property prediction
    Wang, Zichen
    Combs, Steven A.
    Brand, Ryan
    Calvo, Miguel Romero
    Xu, Panpan
    Price, George
    Golovach, Nataliya
    Salawu, Emmanuel O.
    Wise, Colby J.
    Ponnapalli, Sri Priya
    Clark, Peter M.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [10] More Efficient Prediction for Ordinary Kriging to Solve a Problem in the Structure of Some Random Fields
    Saber, Mohammad Mehdi
    Aldallal, Ramy Abdelhamid
    COMPLEXITY, 2022, 2022