Interpretable, extensible linear and symbolic regression models for charge density prediction using a hierarchy of many-body correlation descriptors

被引:0
|
作者
Iyer, Gopal R. [1 ,2 ]
Kumar, Shashikant [1 ,3 ]
Borda, Edgar Josue Landinez [1 ]
Sadigh, Babak [1 ]
Hamel, Sebastien [1 ]
Bulatov, Vasily [4 ]
Lordi, Vincenzo [4 ]
Samanta, Amit [1 ]
机构
[1] Lawrence Livermore Natl Lab, Phys Div, Livermore, CA 94550 USA
[2] Brown Univ, Providence, RI 02906 USA
[3] Georgia Inst Technol, Coll Engn, Atlanta, GA 30332 USA
[4] Lawrence Livermore Natl Lab, Mat Sci Div, Livermore, CA 94550 USA
关键词
Electron density prediction; Many body expansion; Symbolic regression; TOTAL-ENERGY CALCULATIONS; ELECTRON-DENSITIES;
D O I
10.1016/j.commatsci.2024.113433
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Density functional theory (DFT) is routinely used to make electronic structure predictions for high-throughput screening of materials and molecules for technologically relevant areas, like the identification of better catalysts, electronic materials, and drug discovery. However, the DFT formalism is limited by (a) its poor (quadratic-to-quartic) scaling, and (b) the need to perform repeated eigenvalue computations of the electronic Hamiltonian as part of its self-consistent field (SCF) iteration procedure to obtain the converged ground state electron density, p ( r ). Approaches that directly predict p ( r ) of a structure with high accuracy can accelerate conventional SCF calculations and can also be used in linearly scaling methods such as orbital-free DFT. To this end, we present a procedure to predict the ground state electron density of molecular and periodic threedimensional systems directly from the atomic structure with a particular emphasis on physical interpretability. In our framework, p (r) r ) is modeled using many-body correlation descriptors that accurately capture the effects of local atomic arrangements in the neighborhood of a grid point. Our use of a linear regression scheme to fit to charge density data enables transparent analysis of the relative contributions of various types of local atomic correlations. By systematically including increasingly complex correlations, our model is shown to accurately predict p ( r ) for a variety of chemically and electronically diverse systems - amorphous Ge, Al(001) slab, crystalline Ga2O3, 2 O 3 , molecular benzene, and polyethylene. We then demonstrate a symbolic regression-based protocol to construct easily computable, interpretable features from lower-order correlations that significantly improves our electron density predictions with effectively no increase in the computational cost.
引用
收藏
页数:17
相关论文
empty
未找到相关数据