Large-Scale Glass-Transition Temperature Prediction with an Equivariant Neural Network for Screening Polymers

被引:0
|
作者
Long, Zheng [1 ]
Lu, Hongmei [1 ]
Zhang, Zhimin [1 ]
机构
[1] Cent South Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China
来源
ACS OMEGA | 2024年 / 9卷 / 05期
基金
中国国家自然科学基金;
关键词
EXAMPLE; DESIGN; QSPR;
D O I
10.1021/acsomega.3c06843
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The practically infinite chemical and morphological space of polymers makes them pervasive with applications in materials science but challenges the rational discovery of new materials with favorable properties. Polymer informatics aims to accelerate materials design through property prediction and large-scale virtual screening. In this study, a new method (Lieconv-Tg) has been developed to predict glass-transition temperature (Tg) values from repeating units of polymers based on Lieconv, which is equivariant with transformations from any specified Lie group. The introduction of equivariance allows the prediction of molecular properties from their 3D structures, independent of orientation and position. A total of 27,659 homopolymers with Tg values were collected from PolyInfo, and a standard data set containing 7166 polymers (named data set_Tg) was created for training a robust Lieconv-Tg model. Using the 3D coordinates as input, Lieconv-Tg performs better than Edge-Conditioned Convolution (ECC), and the mean absolute error (MAE) is significantly reduced by similar to 6 from similar to 30 to similar to 24 on both the validation set and the test set, and the R-2 value for both the validation set and the test set can reach 0.90. Lieconv-Tg is thus used to screen promising candidates from a benchmark database named PI1M with 995,800 generated polymers. However, there are some implausible repeating units in PI1M. To get more reasonable candidates from PI1M, a new filtering method has been accomplished by utilizing Morgan fingerprints at the polymerization points (MF@PP) of repeating units in data set_Tg. The combination of a standard data set, Lieconv-Tg, and a more reasonable screening strategy provides new directions in materials design for polymers.
引用
收藏
页码:5452 / 5462
页数:11
相关论文
共 50 条