Using interpretable machine learning techniques to analyze the thermal decomposition behavior of high-energy compounds for dataset partitioning

被引:0
|
作者
Huang, Chaoyi [1 ]
Yang, Chunming [1 ]
He, Yingjie [1 ]
机构
[1] Southwest Univ Sci & Technol, Mianyang, Sichuan, Peoples R China
关键词
Feature Engineering; D-MPNN; XGBoost; Dataset Partitioning; Information Theory;
D O I
10.1145/3670105.3670131
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In recent years, machine learning has shown great potential in the field of molecular research. However, for energy-containing molecules with small datasets, the difficulty of model prediction is high, and improving model prediction performance is a challenge. This paper aims to analyze the thermal decomposition behavior of energetic compounds using interpretable machine learning techniques, build a dataset containing nearly 900 energetic compound data, and employ interpretable machine learning methods to construct a set of molecular descriptors. After reasonable partitioning of the dataset, directed message passing neural network (D-MPNN) is used for training and performance evaluation. The method in this paper can improve model training effectiveness, with a decrease of approximately 15% in MAE and RMSE, and an increase of about 8% in R-2. This has important value for model training on small-scale datasets.
引用
收藏
页码:155 / 160
页数:6
相关论文
共 23 条