Accelerating CALYPSO structure prediction with machine learning

被引:0
|
作者
Wei X.-H. [1 ]
Zhou C.-B. [1 ]
Shen X.-X. [1 ]
Liu Y.-Y. [1 ]
Tong Q.-C. [2 ]
机构
[1] College of Computer Science & Technology, Jilin University, Changchun
[2] State Key Lab of Superhard Materials, Jilin University, Changchun
关键词
Computer application; Confidence; Energy calculation; Root mean square error; Structure prediction;
D O I
10.13229/j.cnki.jdxbgxb20191070
中图分类号
学科分类号
摘要
The potential of accelerating CALYPSO structure prediction by replacing DFT methods with machine learning was studied. The performance in predicting the potential energy of boron clusters was evaluated with five machine learning methods. Firstly, the original data was represented as structural information metrics with Coulomb matrix. Then the eigenvalue vector pair of the matrix was extracted and used as the input of machine learning algorithms to training model, five algorithms were trained and tested using the same dataset. Also, factors affecting the performance were explored. Finally, a method of comparing the similarity of predicted values and ground truth was proposed based on the characters of potential energy surface (PES), and a confidence model was constructed to validate the best kernel ridge regression (KRR) method. It is suggested that PES fitted by KRR is similar with the PES by DFT, and the confidence of the algorithm is closed to 90% while permissible error is 1 kcal/mol. The result of time test to KRR shows that the method's time complexity is O(n), which is improved by 1 to 2 orders of magnitude compared with DFT methods. © 2021, Jilin University Press. All right reserved.
引用
收藏
页码:667 / 676
页数:9
相关论文
共 24 条
  • [1] Hansen K, Montavon G, Biegler F, Et al., Assessment and validation of machine learning methods for predicting molecular atomization energies, Journal of Chemical Theory and Computation, 9, 8, pp. 3404-3419, (2013)
  • [2] Doye J P, Wales D J., Structural consequences of the range of the interatomic potential a menagerie of clusters, Journal of the Chemical Society, Faraday Transactions, 93, 24, pp. 4233-4243, (1997)
  • [3] Salamat A, Garbarino G, Dewaele A, Et al., Dense close-packed phase of tin above 157 GPa observed experimentally via angle-dispersive x-ray diffraction, Physical Review B, 84, 14, (2011)
  • [4] Wang Y C, Lv J, Zhu L, Et al., Crystal structure prediction via particle-swarm optimization, Physical Review B, 82, 9, (2010)
  • [5] Wang Y C, Lv J, Zhu L, Et al., CALYPSO: A method for crystal structure prediction, Computer Physics Communications, 183, 10, pp. 2063-2070, (2012)
  • [6] Lv J, Wang Y C, Zhu L, Et al., Particle-swarm structure prediction on clusters, The Journal of Chemical Physics, 137, 8, (2012)
  • [7] Tadmor E B, Miller R E., Modeling Materials: Continuum, Atomistic and Multiscale Techniques, (2011)
  • [8] Zhao Dong, Zang Xue-bai, Zhao Hong-wei, Random forest prediction method based on optimization of fruit fly, Journal of Jilin University (Engineering and Technology Edition), 47, 2, pp. 609-614, (2017)
  • [9] Tong Q C, Xue L T, Lv J, Et al., Accelerating CALYPSO structure prediction by data-driven learning of potential energy surface, Faraday Discussions, 211, pp. 31-43, (2018)
  • [10] Zhang Yao-long, Zhou Xue-yao, Jiang Bin, Accelerating the construction of neural network potential energy surfaces: a fast hybrid training algorithm, Chinese Journal of Chemical Physics, 30, 6, pp. 727-734, (2017)