Training machine learning potentials for reactive systems: A Colab tutorial on basic models

被引:3
|
作者
Pan, Xiaoliang [1 ]
Snyder, Ryan [2 ]
Wang, Jia-Ning [3 ]
Lander, Chance [1 ]
Wickizer, Carly [1 ]
Van, Richard [1 ,4 ]
Chesney, Andrew [1 ]
Xue, Yuanfei [3 ]
Mao, Yuezhi [5 ]
Mei, Ye [3 ,6 ,7 ]
Pu, Jingzhi [2 ]
Shao, Yihan [1 ]
机构
[1] Univ Oklahoma, Dept Chem & Biochem, Norman, OK 73019 USA
[2] Indiana Univ Purdue Univ, Dept Chem & Chem Biol, Indianapolis, IN 46202 USA
[3] East China Normal Univ, Sch Phys & Elect Sci, State Key Lab Precis Spect, Shanghai, Peoples R China
[4] Natl Heart Lung & Blood Inst, Lab Computat Biol, NIH, Bethesda, MD USA
[5] San Diego State Univ, Dept Chem & Biochem, San Diego, CA 92182 USA
[6] NYU, NYU ECNU Ctr Computat Chem, Shanghai, Peoples R China
[7] Shanxi Univ, Collaborat Innovat Ctr Extreme Opt, Taiyuan, Shanxi, Peoples R China
基金
美国国家卫生研究院; 美国国家科学基金会; 中国国家自然科学基金;
关键词
Gaussian process regression; machine learning potential; neural network; tutorial; QUANTUM MECHANICS/MOLECULAR MECHANICS; MOLECULAR-DYNAMICS SIMULATIONS; NEURAL-NETWORK POTENTIALS; ENERGY SURFACES; DATA-EFFICIENT; ACCURATE; IMPLEMENTATION;
D O I
10.1002/jcc.27269
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In the last several years, there has been a surge in the development of machine learning potential (MLP) models for describing molecular systems. We are interested in a particular area of this field - the training of system-specific MLPs for reactive systems - with the goal of using these MLPs to accelerate free energy simulations of chemical and enzyme reactions. To help new members in our labs become familiar with the basic techniques, we have put together a self-guided Colab tutorial (), which we expect to be also useful to other young researchers in the community. Our tutorial begins with the introduction of simple feedforward neural network (FNN) and kernel-based (using Gaussian process regression, GPR) models by fitting the two-dimensional Muller-Brown potential. Subsequently, two simple descriptors are presented for extracting features of molecular systems: symmetry functions (including the ANI variant) and embedding neural networks (such as DeepPot-SE). Lastly, these features will be fed into FNN and GPR models to reproduce the energies and forces for the molecular configurations in a Claisen rearrangement reaction. A self-guided Colab tutorial about machine learning potential for reactive systems are presented in this work. The tutorial begins with the introduction of feedforward neural network and kernel-based models by fitting the two-dimensional Muller-Brown potential. Subsequently, two simple descriptors are presented for extracting features of molecular systems: symmetry functions and embedding neural networks. Lastly, these features will be used to reproduce the energies and forces for the molecular configurations in a Claisen rearrangement reaction.image
引用
收藏
页码:638 / 647
页数:10
相关论文
共 50 条
  • [41] Learning from models: high-dimensional analyses on the performance of machine learning interatomic potentials
    Liu, Yunsheng
    Mo, Yifei
    NPJ COMPUTATIONAL MATERIALS, 2024, 10 (01)
  • [42] A systematic review on machine learning models for online learning and examination systems
    Kaddoura S.
    Popescu D.E.
    Hemanth J.D.
    PeerJ Comput. Sci., 2022,
  • [43] A systematic review on machine learning models for online learning and examination systems
    Kaddoura, Sanaa
    Popescu, Daniela Elena
    Hemanth, Jude D.
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [44] Accelerating explicit solvent models of heterogeneous catalysts with machine learning interatomic potentials
    Chen, Benjamin W. J.
    Zhang, Xinglong
    Zhang, Jia
    CHEMICAL SCIENCE, 2023, 14 (31) : 8338 - 8354
  • [45] Estimation of Potentials in Lithium-Ion Batteries Using Machine Learning Models
    Li, Weihan
    Limoge, Damas W.
    Zhang, Jiawei
    Sauer, Dirk Uwe
    Annaswamy, Anuradha M.
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2022, 30 (02) : 680 - 695
  • [46] Machine learning systems as mentors inhuman learning: A user study on machine bias transmission in medical training
    Vicente, Lucia
    Matute, Helena
    Fregosi, Caterina
    Cabitza, Federico
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2025, 198
  • [47] Training the machine: Deployment of federated learning algorithms for machine learning models across electronic health record networks
    Ruffin, Jon
    Wang, Yunlong
    Mack, Christina
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 333 - 334
  • [48] Toward Reliable and Transferable Machine Learning Potentials: Uniform Training by Overcoming Sampling Bias
    Jeong, Wonseok
    Lee, Kyuhyun
    Yoo, Dongsun
    Lee, Dongheon
    Han, Seungwu
    JOURNAL OF PHYSICAL CHEMISTRY C, 2018, 122 (39): : 22790 - 22795
  • [49] Metadynamics sampling in atomic environment space for collecting training data for machine learning potentials
    Dongsun Yoo
    Jisu Jung
    Wonseok Jeong
    Seungwu Han
    npj Computational Materials, 7
  • [50] Training machine-learning potentials for crystal structure prediction using disordered structures
    Hong, Changho
    Choi, Jeong Min
    Jeong, Wonseok
    Kang, Sungwoo
    Ju, Suyeon
    Lee, Kyeongpung
    Jung, Jisu
    Youn, Yong
    Han, Seungwu
    PHYSICAL REVIEW B, 2020, 102 (22)