Variational Monte Carlo on a Budget - Fine-tuning pre-trained Neural Wavefunctions

Cited by: 0
Authors
Scherbela, Michael [1]
Gerard, Leon [1]
Grohs, Philipp [1]
Affiliations
[1] Univ Vienna, Vienna, Austria
Funding
Austrian Science Fund (FWF);
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
摘要
Obtaining accurate solutions to the Schrödinger equation is the key challenge in computational quantum chemistry. Deep-learning-based Variational Monte Carlo (DL-VMC) has recently outperformed conventional approaches in terms of accuracy, but only at large computational cost. Whereas in many domains models are trained once and subsequently applied for inference, accurate DL-VMC so far requires a full optimization for every new problem instance, consuming thousands of GPU-hours even for small molecules. We instead propose a DL-VMC model which has been pre-trained using self-supervised wavefunction optimization on a large and chemically diverse set of molecules. Applying this model to new molecules without any optimization yields wavefunctions and absolute energies that outperform established methods such as CCSD(T)-2Z. To obtain accurate relative energies, only a few fine-tuning steps of this base model are required. We accomplish this with a fully end-to-end machine-learned model, consisting of an improved geometry embedding architecture and an existing SE(3)-equivariant model to represent molecular orbitals. Combining this architecture with continuous sampling of geometries, we improve zero-shot accuracy by two orders of magnitude compared to the state of the art. We extensively evaluate the accuracy, scalability and limitations of our base model on a wide variety of test systems.
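For context, the following is a minimal, self-contained JAX sketch of the generic VMC optimization loop that both the pre-training and fine-tuning described in the abstract build on: walkers are sampled from |psi|^2 with Metropolis-Hastings, the local energy E_loc = (H psi)/psi is evaluated per sample, and the energy expectation is minimized with the standard covariance-form gradient estimator. The toy exponential ansatz, all names (log_psi, vmc_step, ...), and the plain-SGD update are illustrative assumptions for a one-electron hydrogen atom, not the paper's architecture or optimizer.

```python
import jax
import jax.numpy as jnp

# Toy ansatz (assumption, not the paper's model): one hydrogen-like electron
# with psi(r) = exp(-alpha * |r|). The exact ground state is alpha = 1 with
# E = -0.5 Hartree, so the loop below has a known target.
def log_psi(params, r):
    return -params["alpha"] * jnp.linalg.norm(r)

def local_energy(params, r):
    # E_loc = -1/2 (lap log psi + |grad log psi|^2) + V(r), with V(r) = -1/|r|
    grad_fn = jax.grad(log_psi, argnums=1)
    g = grad_fn(params, r)
    laplacian = jnp.trace(jax.jacfwd(grad_fn, argnums=1)(params, r))
    return -0.5 * (laplacian + jnp.sum(g**2)) - 1.0 / jnp.linalg.norm(r)

def metropolis_step(key, params, walkers, width=0.3):
    # One Gaussian proposal per walker, accepted with ratio |psi(r')/psi(r)|^2.
    k_prop, k_acc = jax.random.split(key)
    proposal = walkers + width * jax.random.normal(k_prop, walkers.shape)
    logp = jax.vmap(log_psi, in_axes=(None, 0))
    log_ratio = 2.0 * (logp(params, proposal) - logp(params, walkers))
    accept = jnp.log(jax.random.uniform(k_acc, log_ratio.shape)) < log_ratio
    return jnp.where(accept[:, None], proposal, walkers)

@jax.jit
def vmc_step(key, params, walkers, lr=0.02):
    walkers = metropolis_step(key, params, walkers)
    e_loc = jax.vmap(local_energy, in_axes=(None, 0))(params, walkers)
    e_mean = e_loc.mean()

    # Standard VMC gradient estimator: 2 E[(E_loc - E[E_loc]) * grad log psi].
    def surrogate_loss(p):
        logp = jax.vmap(log_psi, in_axes=(None, 0))(p, walkers)
        return 2.0 * jnp.mean(jax.lax.stop_gradient(e_loc - e_mean) * logp)

    grads = jax.grad(surrogate_loss)(params)
    params = jax.tree_util.tree_map(lambda p, g: p - lr * g, params, grads)
    return params, walkers, e_mean

key = jax.random.PRNGKey(0)
params = {"alpha": 1.5}  # start off-optimum, a stand-in for a pre-trained model
walkers = 1.0 + jax.random.normal(key, (512, 3))
for _ in range(300):
    key, subkey = jax.random.split(key)
    params, walkers, energy = vmc_step(subkey, params, walkers)
print(params["alpha"], energy)  # converges to roughly alpha ~ 1.0, E ~ -0.5 Ha
```

In the paper's setting, log_psi would be the pre-trained neural wavefunction evaluated for a given molecular geometry, and fine-tuning corresponds to running a few such steps per new molecule instead of a full from-scratch optimization.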
Pages: 19