A fine-tuned large language model based molecular dynamics agent for code generation to obtain material thermodynamic parameters

被引:0
|
作者
Zhuofan Shi [1 ]
Chunxiao Xin [2 ]
Tong Huo [3 ]
Yuntao Jiang [1 ]
Bowen Wu [2 ]
Xingyue Chen [3 ]
Wei Qin [2 ]
Xinjian Ma [3 ]
Gang Huang [4 ]
Zhenyu Wang [1 ]
Xiang Jing [2 ]
机构
[1] Peking University,School of Software and Microelectronics
[2] National Key Laboratory of Data Space Technology and System,Institute of Information Engineering
[3] Advanced Institute of Big Data,undefined
[4] Chinese Academy of Sciences,undefined
关键词
LLM; Agent; Materials science;
D O I
10.1038/s41598-025-92337-6
中图分类号
学科分类号
摘要
In the field of materials science, addressing the complex relationship between the material structure and properties has increasingly involved leveraging the text generation capabilities of AI-generated content (AIGC) models for tasks that include literature mining and data analysis. However, theoretical calculations and code development remain labor-intensive challenges. This paper proposes a novel approach based on text-to-code generation, utilizing large language models to automate the implementation of simulation programs in materials science. The effectiveness of automated code generation and review is validated with thermodynamics simulations based on the LAMMPS software as a foundation. This study introduces Molecular Dynamics Agent (MDAgent), a framework designed to guide large models in automatically generating, executing, and refining simulation code. In addition, a thermodynamic simulation code dataset for LAMMPS was constructed to fine-tune the language model. Expert evaluation scores demonstrate that MDAgent significantly improves the code generation and review capabilities. The proposed approach reduces the average task time by 42.22%, as compared to traditional models, thus highlighting its potential applications in the field of materials science.
引用
收藏
相关论文
共 24 条
  • [1] CentralBankRoBERTa: A fine-tuned large language model for central bank communications☆
    Pfeifer, Moritz
    Marohl, Vincent P.
    JOURNAL OF FINANCE AND DATA SCIENCE, 2023, 9
  • [2] Comparative Analysis of Generic and Fine-Tuned Large Language Models for Conversational Agent Systems
    Villa, Laura
    Carneros-Prado, David
    Dobrescu, Cosmin C.
    Sanchez-Miguel, Adrian
    Cubero, Guillermo
    Hervas, Ramon
    ROBOTICS, 2024, 13 (05)
  • [4] Taiyi: a bilingual fine-tuned large language model for diverse biomedical tasks
    Luo, Ling
    Ning, Jinzhong
    Zhao, Yingwen
    Wang, Zhijun
    Ding, Zeyuan
    Chen, Peng
    Fu, Weiru
    Han, Qinyu
    Xu, Guangtao
    Qiu, Yunzhi
    Pan, Dinghao
    Li, Jiru
    Li, Hao
    Feng, Wenduo
    Tu, Senbo
    Liu, Yuqi
    Yang, Zhihao
    Wang, Jian
    Sun, Yuanyuan
    Lin, Hongfei
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (09) : 1865 - 1874
  • [5] BASHEXPLAINER: Retrieval-Augmented Bash Code Comment Generation based on Fine-tuned CodeBERT
    Yu, Chi
    Yang, Guang
    Chen, Xiang
    Liu, Ke
    Zhou, Yanlin
    2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 82 - 93
  • [6] Fine Tuning Large Language Model for Secure Code Generation
    Li, Junjie
    Sangalay, Aseem
    Cheng, Cheng
    Tian, Yuan
    Yang, Jinqiu
    PROCEEDINGS 2024 IEEE/ACM FIRST INTERNATIONAL CONFERENCE ON AI FOUNDATION MODELS AND SOFTWARE ENGINEERING, FORGE 2024, 2024, : 86 - 90
  • [7] EpilepsyLLM: Domain-Specific Large Language Model Fine-tuned with Epilepsy Medical Knowledge
    Zhao, Xuyang
    Zhao, Qibin
    Tanaka, Toshihisa
    arXiv,
  • [8] An open-source fine-tuned large language model for radiological impression generation: a multi-reader performance study
    Adrian Serapio
    Gunvant Chaudhari
    Cody Savage
    Yoo Jin Lee
    Maya Vella
    Shravan Sridhar
    Jamie Lee Schroeder
    Jonathan Liu
    Adam Yala
    Jae Ho Sohn
    BMC Medical Imaging, 24 (1)
  • [9] Extracting structured data from organic synthesis procedures using a fine-tuned large language model
    Ai, Qianxiang
    Meng, Fanwang
    Shi, Jiale
    Pelkie, Brenden
    Coley, Connor W.
    DIGITAL DISCOVERY, 2024, 3 (09): : 1822 - 1831
  • [10] The Fine-Tuned Large Language Model for Extracting the Progressive Bone Metastasis from Unstructured Radiology Reports
    Kanemaru, Noriko
    Yasaka, Koichiro
    Fujita, Nana
    Kanzawa, Jun
    Abe, Osamu
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, : 865 - 872