Evaluation of Open-Source Large Language Models for Metal-Organic Frameworks Research

被引:5
|
作者
Bai, Xuefeng [1 ,2 ]
Xie, Yabo [1 ,2 ]
Zhang, Xin [1 ,2 ]
Han, Honggui [3 ,4 ]
Li, Jian-Rong [1 ,2 ]
机构
[1] Beijing Univ Technol, Coll Mat Sci & Engn, Beijing Key Lab Green Catalysis & Separat, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Coll Mat Sci & Engn, Dept Chem Engn, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Fac Informat Technol, Engn Res Ctr Digital Community, Beijing Lab Urban Mass Transit,Minist Educ, Beijing 100124, Peoples R China
[4] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Compendex;
D O I
10.1021/acs.jcim.4c00065
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Along with the development of machine learning, deep learning, and large language models (LLMs) such as GPT-4 (GPT: Generative Pre-Trained Transformer), artificial intelligence (AI) tools have been playing an increasingly important role in chemical and material research to facilitate the material screening and design. Despite the exciting progress of GPT-4 based AI research assistance, open-source LLMs have not gained much attention from the scientific community. This work primarily focused on metal-organic frameworks (MOFs) as a subdomain of chemistry and evaluated six top-rated open-source LLMs with a comprehensive set of tasks including MOFs knowledge, basic chemistry knowledge, in-depth chemistry knowledge, knowledge extraction, database reading, predicting material property, experiment design, computational scripts generation, guiding experiment, data analysis, and paper polishing, which covers the basic units of MOFs research. In general, these LLMs were capable of most of the tasks. Especially, Llama2-7B and ChatGLM2-6B were found to perform particularly well with moderate computational resources. Additionally, the performance of different parameter versions of the same model was compared, which revealed the superior performance of higher parameter versions.
引用
收藏
页码:4958 / 4965
页数:8
相关论文
共 50 条
  • [21] Iterative Refactoring of Real-World Open-Source Programs with Large Language Models
    Choi, Jinsu
    An, Gabin
    Yoo, Shin
    SEARCH-BASED SOFTWARE ENGINEERING, SSBSE 2024, 2024, 14767 : 49 - 55
  • [22] Fine-Tuning and Evaluating Open-Source Large Language Models for the Army Domain
    Ruiz, Maj Daniel C.
    Sell, John
    arXiv,
  • [23] Metal-organic macrocycles, metal-organic polyhedra and metal-organic frameworks
    Prakash, M. Jaya
    Lah, Myoung Soo
    CHEMICAL COMMUNICATIONS, 2009, (23) : 3326 - 3341
  • [24] Evaluation of Language Runtimes in Open-source Serverless Platforms
    Djemame, Karim
    Datsev, Daniel
    Kelefouras, Vasilios
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE (CLOSER), 2022, : 123 - 132
  • [25] Large-Scale Production of Metal-Organic Frameworks
    Chakraborty, Debanjan
    Yurdusen, Aysu
    Mouchaham, Georges
    Nouar, Farid
    Serre, Christian
    ADVANCED FUNCTIONAL MATERIALS, 2023, 34 (43)
  • [26] Metal-organic frameworks
    James, SL
    CHEMICAL SOCIETY REVIEWS, 2003, 32 (05) : 276 - 288
  • [27] Metal-organic frameworks
    Birkett, Jim
    CHEMICAL & ENGINEERING NEWS, 2017, 95 (30) : 2 - 2
  • [28] Water Adsorption in Metal-Organic Frameworks with Open-Metal Sites
    Peng, Xuan
    Lin, Li-Chiang
    Sun, Weizhen
    Smit, Berend
    AICHE JOURNAL, 2015, 61 (02) : 677 - 687
  • [29] Heterobimetallic metal-organic frameworks with tunable open-metal sites
    Butler, Derek P.
    Beauvais, Laurence
    Smythe, Nathan
    McGowan, William
    Abeykoon, Brian
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2012, 243
  • [30] Metal Acetylacetonates as a Source of Metals for Aqueous Synthesis of Metal-Organic Frameworks
    Avci-Camur, Ceren
    Perez-Carvajal, Javier
    Imaz, Inhar
    Maspoch, Daniel
    ACS SUSTAINABLE CHEMISTRY & ENGINEERING, 2018, 6 (11): : 14554 - 14560