Large language models design sequence-defined macromolecules via evolutionary optimization

Cited by: 0
Authors
Reinhart, Wesley F. [1 ,2 ]
Statt, Antonia [3 ]
Affiliations
[1] Penn State Univ, Dept Mat Sci & Engn, University Pk, PA 16802 USA
[2] Penn State Univ, Inst Computat & Data Sci, University Pk, PA 16802 USA
[3] Univ Illinois, Grainger Coll Engn, Dept Mat Sci & Engn, Champaign, IL 61801 USA
Funding
US National Science Foundation;
Keywords
Active learning - Soft materials;
DOI
10.1038/s41524-024-01449-6
Chinese Library Classification
O64 [Physical chemistry (theoretical chemistry), chemical physics];
Discipline classification codes
070304 ; 081704 ;
Abstract
We demonstrate the ability of a large language model to perform evolutionary optimization for materials discovery. Anthropic's Claude 3.5 model outperforms an active learning scheme with handcrafted surrogate models and an evolutionary algorithm in selecting monomer sequences to produce targeted morphologies in macromolecular self-assembly. Utilizing pre-trained language models can potentially reduce the need for hyperparameter tuning while offering new capabilities such as self-reflection. The model performs this task effectively with or without context about the task itself, but domain-specific context sometimes results in faster convergence to good solutions. Furthermore, when this context is withheld, the model infers an approximate notion of the task (e.g., calling it a protein folding problem). This work provides evidence of Claude 3.5's ability to act as an evolutionary optimizer, a recently discovered emergent behavior of large language models, and demonstrates a practical use case in the study and design of soft materials.
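The evolutionary loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two-letter monomer alphabet, the `toy_fitness` score (a count of A/B block boundaries), and every function name and parameter below are hypothetical stand-ins chosen for illustration. The `mutation_proposer` uses random crossover and mutation; in an LLM-driven variant, that function would presumably be replaced by one that places the scored parent sequences in a prompt and parses new candidates from the reply.

```python
import random
from typing import Callable, List, Tuple

ALPHABET = "AB"  # assumption: two monomer types, not taken from the paper

Scored = Tuple[str, float]
Proposer = Callable[[List[Scored], int, random.Random], List[str]]


def toy_fitness(seq: str, target_boundaries: int = 3) -> float:
    """Hypothetical score (0 is best): penalize deviation from a target
    number of A/B block boundaries. A real study would score morphology."""
    boundaries = sum(1 for a, b in zip(seq, seq[1:]) if a != b)
    return float(-abs(boundaries - target_boundaries))


def mutation_proposer(parents: List[Scored], n_children: int,
                      rng: random.Random) -> List[str]:
    """Stand-in proposer: one-point crossover followed by resampling one
    randomly chosen position. An LLM-based proposer would share this signature."""
    children = []
    for _ in range(n_children):
        (p1, _), (p2, _) = rng.choice(parents), rng.choice(parents)
        cut = rng.randrange(1, len(p1))
        child = list(p1[:cut] + p2[cut:])
        child[rng.randrange(len(child))] = rng.choice(ALPHABET)
        children.append("".join(child))
    return children


def evolve(fitness: Callable[[str], float], proposer: Proposer,
           length: int = 20, pop_size: int = 16, n_parents: int = 4,
           generations: int = 30, seed: int = 0) -> Scored:
    """Score the population, keep the top sequences (elitism), and ask the
    proposer for replacements; return the best sequence and its score."""
    rng = random.Random(seed)
    population = ["".join(rng.choice(ALPHABET) for _ in range(length))
                  for _ in range(pop_size)]
    for _ in range(generations):
        scored = sorted(((s, fitness(s)) for s in population),
                        key=lambda t: t[1], reverse=True)
        parents = scored[:n_parents]
        population = [s for s, _ in parents]
        population += proposer(parents, pop_size - n_parents, rng)
    # Parents are carried over unchanged, so the final population
    # still contains the best sequence seen so far.
    return max(((s, fitness(s)) for s in population), key=lambda t: t[1])


if __name__ == "__main__":
    best_seq, best_score = evolve(toy_fitness, mutation_proposer)
    print(best_seq, best_score)
```

Keeping the proposer behind a plain function signature means the selection and scoring logic stays the same whether candidates come from random mutation or from a language model.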
Pages: 8