Large language models design sequence-defined macromolecules via evolutionary optimization

被引:0
|
作者
Reinhart, Wesley F. [1 ,2 ]
Statt, Antonia [3 ]
机构
[1] Penn State Univ, Dept Mat Sci & Engn, University Pk, PA 16802 USA
[2] Penn State Univ, Inst Computat & Data Sci, University Pk, PA 16802 USA
[3] Univ Illinois, Grainger Coll Engn, Dept Mat Sci & Engn, Champaign, IL 61801 USA
基金
美国国家科学基金会;
关键词
Active learning - Soft materials;
D O I
10.1038/s41524-024-01449-6
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
We demonstrate the ability of a large language model to perform evolutionary optimization for materials discovery. Anthropic's Claude 3.5 model outperforms an active learning scheme with handcrafted surrogate models and an evolutionary algorithm in selecting monomer sequences to produce targeted morphologies in macromolecular self-assembly. Utilizing pre-trained language models can potentially reduce the need for hyperparameter tuning while offering new capabilities such as self-reflection. The model performs this task effectively with or without context about the task itself, but domain-specific context sometimes results in faster convergence to good solutions. Furthermore, when this context is withheld, the model infers an approximate notion of the task (e.g., calling it a protein folding problem). This work provides evidence of Claude 3.5's ability to act as an evolutionary optimizer, a recently discovered emergent behavior of large language models, and demonstrates a practical use case in the study and design of soft materials.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Precision Design of Sequence-Defined Polyurethanes: Exploring Controlled Folding Through Computational Design
    Samokhvalova, Svetlana
    Lutz, Jean-Francois
    Coluzza, Ivan
    MACROMOLECULAR CHEMISTRY AND PHYSICS, 2024, 225 (23)
  • [42] Sequence-defined main-chain photoswitching macromolecules with odd-even-effect-controlled properties
    Thai, Linh Duy
    Kammerer, Jochen A.
    Golberg, Dmitri
    Mutlu, Hatice
    Barner-Kowollik, Christopher
    CHEM, 2025, 11 (02):
  • [43] Versatile Strategy to Develop Sequence-Defined Conjugated Macromolecules: A Powerful Tool toward Tunable Optoelectronic Properties
    Milis, Wout
    Peeters, Janine
    Erkens, Robin
    De Winter, Julien
    Gerbaux, Pascal
    Koeckelberghs, Guy
    ACS MACRO LETTERS, 2024, 13 (10) : 1293 - 1303
  • [44] Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction
    Porel, Mintu
    Thornlow, Dana N.
    Phan, Ngoc N.
    Alabi, Christopher A.
    NATURE CHEMISTRY, 2016, 8 (06) : 591 - 597
  • [45] Coding and decoding libraries of sequence-defined functional copolymers synthesized via photoligation
    Nicolas Zydziak
    Waldemar Konrad
    Florian Feist
    Sergii Afonin
    Steffen Weidner
    Christopher Barner-Kowollik
    Nature Communications, 7
  • [46] Intracellular Delivery via Noncharged Sequence-Defined Cell-Penetrating Oligomers
    Phan, Ngoc N.
    Li, Connie
    Alabi, Christopher A.
    BIOCONJUGATE CHEMISTRY, 2018, 29 (08) : 2628 - 2635
  • [47] Coding and decoding libraries of sequence-defined functional copolymers synthesized via photoligation
    Zydziak, Nicolas
    Konrad, Waldemar
    Feist, Florian
    Afonin, Sergii
    Weidner, Steffen
    Barner-Kowollik, Christopher
    NATURE COMMUNICATIONS, 2016, 7
  • [48] Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction
    Porel M.
    Thornlow D.N.
    Phan N.N.
    Alabi C.A.
    Nature Chemistry, 2016, 8 (6) : 590 - 596
  • [49] Opportunities and Challenges for Inverse Design of Nanostructures with Sequence Defined Macromolecules
    Reinhart, Wesley F.
    Statt, Antonia
    ACCOUNTS OF MATERIALS RESEARCH, 2021, 2 (09): : 697 - 700
  • [50] Sequence-controlled and sequence-defined polypeptoids via the Ugi reaction: synthesis and sequence-driven properties
    Zhu, Yinuo
    Tao, Youhua
    POLYMER CHEMISTRY, 2021, 12 (34) : 4895 - 4902