Predicting aggregate morphology of sequence-defined macromolecules with recurrent neural networks

被引:23
|
作者
Bhattacharya, Debjyoti [1 ]
Kleeblatt, Devon C. [1 ]
Statt, Antonia [2 ]
Reinhart, Wesley F. [1 ,3 ]
机构
[1] Penn State Univ, Mat Sci & Engn, University Pk, PA 16802 USA
[2] Univ Illinois, Grainger Coll Engn, Mat Sci & Engn, Urbana, IL 61801 USA
[3] Penn State Univ, Inst Computat & Data Sci, University Pk, PA 16802 USA
关键词
MOLECULAR-DYNAMICS SIMULATIONS; BLOCK-COPOLYMERS; DIBLOCK COPOLYMERS; VESICLES; NANOSTRUCTURES; MICELLIZATION; POLYMERS; FIELD;
D O I
10.1039/d2sm00452f
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Self-assembly of dilute sequence-defined macromolecules is a complex phenomenon in which the local arrangement of chemical moieties can lead to the formation of long-range structure. The dependence of this structure on the sequence necessarily implies that a mapping between the two exists, yet it has been difficult to model so far. Predicting the aggregation behavior of these macromolecules is challenging due to the lack of effective order parameters, a vast design space, inherent variability, and high computational costs associated with currently available simulation techniques. Here, we accurately predict the morphology of aggregates self-assembled from sequence-defined macromolecules using supervised machine learning. We find that regression models with implicit representation learning perform significantly better than those based on engineered features such as k-mer counting, and a recurrent-neural-network-based regressor performs the best out of nine model architectures we tested. Furthermore, we demonstrate the high-throughput screening of monomer sequences using the regression model to identify candidates for self-assembly into selected morphologies. Our strategy is shown to successfully identify multiple suitable sequences in every test we performed, so we hope the insights gained here can be extended to other increasingly complex design scenarios in the future, such as the design of sequences under polydispersity and at varying environmental conditions.
引用
收藏
页码:5037 / 5051
页数:15
相关论文
共 50 条
  • [21] Controlling the Surface Functionalization of Ultrasmall Gold Nanoparticles by Sequence-Defined Macromolecules
    van Der Meer, Selina Beatrice
    Seiler, Theresa
    Buchmann, Christin
    Partalidou, Georgia
    Boden, Sophia
    Loza, Kateryna
    Heggen, Marc
    Linders, Jurgen
    Prymak, Oleg
    Oliveira, Cristiano L. P.
    Hartmann, Laura
    Epple, Matthias
    CHEMISTRY-A EUROPEAN JOURNAL, 2021, 27 (04) : 1451 - 1464
  • [22] Exploring Cyclic Sulfamidate Building Blocks for the Synthesis of Sequence-Defined Macromolecules
    Hill, Stephen Andrew
    Steinfort, Robert
    Muecke, Sandra
    Reifenberger, Josefine
    Sengpiel, Tobias
    Hartmann, Laura
    MACROMOLECULAR RAPID COMMUNICATIONS, 2021, 42 (15)
  • [23] Influence of Position Isomerism on the Chiral Properties in Sequence-Defined Conjugated Macromolecules
    Milis, Wout
    Calzolai, Ginevra
    Salatelli, Elisabetta
    Koeckelberghs, Guy
    MACROMOLECULES, 2025, 58 (04) : 2106 - 2114
  • [24] Uniform soluble support for the large-scale synthesis of sequence-defined macromolecules
    De Franceschi, Irene
    Mertens, Chiel
    Badi, Nezha
    Du Prez, Filip
    POLYMER CHEMISTRY, 2022, 13 (39) : 5616 - 5624
  • [25] Reading mixtures of uniform sequence-defined macromolecules to increase data storage capacity
    Froelich, Maximiliane
    Hofheinz, Dennis
    Meier, Michael A. R.
    COMMUNICATIONS CHEMISTRY, 2020, 3 (01)
  • [26] Large language models design sequence-defined macromolecules via evolutionary optimization
    Reinhart, Wesley F.
    Statt, Antonia
    NPJ COMPUTATIONAL MATERIALS, 2024, 10 (01)
  • [27] Reading mixtures of uniform sequence-defined macromolecules to increase data storage capacity
    Maximiliane Frölich
    Dennis Hofheinz
    Michael A. R. Meier
    Communications Chemistry, 3
  • [28] Ugi Reaction of Amino Acids: From Facile Synthesis of Polypeptoids to Sequence-Defined Macromolecules
    Tao, Yue
    Tao, Youhua
    MACROMOLECULAR RAPID COMMUNICATIONS, 2021, 42 (06)
  • [29] Recent Developments in Solid-Phase Strategies towards Synthetic, Sequence-Defined Macromolecules
    Hill, Stephen A.
    Gerke, Christoph
    Hartmann, Laura
    CHEMISTRY-AN ASIAN JOURNAL, 2018, 13 (23) : 3611 - 3622
  • [30] Flow-IEG enables programmable thermodynamic properties in sequence-defined unimolecular macromolecules
    Wicker, Amanda C.
    Leibfarth, Frank A.
    Jamison, Timothy F.
    POLYMER CHEMISTRY, 2017, 8 (37) : 5786 - 5794