Unsupervised evolution of protein and antibody complexes with a structure-informed language model

被引:11
|
作者
Shanker, Varun R. [1 ,2 ,3 ]
Bruun, Theodora U. J. [2 ,3 ,4 ]
Hie, Brian L. [3 ,4 ,6 ,7 ,8 ]
Kim, Peter S. [3 ,4 ,5 ]
机构
[1] Stanford Univ, Sch Med, Stanford Biophys Program, Stanford, CA 94305 USA
[2] Stanford Univ, Sch Med, Stanford Med Scientist Training Program, Stanford, CA 94305 USA
[3] Stanford Univ, Sarafan ChEM H, Stanford, CA 94305 USA
[4] Stanford Univ, Sch Med, Dept Biochem, Stanford, CA 94305 USA
[5] Chan Zuckerberg Biohub, San Francisco, CA 94158 USA
[6] Stanford Univ, Dept Chem Engn, Stanford, CA 94305 USA
[7] Stanford Univ, Stanford Data Sci, Stanford, CA 94305 USA
[8] Arc Inst, Palo Alto, CA 94304 USA
关键词
FITNESS LANDSCAPES; SEQUENCE; DESIGN; SELECTION; RECOGNITION; INHIBITION; GENERATION; REVEALS; SET;
D O I
10.1126/science.adk8946
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Large language models trained on sequence information alone can learn high-level principles of protein design. However, beyond sequence, the three-dimensional structures of proteins determine their specific function, activity, and evolvability. Here, we show that a general protein language model augmented with protein structure backbone coordinates can guide evolution for diverse proteins without the need to model individual functional tasks. We also demonstrate that ESM-IF1, which was only trained on single-chain structures, can be extended to engineer protein complexes. Using this approach, we screened about 30 variants of two therapeutic clinical antibodies used to treat severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. We achieved up to 25-fold improvement in neutralization and 37-fold improvement in affinity against antibody-escaped viral variants of concern BQ.1.1 and XBB.1.5, respectively. These findings highlight the advantage of integrating structural information to identify efficient protein evolution trajectories without requiring any task-specific training data.
引用
收藏
页码:46 / 53
页数:8
相关论文
共 50 条
  • [31] EquiRank: Improved protein-protein interface quality estimation using protein language-model-informed equivariant graph neural networks
    Shuvo, Md Hossain
    Bhattacharya, Debswapna
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2025, 27 : 160 - 170
  • [32] Evolution of interface binding strengths in simplified model of protein quaternary structure
    Leonard, Alexander S.
    Ahnert, Sebastian E.
    PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (06)
  • [33] SLAM: Structure-aware lysine β-hydroxybutyrylation prediction with protein language model
    Qin, Zhaohui
    Liu, Huixia
    Zhao, Pei
    Wang, Kaiyuan
    Ren, Haoran
    Miao, Chunbo
    Li, Junzhou
    Chen, Yong-Zi
    Chen, Zhen
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2024, 280
  • [34] Experimental analysis of co-evolution within protein complexes: The yeast exosome as a model
    Sandler, Inga
    Medalia, Ohad
    Aharoni, Amir
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2013, 81 (11) : 1997 - 2006
  • [35] Protein structure-function continuum model: Emerging nexuses between specificity, evolution, and structure
    Gupta, Munishwar Nath
    Uversky, Vladimir N.
    PROTEIN SCIENCE, 2024, 33 (04)
  • [36] Protein language-model embeddings for fast, accurate, and alignment-free protein structure prediction
    Weissenow, Konstantin
    Heinzinger, Michael
    Rost, Burkhard
    STRUCTURE, 2022, 30 (08) : 1169 - +
  • [37] A method for multiple-sequence-alignment-free protein structure prediction using a protein language model
    Xiaomin Fang
    Fan Wang
    Lihang Liu
    Jingzhou He
    Dayong Lin
    Yingfei Xiang
    Kunrui Zhu
    Xiaonan Zhang
    Hua Wu
    Hui Li
    Le Song
    Nature Machine Intelligence, 2023, 5 : 1087 - 1096
  • [38] A method for multiple-sequence-alignment-free protein structure prediction using a protein language model
    Fang, Xiaomin
    Wang, Fan
    Liu, Lihang
    He, Jingzhou
    Lin, Dayong
    Xiang, Yingfei
    Zhu, Kunrui
    Zhang, Xiaonan
    Wu, Hua
    Li, Hui
    Song, Le
    NATURE MACHINE INTELLIGENCE, 2023, 5 (10) : 1087 - 1096
  • [39] Contact Potential for Structure Prediction of Proteins and Protein Complexes from Potts Model
    Anishchenko, Ivan
    Kundrotas, Petras J.
    Vakser, Ilya A.
    BIOPHYSICAL JOURNAL, 2018, 115 (05) : 809 - 821
  • [40] Protein structure analysis of Gonococcal cell surface protein antibody binding domains using a molecular model approach
    Ting, Yu-Shu
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2014, 248