Evolutionary velocity with protein language models predicts evolutionary dynamics of diverse proteins

被引:38
|
作者
Hie, Brian L. [1 ,2 ]
Yang, Kevin K. [3 ]
Kim, Peter S. [1 ,2 ,4 ]
机构
[1] Stanford Univ, Dept Biochem, Sch Med, Stanford, CA 94305 USA
[2] Stanford Univ, Stanford ChEM H, Stanford, CA 94305 USA
[3] Microsoft Res New England, Cambridge, MA 02142 USA
[4] Chan Zuckerberg Biohub, San Francisco, CA 94158 USA
关键词
MOLECULAR EVOLUTION; MATRICES; DATABASE; SERPINS;
D O I
10.1016/j.cels.2022.01.003
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The degree to which evolution is predictable is a fundamental question in biology. Previous attempts to predict the evolution of protein sequences have been limited to specific proteins and to small changes, such as single-residue mutations. Here, we demonstrate that by using a protein language model to predict the local evolution within protein families, we recover a dynamic ???vector field???of protein evolution that we call evolutionary velocity (evo-velocity). Evo-velocity generalizes to evolution over vastly different timescales, from viral proteins evolving over years to eukaryotic proteins evolving over geologic eons, and can predict the evolutionary dynamics of proteins that were not used to develop the original model. Evo-velocity also yields new evolutionary insights by predicting strategies of viral-host immune escape, resolving conflicting theories on the evolution of serpins, and revealing a key role of horizontal gene transfer in the evolution of eukaryotic glycolysis.
引用
收藏
页码:274 / +
页数:19
相关论文
共 50 条
  • [21] SAMPLERS, MAXIMISERS AND THE CULTURAL EVOLUTIONARY DYNAMICS OF LANGUAGE
    Blythe, Richard A.
    [J]. EVOLUTION OF LANGUAGE, PROCEEDINGS, 2010, : 369 - 370
  • [22] Evolutionary conservation of protein vibrational dynamics
    Maguid, Sandra
    Fernandez-Alberti, Sebastian
    Echave, Julian
    [J]. GENE, 2008, 422 (1-2) : 7 - 13
  • [23] Evolutionary Models of Fold-Switching Proteins
    Frechette, Layne B.
    Best, Robert B.
    [J]. BIOPHYSICAL JOURNAL, 2021, 120 (03) : 20A - 20A
  • [24] The GAF domain: an evolutionary link between diverse phototransducing proteins
    Aravind, L
    Ponting, CP
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1997, 22 (12) : 458 - 459
  • [25] Difference equations as models of evolutionary population dynamics
    Cushing, J. M.
    [J]. JOURNAL OF BIOLOGICAL DYNAMICS, 2019, 13 (01) : 103 - 127
  • [26] Deciphering evolutionary dynamics of SWEET genes in diverse plant lineages
    Li, Xiaoyu
    Si, Weina
    Qin, QianQian
    Wu, Hao
    Jiang, Haiyang
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [27] Deciphering evolutionary dynamics of SWEET genes in diverse plant lineages
    Xiaoyu Li
    Weina Si
    QianQian Qin
    Hao Wu
    Haiyang Jiang
    [J]. Scientific Reports, 8
  • [28] Evolutionary dynamics in networked trust games with diverse investment patterns
    Shang, Lihui
    Zhang, Tiantian
    Chang, Min
    [J]. EPL, 2023, 141 (02)
  • [29] Evolutionary changes in protein structure as models for protein design
    Cordes, Matthew H. J.
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2007, 233 : 85 - 85
  • [30] Evolving integral projection models: evolutionary demography meets eco-evolutionary dynamics
    Rees, Mark
    Ellner, Stephen P.
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2016, 7 (02): : 157 - 170