Language models for protein design

被引:0
|
作者
Lee, Jin Sub [1 ]
Abdin, Osama [1 ]
Kim, Philip M. [1 ,2 ,3 ]
机构
[1] Univ Toronto, Dept Mol Genet, Toronto, ON M5S 1A8, Canada
[2] Univ Toronto, Donnelly Ctr Cellular & Biomol Res, Toronto, ON M5S 3E1, Canada
[3] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 2E4, Canada
关键词
D O I
10.1016/j.sbi.2025.103027
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The recent surge of large language models has shown that machines are capable of reading, understanding, and communicating through language, even sometimes displaying capabilities surpassing those of humans. Proteins can be represented as strings of amino acids akin to words in a sentence, and the same principles of language modeling can be used to learn informative representations for protein structure prediction, design, and property prediction. In this review, we will focus on applications of language modeling to protein design. We will first cover the foundations of protein language modeling and discuss recent advances such as contextconditioned design and structure integration. We also consider current shortcomings and promising avenues of research for protein language modeling to facilitate future development of improved protein language models for design.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Are genomic language models all you need? Exploring genomic language models on protein downstream tasks
    Boshar, Sam
    Trop, Evan
    de Almeida, Bernardo P.
    Copoiu, Liviu
    Pierrot, Thomas
    BIOINFORMATICS, 2024, 40 (09)
  • [22] HaloClass: Salt-Tolerant Protein Classification with Protein Language Models
    Narang, Kush
    Nath, Abhigyan
    Hemstrom, William
    Chu, Simon K. S.
    PROTEIN JOURNAL, 2024, 43 (06): : 1035 - 1044
  • [23] Integrating protein language models and automatic biofoundry for enhanced protein evolution
    Zhang, Qiang
    Chen, Wanyi
    Qin, Ming
    Wang, Yuhao
    Pu, Zhongji
    Ding, Keyan
    Liu, Yuyue
    Zhang, Qunfeng
    Li, Dongfang
    Li, Xinjia
    Zhao, Yu
    Yao, Jianhua
    Huang, Lei
    Wu, Jianping
    Yang, Lirong
    Chen, Huajun
    Yu, Haoran
    NATURE COMMUNICATIONS, 2025, 16 (01)
  • [24] De novo protein design with a language model
    Kotsiliti, Eleni
    NATURE BIOTECHNOLOGY, 2022, 40 (10) : 1433 - 1433
  • [25] De novo protein design with a language model
    Eleni Kotsiliti
    Nature Biotechnology, 2022, 40 : 1433 - 1433
  • [26] Evolutionary changes in protein structure as models for protein design
    Cordes, Matthew H. J.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2007, 233 : 85 - 85
  • [27] Systematic synthesis of design prompts for large language models in conceptual design
    Tian, Yu
    Liu, Ang
    Dai, Yun
    Nagato, Keisuke
    Nakao, Masayuki
    CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2024, 73 (01) : 85 - 88
  • [28] LATTICE MODELS IN THE STUDY OF PROTEIN DESIGN
    YUE, K
    DILL, KA
    FASEB JOURNAL, 1992, 6 (01): : A265 - A265
  • [29] Protein design based on folding models
    Guerois, R
    Serrano, L
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2001, 11 (01) : 101 - 106
  • [30] Learning coevolutionary models for protein design
    Frechette, Layne B.
    Best, Robert B.
    BIOPHYSICAL JOURNAL, 2022, 121 (03) : 45 - 45