Cluster learning-assisted directed evolution

被引:22
|
作者
Qiu, Yuchi [1 ]
Hu, Jian [2 ,3 ]
Wei, Guo-Wei [1 ,3 ,4 ]
机构
[1] Michigan State Univ, Dept Math, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Chem, E Lansing, MI 48824 USA
[3] Michigan State Univ, Dept Biochem & Mol Biol, E Lansing, MI 48824 USA
[4] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
来源
NATURE COMPUTATIONAL SCIENCE | 2021年 / 1卷 / 12期
关键词
PROTEIN; PREDICTION; MUTATION;
D O I
10.1038/s43588-021-00168-y
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Directed evolution, a strategy for protein engineering, optimizes protein properties (that is, fitness) by expensive and time-consuming screening or selection of a large mutational sequence space. Machine learning-assisted directed evolution (MLDE), which screens sequence properties in silico, can accelerate the optimization and reduce the experimental burden. This work introduces an MLDE framework, cluster learning-assisted directed evolution (CLADE), which combines hierarchical unsupervised clustering sampling and supervised learning to guide protein engineering. The clustering sampling selectively picks and screens variants in targeted subspaces, which guides the subsequent generation of diverse training sets. In the last stage, accurate predictions via supervised learning models improve the final outcomes. By sequentially screening 480 sequences out of 160,000 in a four-site combinatorial library with five equal experimental batches, CLADE achieves global maximal fitness hit rates of up to 91.0% and 34.0% for the GB1 and PhoQ datasets, respectively, improved from the values of 18.6% and 7.2% obtained by random sampling-based MLDE.
引用
收藏
页码:809 / 818
页数:10
相关论文
共 50 条
  • [1] Cluster learning-assisted directed evolution
    Yuchi Qiu
    Jian Hu
    Guo-Wei Wei
    [J]. Nature Computational Science, 2021, 1 : 809 - 818
  • [2] CLADE 2.0: Evolution-Driven Cluster Learning-Assisted Directed Evolution
    Qiu, Yuchi
    Wei, Guo-Wei
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (19) : 4629 - 4641
  • [3] Machine learning-assisted directed protein evolution with combinatorial libraries
    Wu, Zachary
    Kan, S. B. Jennifer
    Lewis, Russell D.
    Wittmann, Bruce J.
    Arnold, Frances H.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (18) : 8852 - 8858
  • [4] Methanol tolerance upgrading of Proteus mirabilis lipase by machine learning-assisted directed evolution
    Ma, Rui
    Li, Yingnan
    Zhang, Meng
    Xu, Fei
    [J]. SYSTEMS MICROBIOLOGY AND BIOMANUFACTURING, 2023, 3 (03): : 427 - 439
  • [5] Informed training set design enables efficient machine learning-assisted directed protein evolution
    Wittmann, Bruce J.
    Yue, Yisong
    Arnold, Frances H.
    [J]. CELL SYSTEMS, 2021, 12 (11) : 1026 - +
  • [6] Machine learning-assisted directed protein evolution with combinatorial libraries (vol 116, pg 8852, 2019)
    Wu, Zachary
    Kan, S. B. Jennifer
    Lewis, Russell D.
    Wittmann, Bruce J.
    Arnold, Frances H.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (01) : 788 - 789
  • [7] Learning-assisted optimization for transmission switching
    Pineda, Salvador
    Morales, Juan Miguel
    Jimenez-Cordero, Asuncion
    [J]. TOP, 2024,
  • [8] Machine learning-assisted enzyme engineering
    Siedhoff, Niklas E.
    Schwaneberg, Ulrich
    Davari, Mehdi D.
    [J]. ENZYME ENGINEERING AND EVOLUTION: GENERAL METHODS, 2020, 643 : 281 - 315
  • [9] Learning-Assisted Intelligent Scheduling System
    Madureira, Ana
    Pereira, Joao Paulo
    Pereira, Ivo
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 2820 - 2825
  • [10] Learning-Assisted Automated Reasoning with Flyspeck
    Cezary Kaliszyk
    Josef Urban
    [J]. Journal of Automated Reasoning, 2014, 53 : 173 - 213