CLADE 2.0: Evolution-Driven Cluster Learning-Assisted Directed Evolution

被引:6
|
作者
Qiu, Yuchi [1 ]
Wei, Guo-Wei [1 ,2 ,3 ]
机构
[1] Michigan State Univ, Dept Math, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Biochem & Mol Biol, E Lansing, MI 48824 USA
[3] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
关键词
PROTEIN; PREDICTION; MUTATION; DESIGN;
D O I
10.1021/acs.jcim.2c01046
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Directed evolution, a revolutionary biotechnology in protein engineering, optimizes protein fitness by searching an astronomical mutational space via expensive experiments. The cluster learning-assisted directed evolution (CLADE) efficiently explores the mutational space via a combination of unsupervised hierarchical clustering and supervised learning. However, the initial-stage sampling in CLADE treats all clusters equally despite many clusters containing a large portion of non-functional mutations. Recent statistical and deep learning tools enable evolutionary density modeling to access protein fitness in an unsupervised manner. In this work, we construct an ensemble of multiple evolutionary scores to guide the initial sampling in CLADE. The resulting evolutionary score-enhanced CLADE, called CLADE 2.0, efficiently selects a training set within a small informative space using the evolution-driven clustering sampling. CLADE 2.0 is validated by using two benchmark libraries both having 160,000 sequences from four-site mutational combinations. Extensive computational experiments and comparisons with existing cutting-edge methods indicate that CLADE 2.0 is a new state-of-art tool for machine learning-assisted directed evolution.
引用
收藏
页码:4629 / 4641
页数:13
相关论文
共 50 条
  • [1] Cluster learning-assisted directed evolution
    Qiu, Yuchi
    Hu, Jian
    Wei, Guo-Wei
    [J]. NATURE COMPUTATIONAL SCIENCE, 2021, 1 (12): : 809 - 818
  • [2] Cluster learning-assisted directed evolution
    Yuchi Qiu
    Jian Hu
    Guo-Wei Wei
    [J]. Nature Computational Science, 2021, 1 : 809 - 818
  • [3] Machine learning-assisted directed protein evolution with combinatorial libraries
    Wu, Zachary
    Kan, S. B. Jennifer
    Lewis, Russell D.
    Wittmann, Bruce J.
    Arnold, Frances H.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (18) : 8852 - 8858
  • [4] Methanol tolerance upgrading of Proteus mirabilis lipase by machine learning-assisted directed evolution
    Ma, Rui
    Li, Yingnan
    Zhang, Meng
    Xu, Fei
    [J]. SYSTEMS MICROBIOLOGY AND BIOMANUFACTURING, 2023, 3 (03): : 427 - 439
  • [5] The Evolution-Driven Signature of Parkinson's Disease
    Diederich, Nico J.
    Uchihara, Toshiki
    Grillner, Sten
    Goetz, Christopher G.
    [J]. TRENDS IN NEUROSCIENCES, 2020, 43 (07) : 475 - 492
  • [6] An Evolution-Driven Analog Circuit Topology Synthesis
    Rojec, Ziga
    Burmen, Arpad
    Fajfar, Iztok
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [7] Evolution-Driven Randomized Graph Convolutional Networks
    Zhang, Zijia
    Cai, Yaoming
    Gong, Wenyin
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (12): : 7516 - 7526
  • [8] Informed training set design enables efficient machine learning-assisted directed protein evolution
    Wittmann, Bruce J.
    Yue, Yisong
    Arnold, Frances H.
    [J]. CELL SYSTEMS, 2021, 12 (11) : 1026 - +
  • [9] Evolution-driven crosstalk between glioblastoma and the tumor microenvironment
    Lingxiang Wu
    Ruichao Chai
    Zihan Lin
    Rongrong Wu
    Diru Yao
    Tao Jiang
    Qianghu Wang
    [J]. Cancer Biology & Medicine, 2023, (05) : 319 - 324
  • [10] Machine learning-assisted directed protein evolution with combinatorial libraries (vol 116, pg 8852, 2019)
    Wu, Zachary
    Kan, S. B. Jennifer
    Lewis, Russell D.
    Wittmann, Bruce J.
    Arnold, Frances H.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (01) : 788 - 789