Cluster learning-assisted directed evolution

被引:22
|
作者
Qiu, Yuchi [1 ]
Hu, Jian [2 ,3 ]
Wei, Guo-Wei [1 ,3 ,4 ]
机构
[1] Michigan State Univ, Dept Math, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Chem, E Lansing, MI 48824 USA
[3] Michigan State Univ, Dept Biochem & Mol Biol, E Lansing, MI 48824 USA
[4] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
来源
NATURE COMPUTATIONAL SCIENCE | 2021年 / 1卷 / 12期
关键词
PROTEIN; PREDICTION; MUTATION;
D O I
10.1038/s43588-021-00168-y
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Directed evolution, a strategy for protein engineering, optimizes protein properties (that is, fitness) by expensive and time-consuming screening or selection of a large mutational sequence space. Machine learning-assisted directed evolution (MLDE), which screens sequence properties in silico, can accelerate the optimization and reduce the experimental burden. This work introduces an MLDE framework, cluster learning-assisted directed evolution (CLADE), which combines hierarchical unsupervised clustering sampling and supervised learning to guide protein engineering. The clustering sampling selectively picks and screens variants in targeted subspaces, which guides the subsequent generation of diverse training sets. In the last stage, accurate predictions via supervised learning models improve the final outcomes. By sequentially screening 480 sequences out of 160,000 in a four-site combinatorial library with five equal experimental batches, CLADE achieves global maximal fitness hit rates of up to 91.0% and 34.0% for the GB1 and PhoQ datasets, respectively, improved from the values of 18.6% and 7.2% obtained by random sampling-based MLDE.
引用
收藏
页码:809 / 818
页数:10
相关论文
共 50 条
  • [31] Machine learning-assisted smart epitaxy of Ⅲ-Ⅴ semiconductors
    Yue Hao
    [J]. Science China(Materials)., 2024, 67 (09) - 3042
  • [32] Visualizing Uncertainty in Machine Learning-Assisted Measurements
    Shirmohammadi, Shervin
    [J]. IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2023, 26 (07) : 20 - 27
  • [33] Deep learning-assisted Hubble parameter analysis
    Salti, Mehmet
    Kangal, Evrim Ersin
    Zengin, Bilgin
    [J]. MODERN PHYSICS LETTERS A, 2024, 39 (04)
  • [34] Learning-assisted multi-step planning
    Hauser, K
    Bretl, T
    Latombe, JC
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 4575 - 4580
  • [35] Review of learning-assisted power system optimization
    Ruan, Guangchun
    Zhong, Haiwang
    Zhang, Guanglun
    He, Yiliu
    Wang, Xuan
    Pu, Tianjiao
    [J]. CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2021, 7 (02): : 221 - 231
  • [36] Reinforcement Learning-Assisted Management for Convertible SSDs
    Wei, Qian
    Li, Yi
    Jia, Zhiping
    Zhao, Mengying
    Shen, Zhaoyan
    Li, Bingzhe
    [J]. 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [37] Machine learning-assisted crystal engineering of a zeolite
    Li, Xinyu
    Han, He
    Evangelou, Nikolaos
    Wichrowski, Noah J.
    Lu, Peng
    Xu, Wenqian
    Hwang, Son-Jong
    Zhao, Wenyang
    Song, Chunshan
    Guo, Xinwen
    Bhan, Aditya
    Kevrekidis, Ioannis G.
    Tsapatsis, Michael
    [J]. NATURE COMMUNICATIONS, 2023, 14 (01)
  • [38] Machine Learning-Assisted Design of Material Properties
    Kadulkar, Sanket
    Sherman, Zachary M.
    Ganesan, Venkat
    Truskett, Thomas M.
    [J]. ANNUAL REVIEW OF CHEMICAL AND BIOMOLECULAR ENGINEERING, 2022, 13 : 235 - 254
  • [39] Learning-assisted theorem proving with millions of lemmas
    Kaliszyk, Cezary
    Urban, Josef
    [J]. JOURNAL OF SYMBOLIC COMPUTATION, 2015, 69 : 109 - 128
  • [40] Machine Learning-Assisted Hybrid ReaxFF Simulations
    Yilmaz, Dundar E.
    Woodward, William Hunter
    van Duin, Adri C. T.
    [J]. JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2021, 17 (11) : 6705 - 6712