Self-play reinforcement learning guides protein engineering

Cited by: 19
Authors
Wang, Yi [1]
Tang, Hui [1]
Huang, Lichao [1]
Pan, Lulu [2]
Yang, Lixiang [1]
Yang, Huanming [3, 4]
Mu, Feng [1]
Yang, Meng [1]
Affiliations
[1] MGI, Shenzhen, Peoples R China
[2] MGI QingDao, Qingdao, Peoples R China
[3] Chinese Acad Sci, Hangzhou Inst Med, Hangzhou, Peoples R China
[4] James D Watson Inst Genome Sci, Hangzhou, Peoples R China
Keywords
FITNESS LANDSCAPE; NEURAL-NETWORKS; SEQUENCE LOGOS; DESIGN; GO; LUCIFERASES; PREDICTION; LANGUAGE; GAME;
DOI
10.1038/s42256-023-00691-9
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
There are currently promising developments in deep learning for protein design, with applications in drug discovery and synthetic biology. For more efficient exploration of the design space, Wang et al. demonstrate a reinforcement learning method, EvoPlay, for directed evolution in protein engineering towards desired functional or structure-related properties.

Designing protein sequences towards desired properties is a fundamental goal of protein engineering, with applications in drug discovery and enzymatic engineering. Machine learning-guided directed evolution has shown success in expediting the optimization cycle and reducing experimental burden. However, efficient sampling in the vast design space remains a challenge. To address this, we propose EvoPlay, a self-play reinforcement learning framework based on the single-player version of AlphaZero. In this work, we treat the mutation of a single residue as an action to optimize protein sequences, analogous to playing pieces on a chessboard. A policy-value neural network reciprocally interacts with look-ahead Monte Carlo tree search to guide the optimization agent with breadth and depth. We extensively evaluate EvoPlay on a suite of in silico directed evolution tasks over full-length sequences or combinatorial sites using functional surrogates. EvoPlay also supports AlphaFold2 as a structural surrogate to design peptide binders with high affinities, validated by binding assays. Moreover, we harness EvoPlay to prospectively engineer luciferase, resulting in the discovery of variants with 7.8-fold bioluminescence improvement beyond wild type. In sum, EvoPlay holds great promise for facilitating protein design to tackle unmet academic, industrial and clinical needs.
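The game framing in the abstract (a single-residue mutation as an action, tree search guided by prior and value estimates) can be illustrated with a small sketch. This is not the authors' implementation: the policy-value network is replaced here by uniform priors and direct evaluation with a toy surrogate, and every name in the block (`toy_fitness`, `legal_actions`, `apply_action`, `Node`, `select_child`, `search`) and every default parameter is a hypothetical choice made for illustration.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def toy_fitness(seq, target):
    """Hypothetical surrogate: fraction of positions matching a hidden target."""
    return sum(a == b for a, b in zip(seq, target)) / len(target)


def legal_actions(seq):
    """All single-site mutations (position, new_residue) that change seq."""
    return [(i, aa) for i in range(len(seq)) for aa in AMINO_ACIDS if aa != seq[i]]


def apply_action(seq, action):
    pos, aa = action
    return seq[:pos] + aa + seq[pos + 1:]


class Node:
    def __init__(self, seq, prior):
        self.seq, self.prior = seq, prior
        self.children = {}      # action -> Node
        self.visits = 0
        self.value_sum = 0.0

    def mean_value(self):
        return self.value_sum / self.visits if self.visits else 0.0


def select_child(node, c_puct):
    """PUCT selection; unvisited children inherit the parent's mean value."""
    def score(child):
        q = child.mean_value() if child.visits else node.mean_value()
        u = c_puct * child.prior * math.sqrt(node.visits) / (1 + child.visits)
        return q + u
    return max(node.children.values(), key=score)


def search(start, target, n_sim=300, max_depth=3, c_puct=1.5, seed=0):
    """Single-player tree search over sequences at most max_depth mutations away."""
    rng = random.Random(seed)
    root = Node(start, prior=1.0)
    best_seq, best_fit = start, toy_fitness(start, target)
    for _ in range(n_sim):
        node, path = root, [root]
        while node.children:                      # selection
            node = select_child(node, c_puct)
            path.append(node)
        if len(path) - 1 < max_depth:             # expansion with uniform priors
            actions = legal_actions(node.seq)
            rng.shuffle(actions)
            for action in actions:
                child_seq = apply_action(node.seq, action)
                node.children[action] = Node(child_seq, 1 / len(actions))
        value = toy_fitness(node.seq, target)     # evaluation by the toy surrogate
        if value > best_fit:
            best_seq, best_fit = node.seq, value
        for visited in path:                      # backup along the visited path
            visited.visits += 1
            visited.value_sum += value
    return best_seq, best_fit


if __name__ == "__main__":
    print(search("AAAAAA", "ACDEFG"))
```

In a setting closer to the one the abstract describes, a trained network would supply the per-action priors and the leaf value, and the surrogate would be a learned fitness predictor or a structure model rather than a string comparison.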
Pages: 845 / +
Page count: 20