Self-play reinforcement learning guides protein engineering

Cited by: 19
Authors
Wang, Yi [1]
Tang, Hui [1]
Huang, Lichao [1]
Pan, Lulu [2]
Yang, Lixiang [1]
Yang, Huanming [3, 4]
Mu, Feng [1]
Yang, Meng [1]
Affiliations
[1] MGI, Shenzhen, Peoples R China
[2] MGI QingDao, Qingdao, Peoples R China
[3] Chinese Acad Sci, Hangzhou Inst Med, Hangzhou, Peoples R China
[4] James D Watson Inst Genome Sci, Hangzhou, Peoples R China
Keywords
FITNESS LANDSCAPE; NEURAL-NETWORKS; SEQUENCE LOGOS; DESIGN; GO; LUCIFERASES; PREDICTION; LANGUAGE; GAME;
DOI
10.1038/s42256-023-00691-9
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
There are currently promising developments in deep learning for protein design, with applications in drug discovery and synthetic biology. For more efficient exploration of the design space, Wang et al. demonstrate a reinforcement learning method, EvoPlay, for directed evolution in protein engineering towards desired functional or structure-related properties.

Designing protein sequences towards desired properties is a fundamental goal of protein engineering, with applications in drug discovery and enzymatic engineering. Machine learning-guided directed evolution has shown success in expediting the optimization cycle and reducing experimental burden. However, efficient sampling in the vast design space remains a challenge. To address this, we propose EvoPlay, a self-play reinforcement learning framework based on the single-player version of AlphaZero. In this work, we mutate a single-site residue as an action to optimize protein sequences, analogous to playing pieces on a chessboard. A policy-value neural network reciprocally interacts with look-ahead Monte Carlo tree search to guide the optimization agent with breadth and depth. We extensively evaluate EvoPlay on a suite of in silico directed evolution tasks over full-length sequences or combinatorial sites using functional surrogates. EvoPlay also supports AlphaFold2 as a structural surrogate to design peptide binders with high affinities, validated by binding assays. Moreover, we harness EvoPlay to prospectively engineer luciferase, resulting in the discovery of variants with 7.8-fold bioluminescence improvement beyond wild type. In sum, EvoPlay holds great promise for facilitating protein design to tackle unmet academic, industrial and clinical needs.
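To make the "single-site mutation as an action" idea in the abstract concrete, here is a minimal sketch of a single-player tree search over point mutations. It is not the authors' implementation: it uses plain UCT (UCB1) with direct surrogate evaluation in place of the policy-value network, and `toy_fitness`, its made-up target `"MKVLA"`, and every function name and parameter below are assumptions introduced for illustration only.

```python
import math

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def toy_fitness(seq, target="MKVLA"):
    # Hypothetical surrogate: fraction of positions matching a made-up target.
    return sum(a == b for a, b in zip(seq, target)) / len(target)


def mutations(seq):
    # Every single-site substitution, encoded as (position, new_residue).
    return [(i, aa) for i in range(len(seq)) for aa in ALPHABET if aa != seq[i]]


def mutate(seq, action):
    i, aa = action
    return seq[:i] + aa + seq[i + 1:]


class Node:
    def __init__(self, seq):
        self.seq = seq
        self.children = {}   # action -> Node
        self.visits = 0
        self.value_sum = 0.0

    def q(self):
        # Mean surrogate value backed up through this node.
        return self.value_sum / self.visits if self.visits else 0.0


def ucb(parent, child, c=0.5):
    # UCB1 score; unvisited children are tried first.
    if child.visits == 0:
        return math.inf
    return child.q() + c * math.sqrt(math.log(parent.visits) / child.visits)


def search(seq, fitness=toy_fitness, n_sim=300, max_depth=2):
    # Run n_sim simulations from seq and return the preferred root action.
    root = Node(seq)
    for _ in range(n_sim):
        node, path = root, [root]
        # Selection: descend by UCB1 until reaching an unexpanded node.
        while node.children:
            parent = node
            node = max(parent.children.values(), key=lambda ch: ucb(parent, ch))
            path.append(node)
        # Expansion: add all single-site mutants unless at the depth limit.
        if len(path) - 1 < max_depth:
            for a in mutations(node.seq):
                node.children[a] = Node(mutate(node.seq, a))
        # Evaluation: the surrogate stands in for a learned value head.
        value = fitness(node.seq)
        # Backup: update statistics along the visited path.
        for n in path:
            n.visits += 1
            n.value_sum += value
    # Prefer the most-visited action, breaking ties by mean value.
    return max(root.children,
               key=lambda a: (root.children[a].visits, root.children[a].q()))


def design(start, rounds=4, fitness=toy_fitness):
    # Apply one searched mutation per round; return the best sequence seen.
    current = best = start
    for _ in range(rounds):
        current = mutate(current, search(current, fitness))
        if fitness(current) > fitness(best):
            best = current
    return best
```

Calling `design("AAAAA")` walks the sequence one substitution at a time. In a fuller system the surrogate would be replaced by a trained fitness predictor, and the uniform exploration here by network-provided priors and values, as the abstract describes.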
Pages: 845 / +
Page count: 20
Source
Nature Machine Intelligence, 2023, 5: 845-860