Self-play reinforcement learning guides protein engineering

被引:19
|
作者
Wang, Yi [1 ]
Tang, Hui [1 ]
Huang, Lichao [1 ]
Pan, Lulu [2 ]
Yang, Lixiang [1 ]
Yang, Huanming [3 ,4 ]
Mu, Feng [1 ]
Yang, Meng [1 ]
机构
[1] MGI, Shenzhen, Peoples R China
[2] MGI QingDao, Qingdao, Peoples R China
[3] Chinese Acad Sci, Hangzhou Inst Med, Hangzhou, Peoples R China
[4] James D Watson Inst Genome Sci, Hangzhou, Peoples R China
关键词
FITNESS LANDSCAPE; NEURAL-NETWORKS; SEQUENCE LOGOS; DESIGN; GO; LUCIFERASES; PREDICTION; LANGUAGE; GAME;
D O I
10.1038/s42256-023-00691-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are currently promising developments in deep learning for protein design, with applications in drug discovery and synthetic biology. For more efficient exploration of the design space, Wang et al. demonstrate a reinforcement learning method, EvoZero, for directed evolution in protein engineering towards desired functional or structure-related properties. Designing protein sequences towards desired properties is a fundamental goal of protein engineering, with applications in drug discovery and enzymatic engineering. Machine learning-guided directed evolution has shown success in expediting the optimization cycle and reducing experimental burden. However, efficient sampling in the vast design space remains a challenge. To address this, we propose EvoPlay, a self-play reinforcement learning framework based on the single-player version of AlphaZero. In this work, we mutate a single-site residue as an action to optimize protein sequences, analogous to playing pieces on a chessboard. A policy-value neural network reciprocally interacts with look-ahead Monte Carlo tree search to guide the optimization agent with breadth and depth. We extensively evaluate EvoPlay on a suite of in silico directed evolution tasks over full-length sequences or combinatorial sites using functional surrogates. EvoPlay also supports AlphaFold2 as a structural surrogate to design peptide binders with high affinities, validated by binding assays. Moreover, we harness EvoPlay to prospectively engineer luciferase, resulting in the discovery of variants with 7.8-fold bioluminescence improvement beyond wild type. In sum, EvoPlay holds great promise for facilitating protein design to tackle unmet academic, industrial and clinical needs.
引用
收藏
页码:845 / +
页数:20
相关论文
共 50 条
  • [41] Learning Diverse Risk Preferences in Population-Based Self-Play
    Jiang, Yuhua
    Liu, Qihan
    Ma, Xiaoteng
    Li, Chenghao
    Yang, Yiqin
    Yang, Jun
    Liang, Bin
    Zhao, Qianchuan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12910 - 12918
  • [42] A Generalized Framework for Self-Play Training
    Hernandez, Daniel
    Denamganai, Kevin
    Gao, Yuan
    York, Peter
    Devlin, Sam
    Samothrakis, Spyridon
    Walker, James Alfred
    2019 IEEE CONFERENCE ON GAMES (COG), 2019,
  • [43] Learning Algorithms with Self-Play: A New Approach to the Distributed Directory Problem
    Khanchandani, Pankaj
    Richter, Oliver
    Rusch, Lukas
    Wattenhofer, Roger
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 501 - 505
  • [44] Research on the Difficulty of Mobile Node Deployment's Self-Play in Wireless Ad Hoc Networks Based on Deep Reinforcement Learning
    Wang, Huitao
    Yang, Ruopeng
    Yin, Changsheng
    Zou, Xiaofei
    Wang, Xuefeng
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [45] Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates
    Soemers, Dennis J. N. J.
    Piette, Eric
    Stephenson, Matthew
    Browne, Cameron
    2019 IEEE CONFERENCE ON GAMES (COG), 2019,
  • [46] Self-play learning strategies for resource assignment in Open-RAN networks
    Wang, Xiaoyang
    Thomas, Jonathan D.
    Piechocki, Robert J.
    Kapoor, Shipra
    Santos-Rodriguez, Raul
    Parekh, Arjun
    COMPUTER NETWORKS, 2022, 206
  • [47] Self-Play or Group Practice: Learning to Play Alternating Markov Game in Multi-Agent System
    Leung, Chin-Wing
    Hu, Shuyue
    Leung, Ho-Fung
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9234 - 9241
  • [48] Fictitious Self-Play in Extensive-Form Games
    Heinrich, Johannes
    Lanctot, Marc
    Silver, David
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 805 - 813
  • [49] Self-Play for Training General Fighting Game AI
    Takano, Yoshina
    Inoue, Hideyasu
    Thawonmas, Ruck
    Harada, Tomohiro
    2019 NICOGRAPH INTERNATIONAL (NICOINT), 2019, : 120 - 120
  • [50] A Comparison of Self-Play Algorithms Under a Generalized Framework
    Hernandez, Daniel
    Denamganai, Kevin
    Devlin, Sam
    Samothrakis, Spyridon
    Walker, James Alfred
    IEEE TRANSACTIONS ON GAMES, 2022, 14 (02) : 221 - 231