Self-play reinforcement learning guides protein engineering

被引:19
|
作者
Wang, Yi [1 ]
Tang, Hui [1 ]
Huang, Lichao [1 ]
Pan, Lulu [2 ]
Yang, Lixiang [1 ]
Yang, Huanming [3 ,4 ]
Mu, Feng [1 ]
Yang, Meng [1 ]
机构
[1] MGI, Shenzhen, Peoples R China
[2] MGI QingDao, Qingdao, Peoples R China
[3] Chinese Acad Sci, Hangzhou Inst Med, Hangzhou, Peoples R China
[4] James D Watson Inst Genome Sci, Hangzhou, Peoples R China
关键词
FITNESS LANDSCAPE; NEURAL-NETWORKS; SEQUENCE LOGOS; DESIGN; GO; LUCIFERASES; PREDICTION; LANGUAGE; GAME;
D O I
10.1038/s42256-023-00691-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are currently promising developments in deep learning for protein design, with applications in drug discovery and synthetic biology. For more efficient exploration of the design space, Wang et al. demonstrate a reinforcement learning method, EvoZero, for directed evolution in protein engineering towards desired functional or structure-related properties. Designing protein sequences towards desired properties is a fundamental goal of protein engineering, with applications in drug discovery and enzymatic engineering. Machine learning-guided directed evolution has shown success in expediting the optimization cycle and reducing experimental burden. However, efficient sampling in the vast design space remains a challenge. To address this, we propose EvoPlay, a self-play reinforcement learning framework based on the single-player version of AlphaZero. In this work, we mutate a single-site residue as an action to optimize protein sequences, analogous to playing pieces on a chessboard. A policy-value neural network reciprocally interacts with look-ahead Monte Carlo tree search to guide the optimization agent with breadth and depth. We extensively evaluate EvoPlay on a suite of in silico directed evolution tasks over full-length sequences or combinatorial sites using functional surrogates. EvoPlay also supports AlphaFold2 as a structural surrogate to design peptide binders with high affinities, validated by binding assays. Moreover, we harness EvoPlay to prospectively engineer luciferase, resulting in the discovery of variants with 7.8-fold bioluminescence improvement beyond wild type. In sum, EvoPlay holds great promise for facilitating protein design to tackle unmet academic, industrial and clinical needs.
引用
收藏
页码:845 / +
页数:20
相关论文
共 50 条
  • [21] A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
    Silver, David
    Hubert, Thomas
    Schrittwieser, Julian
    Antonoglou, Ioannis
    Lai, Matthew
    Guez, Arthur
    Lanctot, Marc
    Sifre, Laurent
    Kumaran, Dharshan
    Graepel, Thore
    Lillicrap, Timothy
    Simonyan, Karen
    Hassabis, Demis
    SCIENCE, 2018, 362 (6419) : 1140 - +
  • [22] A Deep Reinforcement Learning Approach Using Asymmetric Self-Play for Robust Multirobot Flocking
    Jia, Yunjie
    Song, Yong
    Cheng, Jiyu
    Jin, Jiong
    Zhang, Wei
    Yang, Simon X.
    Kwong, Sam
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2025,
  • [23] Learning to Drive via Asymmetric Self-Play
    Zhang, Chris
    Biswas, Sourav
    Wong, Kelvin
    Fallah, Kion
    Zhang, Lunjun
    Chen, Dian
    Casas, Sergio
    Urtasun, Raquel
    COMPUTER VISION - ECCV 2024, PT LXII, 2025, 15120 : 149 - 168
  • [24] Do as you teach: a multi-teacher approach to self-play in deep reinforcement learning
    Chaitanya Kharyal
    Sai Krishna Gottipati
    Tanmay Kumar Sinha
    Fatemeh Abdollahi
    Srijita Das
    Matthew E. Taylor
    Neural Computing and Applications, 2025, 37 (8) : 5945 - 5956
  • [25] Multiagent Reinforcement Learning for Strategic Decision Making and Control in Robotic Soccer Through Self-Play
    Brandao, Bruno
    De Lima, Telma Woerle
    Soares, Anderson
    Melo, Luckeciano
    Maximo, Marcos R. O. A.
    IEEE ACCESS, 2022, 10 : 72628 - 72642
  • [26] Transforming Cybersecurity Dynamics: Enhanced Self-Play Reinforcement Learning in Intrusion Detection and Prevention System
    Jaber, Aws
    18TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON 2024, 2024,
  • [27] Learning self-play agents for combinatorial optimization problems
    Xu, Ruiyang
    Lieberherr, Karl
    KNOWLEDGE ENGINEERING REVIEW, 2020, 35
  • [28] A deep reinforcement learning method for structural dominant failure modes searching based on self-play strategy
    Guan, Xiaoshu
    Sun, Huabin
    Hou, Rongrong
    Xu, Yang
    Bao, Yuequan
    Li, Hui
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2023, 233
  • [29] Hierarchical reinforcement learning from competitive self-play for dual-aircraft formation air combat
    Kong, Wei-ren
    Zhou, De-yun
    Zhou, Ying
    Zhao, Yi-yang
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2023, 10 (02) : 830 - 859
  • [30] Air combat intelligent decision-making method based on self-play and deep reinforcement learning
    Shan, Shengzhe
    Zhang, Weiwei
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (04):