AlphaFold Meets De Novo Drug Design: Leveraging Structural Protein Information in Multitarget Molecular Generative Models

被引:0
|
作者
Bernatavicius, Andrius [1 ,2 ]
Šícho, Martin [1 ,3 ]
Janssen, Antonius P. A. [1 ,4 ]
Hassen, Alan Kai [2 ]
Preuss, Mike [2 ]
van Westen, Gerard J. P. [1 ]
机构
[1] Leiden Academic Centre for Drug Research, Leiden University, Einsteinweg 55, Leiden,2333CC, Netherlands
[2] Leiden Institute of Advanced Computer Science, Leiden University, Niels Bohrweg 1, Leiden,2333CA, Netherlands
[3] CZ-OPENSCREEN: National Infrastructure for Chemical Biology, Department of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, Technická 5, Prague,166 28, Czech Republic
[4] Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, Leiden,2333CC, Netherlands
基金
欧盟地平线“2020”;
关键词
Molecular docking;
D O I
10.1021/acs.jcim.4c00309
中图分类号
学科分类号
摘要
Recent advancements in deep learning and generative models have significantly expanded the applications of virtual screening for drug-like compounds. Here, we introduce a multitarget transformer model, PCMol, that leverages the latent protein embeddings derived from AlphaFold2 as a means of conditioning a de novo generative model on different targets. Incorporating rich protein representations allows the model to capture their structural relationships, enabling the chemical space interpolation of active compounds and target-side generalization to new proteins based on embedding similarities. In this work, we benchmark against other existing target-conditioned transformer models to illustrate the validity of using AlphaFold protein representations over raw amino acid sequences. We show that low-dimensional projections of these protein embeddings cluster appropriately based on target families and that model performance declines when these representations are intentionally corrupted. We also show that the PCMol model generates diverse, potentially active molecules for a wide array of proteins, including those with sparse ligand bioactivity data. The generated compounds display higher similarity known active ligands of held-out targets and have comparable molecular docking scores while maintaining novelty. Additionally, we demonstrate the important role of data augmentation in bolstering the performance of generative models in low-data regimes. Software package and AlphaFold protein embeddings are freely available at https://github.com/CDDLeiden/PCMol. © 2024 The Authors. Published by American Chemical Society.
引用
下载
收藏
页码:8113 / 8122
相关论文
共 50 条
  • [1] Generative Models for De Novo Drug Design
    Tong, Xiaochu
    Liu, Xiaohong
    Tan, Xiaoqin
    Li, Xutong
    Jiang, Jiaxin
    Xiong, Zhaoping
    Xu, Tingyang
    Jiang, Hualiang
    Qiao, Nan
    Zheng, Mingyue
    JOURNAL OF MEDICINAL CHEMISTRY, 2021, 64 (19) : 14011 - 14027
  • [2] De novo molecular design and generative models
    Meyers, Joshua
    Fabian, Benedek
    Brown, Nathan
    DRUG DISCOVERY TODAY, 2021, 26 (11) : 2707 - 2715
  • [3] De novo drug design with deep generative models
    Das, Payel
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [4] Leveraging molecular structure and bioactivity with chemical language models for de novo drug design
    Moret, Michael
    Pachon Angona, Irene
    Cotos, Leandro
    Yan, Shen
    Atz, Kenneth
    Brunner, Cyrill
    Baumgartner, Martin
    Grisoni, Francesca
    Schneider, Gisbert
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [5] Leveraging molecular structure and bioactivity with chemical language models for de novo drug design
    Michael Moret
    Irene Pachon Angona
    Leandro Cotos
    Shen Yan
    Kenneth Atz
    Cyrill Brunner
    Martin Baumgartner
    Francesca Grisoni
    Gisbert Schneider
    Nature Communications, 14
  • [6] Application progress of deep generative models in de novo drug design
    Liu, Yingxu
    Xu, Chengcheng
    Yang, Xinyi
    Zhang, Yanmin
    Chen, Yadong
    Liu, Haichun
    MOLECULAR DIVERSITY, 2024, : 2411 - 2427
  • [7] Molecular substructure tree generative model for de novo drug design
    Wang, Shuang
    Song, Tao
    Zhang, Shugang
    Jiang, Mingjian
    Wei, Zhiqiang
    Li, Zhen
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [8] De novo molecular design with deep molecular generative models for PPI inhibitors
    Wang, Jianmin
    Chu, Yanyi
    Mao, Jiashun
    Jeon, Hyeon-Nae
    Jin, Haiyan
    Zeb, Amir
    Jang, Yuil
    Cho, Kwang-Hwi
    Song, Tao
    No, Kyoung Tai
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (04)
  • [9] Attention-based generative models for de novo molecular design
    Dollar, Orion
    Joshi, Nisarg
    Beck, David A. C.
    Pfaendtner, Jim
    CHEMICAL SCIENCE, 2021, 12 (24) : 8362 - 8372
  • [10] De Novo Molecule Design Using Molecular Generative Models Constrained by Ligand-Protein Interactions
    Zhang, Jie
    Chen, Hongming
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (14) : 3291 - 3306