Chemical language models for molecular design

被引:3
|
作者
Bajorath, Juergen [1 ,2 ,3 ]
机构
[1] Rheinische Friedrich Wilhelms Univ Bonn, Bonn Aachen Int Ctr Informat Technol, Dept Life Sci Informat, Bonn, Germany
[2] Rheinische Friedrich Wilhelms Univ Bonn, Lamarr Inst Machine Learning & Artificial Intellig, Bonn, Germany
[3] Rheinische Friedrich Wilhelms Univ Bonn, Bonn Aachen Int Ctr Informat Technol, Dept Life Sci Informat, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
关键词
drug design; language models; recurrent neural networks; encoder-decoder frameworks; transformers; attention mechanisms; TRANSFORMER; DISCOVERY;
D O I
10.1002/minf.202300288
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
In drug discovery, chemical language models (CLMs) originating from natural language processing offer new opportunities for molecular design. CLMs have been developed using recurrent neural network (RNN) or transformer architectures. For the predictive performance of RNN-based encoder-decoder frameworks and transformers, attention mechanisms play a central role. Among others, emerging application areas for CLMs include constrained generative modeling and the prediction of chemical reactions or drug-target interactions. Since CLMs are applicable to any compound or target data that can be presented in a sequential format and tokenized, mappings of different types of sequences can be learned. For example, active compounds can be predicted from protein sequence motifs. Novel off-the-beat-path applications can also be considered. For example, analogue series from medicinal chemistry can be perceived and represented as chemical sequences and extended with new compounds using CLMs. Herein, methodological features of CLMs and different applications are discussed. image
引用
收藏
页数:7
相关论文
共 50 条
  • [21] MOLECULAR MODELING FOR CHEMICAL DESIGN
    LUTHER, D
    FISHER, RG
    SWANSON, E
    COMPUTER GRAPHICS WORLD, 1986, 9 (11) : 28 - 32
  • [22] Generative Models for Molecular Design
    Merz, Kenneth M., Jr.
    De Fabritiis, Gianni
    Wei, Guo-Wei
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (12) : 5635 - 5636
  • [23] Chemical models of hot molecular cores
    Millar, TJ
    Hatchell, J
    FARADAY DISCUSSIONS, 1998, 109 : 15 - 30
  • [24] Chemical models for molecular absorption features
    Gerin, M
    Roueff, E
    HIGHLY REDSHIFTED RADIO LINES, 1999, 156 : 196 - 201
  • [25] Chemical language models enable navigation in sparsely populated chemical space
    Skinnider, Michael A.
    Stacey, R. Greg
    Wishart, David S.
    Foster, Leonard J.
    NATURE MACHINE INTELLIGENCE, 2021, 3 (09) : 759 - +
  • [26] Chemical language models enable navigation in sparsely populated chemical space
    Michael A. Skinnider
    R. Greg Stacey
    David S. Wishart
    Leonard J. Foster
    Nature Machine Intelligence, 2021, 3 : 759 - 770
  • [27] A modeling language for design processes in chemical engineering
    Eggersmann, M
    Krobb, C
    Marquardt, W
    CONCEPTUAL MODELING ER 2000, PROCEEDINGS, 2000, 1920 : 369 - 382
  • [28] A LANGUAGE FOR CHEMICAL PLANT DESIGN AND SIMULATION PROGRAMS
    CAMERON, PT
    COMPUTER JOURNAL, 1969, 12 (01): : 24 - &
  • [29] Bayesian molecularL design with a chemical language model
    Ikebata, Hisaki
    Hongo, Kenta
    Isomura, Tetsu
    Maezono, Ryo
    Yoshida, Ryo
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2017, 31 (04) : 379 - 391
  • [30] The Rise and Design of Enterprise Large Language Models
    O'Leary, Daniel E.
    IEEE INTELLIGENT SYSTEMS, 2024, 39 (01) : 60 - 63