Exploring Chemical Reaction Space with Machine Learning Models: Representation and Feature Perspective

被引:3
|
作者
Ding, Yuheng [1 ]
Qiang, Bo [1 ]
Chen, Qixuan [1 ]
Liu, Yiqiao [1 ]
Zhang, Liangren [1 ]
Liu, Zhenming [1 ]
机构
[1] Peking Univ, Dept Pharmaceut, Beijing 100191, Peoples R China
基金
国家重点研发计划;
关键词
ACTIVATION-ENERGY; TRANSITION-STATES; SIMILARITY SEARCH; PROTEIN-STRUCTURE; YIELD PREDICTION; NEURAL-NETWORKS; DFT; RETROSYNTHESIS; PERFORMANCE; CONJUGATION;
D O I
10.1021/acs.jcim.4c00004
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Chemical reactions serve as foundational building blocks for organic chemistry and drug design. In the era of large AI models, data-driven approaches have emerged to innovate the design of novel reactions, optimize existing ones for higher yields, and discover new pathways for synthesizing chemical structures comprehensively. To effectively address these challenges with machine learning models, it is imperative to derive robust and informative representations or engage in feature engineering using extensive data sets of reactions. This work aims to provide a comprehensive review of established reaction featurization approaches, offering insights into the selection of representations and the design of features for a wide array of tasks. The advantages and limitations of employing SMILES, molecular fingerprints, molecular graphs, and physics-based properties are meticulously elaborated. Solutions to bridge the gap between different representations will also be critically evaluated. Additionally, we introduce a new frontier in chemical reaction pretraining, holding promise as an innovative yet unexplored avenue.
引用
收藏
页码:2955 / 2970
页数:16
相关论文
共 50 条
  • [21] Machine learning maps reaction space
    Lemonick, Sam
    [J]. CHEMICAL & ENGINEERING NEWS, 2021, 99 (05) : 6 - 6
  • [22] Exploring the Latent Chemical Space of Oxygen Vacancy Formation Energy by a Machine Learning Ensemble
    Park, Seulyoung
    Lee, Noki
    Park, Jun Oh
    Park, Jin
    Heo, Yu Seong
    Lee, Jaichan
    [J]. ACS MATERIALS LETTERS, 2023, 6 (01): : 66 - 72
  • [23] Exploring the chemical space of protein-protein interaction inhibitors through machine learning
    Choi, Jiwon
    Yun, Jun Seop
    Song, Hyeeun
    Kim, Nam Hee
    Kim, Hyun Sil
    Yook, Jong In
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [24] Mapping reaction space with machine learning
    Doyle, Abigail
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257
  • [25] Exploring catalytic reaction networks with machine learning
    Johannes T. Margraf
    Hyunwook Jung
    Christoph Scheurer
    Karsten Reuter
    [J]. Nature Catalysis, 2023, 6 : 112 - 121
  • [26] Exploring catalytic reaction networks with machine learning
    Margraf, Johannes T.
    Jung, Hyunwook
    Scheurer, Christoph
    Reuter, Karsten
    [J]. NATURE CATALYSIS, 2023, 6 (02) : 112 - 121
  • [27] Enhancing feature learning for chemical reaction prediction
    Baylon, Javier
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [28] Exploring the Relationship Between EMG Feature Space Characteristics and Control Performance in Machine Learning Myoelectric Control
    Franzke, Andreas W.
    Kristoffersen, Morten B.
    Jayaram, Vinay
    van der Sluis, Corry K.
    Murgia, Alessio
    Bongers, Raoul M.
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2021, 29 : 21 - 30
  • [29] Exploring the Relationship between EMG Feature Space Characteristics and Control Performance in Machine Learning Myoelectric Control
    Franzke, Andreas W.
    Kristoffersen, Morten B.
    Jayaram, Vinay
    Van Der Sluis, Corry K.
    Murgia, Alessio
    Bongers, Raoul M.
    [J]. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2021, 29 : 21 - 30
  • [30] Novel Feature Representation and Machine Learning Methods in Computational Proteomics
    Chen, Lei
    [J]. CURRENT PROTEOMICS, 2021, 18 (05) : 606 - 607