Exploring Chemical Reaction Space with Machine Learning Models: Representation and Feature Perspective

被引:3
|
作者
Ding, Yuheng [1 ]
Qiang, Bo [1 ]
Chen, Qixuan [1 ]
Liu, Yiqiao [1 ]
Zhang, Liangren [1 ]
Liu, Zhenming [1 ]
机构
[1] Peking Univ, Dept Pharmaceut, Beijing 100191, Peoples R China
基金
国家重点研发计划;
关键词
ACTIVATION-ENERGY; TRANSITION-STATES; SIMILARITY SEARCH; PROTEIN-STRUCTURE; YIELD PREDICTION; NEURAL-NETWORKS; DFT; RETROSYNTHESIS; PERFORMANCE; CONJUGATION;
D O I
10.1021/acs.jcim.4c00004
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Chemical reactions serve as foundational building blocks for organic chemistry and drug design. In the era of large AI models, data-driven approaches have emerged to innovate the design of novel reactions, optimize existing ones for higher yields, and discover new pathways for synthesizing chemical structures comprehensively. To effectively address these challenges with machine learning models, it is imperative to derive robust and informative representations or engage in feature engineering using extensive data sets of reactions. This work aims to provide a comprehensive review of established reaction featurization approaches, offering insights into the selection of representations and the design of features for a wide array of tasks. The advantages and limitations of employing SMILES, molecular fingerprints, molecular graphs, and physics-based properties are meticulously elaborated. Solutions to bridge the gap between different representations will also be critically evaluated. Additionally, we introduce a new frontier in chemical reaction pretraining, holding promise as an innovative yet unexplored avenue.
引用
收藏
页码:2955 / 2970
页数:16
相关论文
共 50 条
  • [1] Exploring Chemical Space with Machine Learning
    Arus-Pous, Josep
    Awale, Mahendra
    Probst, Daniel
    Reymond, Jean-Louis
    [J]. CHIMIA, 2019, 73 (12) : 1018 - 1023
  • [2] Machine learning in chemical reaction space
    Stocker, Sina
    Csanyi, Gabor
    Reuter, Karsten
    Margraf, Johannes T.
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [3] Machine learning in chemical reaction space
    Sina Stocker
    Gábor Csányi
    Karsten Reuter
    Johannes T. Margraf
    [J]. Nature Communications, 11
  • [4] Compression and machine learning: A new perspective on feature space vectors
    Sculley, D.
    Brodley, Carla E.
    [J]. DCC 2006: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2006, : 332 - +
  • [5] Learning to Make Chemical Predictions: The Interplay of Feature Representation, Data, and Machine Learning Methods
    Haghighatlari, Mojtaba
    Li, Jie
    Heidar-Zadeh, Farnaz
    Liu, Yuchen
    Guan, Xingyi
    Head-Gordon, Teresa
    [J]. CHEM, 2020, 6 (07): : 1527 - 1542
  • [6] Mapping transition metal chemical space with continuous descriptors - feature selection and implications for machine learning models
    Janet, Jon Paul
    Kulik, Heather
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [7] Exploring the chemical space of BRAF Inhibitors: A cheminformatic and Machine learning analysis
    Aouidate, Adnane
    [J]. JOURNAL OF MOLECULAR LIQUIDS, 2024, 401
  • [8] Exploring chemical space with computers: Informatics challenges for ai and machine learning
    Baldi, Pierre
    [J]. BIOMAT 2005, 2006, : 343 - 350
  • [9] Exploring chemical compound space with quantum-based machine learning
    von Lilienfeld, O. Anatole
    Mueller, Klaus-Robert
    Tkatchenko, Alexandre
    [J]. NATURE REVIEWS CHEMISTRY, 2020, 4 (07) : 347 - 358
  • [10] Exploring chemical compound space with quantum-based machine learning
    O. Anatole von Lilienfeld
    Klaus-Robert Müller
    Alexandre Tkatchenko
    [J]. Nature Reviews Chemistry, 2020, 4 : 347 - 358