ConvPred: A deep learning-based framework for predictions of potential organic reactions

被引:6
|
作者
Wang, Wenlong [1 ]
Liu, Qilei [1 ,3 ]
Dong, Yachao [1 ]
Du, Jian [1 ]
Meng, Qingwei [1 ,2 ]
Zhang, Lei [1 ,3 ]
机构
[1] Dalian Univ Technol, Sch Chem Engn, Inst Chem Proc Syst Engn, Frontiers Sci Ctr Smart Mat Oriented Chem Engn,Sta, Dalian, Peoples R China
[2] Dalian Univ Technol, Ningbo Res Inst, Ningbo, Peoples R China
[3] Dalian Univ Technol, Sch Chem Engn, Inst Chem Proc Syst Engn, Frontiers Sci Ctr Smart Mat Oriented Chem Engn,Sta, Dalian 116024, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
deep learning; molecular fingerprint; reaction fingerprint; reaction prediction; MODEL;
D O I
10.1002/aic.18019
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
The traditional theory-based technologies on reaction prediction are not efficient due to their heavy dependence on human expertise and experience. To this end, this article proposes a framework for predictions of potential organic reactions based on reaction templates and two-dimensional convolutional neural network (2D CNN) model. The quantum mechanics-based sigma-profiles and the sub-molecular structure-based ECFP4 are used individually to encode chemical reactions. Using 605,753 patented reactions extracted from the USPTO 1976-2016 database and their generated counterparts, the 2D CNN models are trained to evaluate the likelihood of molecular transformations by learning the feature differences between reactants and products. The classification accuracies of the sigma-profiles-based model and the ECFP4-based model for the non-trained reactions are 97.881 and 99.593%. Challenging reactions from literature involving identification of chemo-, stereo-, and regio-selectivity are correctly predicted. Furthermore, a sigma-profiles-based visual reaction fingerprint is introduced to provide novel insights into the model interpretability.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Deep Learning-Based Multiparametric Predictions for IoT
    Ateeq, Muhammad
    Afzal, Muhammad Khalil
    Naeem, Muhammad
    Shafiq, Muhammad
    Choi, Jin-Ghoo
    SUSTAINABILITY, 2020, 12 (18)
  • [2] Deep learning-based uncertainty quantification of groundwater level predictions
    Nourani, Vahid
    Khodkar, Kasra
    Paknezhad, Nardin Jabbarian
    Laux, Patrick
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2022, 36 (10) : 3081 - 3107
  • [3] Deep Learning-Based Increment Theory for Formation Enthalpy Predictions
    Chen, Lung-Yi
    Hsu, Ting-Wei
    Hsiung, Tsai-Chen
    Li, Yi-Pei
    JOURNAL OF PHYSICAL CHEMISTRY A, 2022, 126 (41): : 7548 - 7556
  • [4] Making Deep Learning-Based Predictions for Credit Scoring Explainable
    Dastile, Xolani
    Celik, Turgay
    IEEE ACCESS, 2021, 9 : 50426 - 50440
  • [5] Deep learning-based uncertainty quantification of groundwater level predictions
    Vahid Nourani
    Kasra Khodkar
    Nardin Jabbarian Paknezhad
    Patrick Laux
    Stochastic Environmental Research and Risk Assessment, 2022, 36 : 3081 - 3107
  • [6] Graph Neural Tree: A novel and interpretable deep learning-based framework for accurate molecular property predictions
    Zhan, Haolin
    Zhu, Xin
    Qiao, Zhiwei
    Hu, Jianming
    ANALYTICA CHIMICA ACTA, 2023, 1244
  • [7] A Deep Learning-Based Framework for Retinal Disease Classification
    Choudhary, Amit
    Ahlawat, Savita
    Urooj, Shabana
    Pathak, Nitish
    Lay-Ekuakille, Aime
    Sharma, Neelam
    HEALTHCARE, 2023, 11 (02)
  • [8] A Deep Reinforcement Learning-Based Framework for Content Caching
    Zhong, Chen
    Gursoy, M. Cenk
    Velipasalar, Senem
    2018 52ND ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2018,
  • [9] A deep learning-based framework for predicting pork preference
    Ko, Eunyoung
    Jeong, Kyungchang
    Oh, Hongseok
    Park, Yunhwan
    Choi, Jungseok
    Lee, Euijong
    CURRENT RESEARCH IN FOOD SCIENCE, 2023, 6
  • [10] Deep Learning-Based Cellular Random Access Framework
    Jang, Han Seung
    Lee, Hoon
    Quek, Tony Q. S.
    Shin, Hyundong
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (11) : 7503 - 7518