Treat Molecular Linear Notations as Sentences: Accurate Quantitative Structure-Property Relationship Modeling via a Natural Language Processing Approach

Cited by: 7
Authors
Zhou, Zhengtao [1]
Eden, Mario [2]
Shen, Weifeng [1]
Affiliations
[1] Chongqing Univ, Sch Chem & Chem Engn, Chongqing 400044, Peoples R China
[2] Auburn Univ, Dept Chem Engn, Auburn, AL 36849 USA
Funding
National Natural Science Foundation of China;
Keywords
WATER PARTITION-COEFFICIENTS; ORGANIC-COMPOUNDS; DRUG DISCOVERY; PREDICTION; SMILES; QSPRS;
DOI
10.1021/acs.iecr.2c04070
Chinese Library Classification
TQ [Chemical Industry];
Discipline classification code
0817;
Abstract
Quantitative structure-property relationship (QSPR) modeling is an implementation for estimating molecular properties based on structural information, which is widely applied in exploring new solvents, pharmaceuticals, and materials with desired properties. In QSPR modeling, "simplified molecular input line-entry system" (SMILES) is a popular molecular representation with specific vocabulary and syntax. Herein, SMILES is considered a chemical language, and each SMILES notation is treated as a sentence. A deep pyramid convolutional neural network architecture is constructed for extracting the information from SMILES "sentences", and the feed-forward neural network is used for the property correlation. A case study of predicting the logarithm values of the octanol-water partition coefficient is conducted to prove the effectiveness of the proposed philosophy. Compared with a precedent reference model, the outperformance of the developed QSPR models provides fascinating insights for applying natural language processing technologies for molecular information mining and exploration of chemical property space.
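To make the "SMILES as a sentence" idea concrete, the sketch below splits a SMILES string into word-like tokens and maps them to padded integer ids, the kind of input a convolutional text model typically consumes. It is only an illustration: the token pattern, the function names (`tokenize_smiles`, `build_vocab`, `encode`), and the padding scheme are assumptions made here, not the tokenization used in the paper.

```python
import re

# Hypothetical token pattern (an assumption, not the paper's tokenizer):
# bracket atoms first, then two-letter halogens, two-digit ring closures,
# and finally any single remaining character.
_TOKEN_RE = re.compile(r"\[[^\]]+\]|Br|Cl|%\d{2}|.")


def tokenize_smiles(smiles):
    """Split a SMILES string into a list of 'word' tokens."""
    return _TOKEN_RE.findall(smiles)


def build_vocab(corpus):
    """Map each distinct token to an integer id; id 0 is reserved for padding."""
    vocab = {"<pad>": 0}
    for smiles in corpus:
        for token in tokenize_smiles(smiles):
            vocab.setdefault(token, len(vocab))
    return vocab


def encode(smiles, vocab, max_len):
    """Convert a SMILES string to a fixed-length list of token ids."""
    ids = [vocab[token] for token in tokenize_smiles(smiles)][:max_len]
    return ids + [0] * (max_len - len(ids))


# Toy corpus: ethanol, chlorobenzene, methylammonium.
corpus = ["CCO", "c1ccccc1Cl", "C[NH3+]"]
vocab = build_vocab(corpus)

print(tokenize_smiles("c1ccccc1Cl"))
print(encode("CCO", vocab, max_len=6))
```

Matching `Cl` and `Br` before the single-character fallback keeps two-letter element symbols together, which a plain character-level split would break into unrelated tokens.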
Pages: 5336-5346
Number of pages: 11