Treat Molecular Linear Notations as Sentences: Accurate Quantitative Structure-Property Relationship Modeling via a Natural Language Processing Approach

被引:7
|
作者
Zhou, Zhengtao [1 ]
Eden, Mario [2 ]
Shen, Weifeng [1 ]
机构
[1] Chongqing Univ, Sch Chem & Chem Engn, Chongqing 400044, Peoples R China
[2] Auburn Univ, Dept Chem Engn, Auburn, AL 36849 USA
基金
中国国家自然科学基金;
关键词
WATER PARTITION-COEFFICIENTS; ORGANIC-COMPOUNDS; DRUG DISCOVERY; PREDICTION; SMILES; QSPRS;
D O I
10.1021/acs.iecr.2c04070
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Quantitative structure-property relationship (QSPR) modeling is an implementation for estimating molecular properties based on structural information, which is widely applied in exploring new solvents, pharmaceuticals, and materials with desired properties. In QSPR modeling, "simplified molecular input line-entry system " (SMILES) is a popular molecular representation with specific vocabulary and syntax. Herein, SMILES is considered a chemical language, and each SMILES notation is treated as a sentence. A deep pyramid convolutional neural network architecture is constructed for extracting the information from SMILES "sentences ", and the feed-forward neural network is used for the property correlation. A case study of predicting the logarithm values of the octanol-water partition coefficient is conducted to prove the effectiveness of the proposed philosophy. Compared with a precedent reference model, the outperformance of the developed QSPR models provides fascinating insights for applying natural language processing technologies for molecular information mining and exploration of chemical property space.
引用
下载
收藏
页码:5336 / 5346
页数:11
相关论文
共 50 条
  • [1] Quantitative structure-property relationship of extraction behavior of sugars using molecular modeling
    Yoshizuka, K
    Matsumoto, M
    Kondo, K
    KAGAKU KOGAKU RONBUNSHU, 2006, 32 (01) : 6 - 10
  • [2] Modeling of the henry constant of a series of pesticides: Quantitative structure-property relationship approach
    Bouakkadia A.
    Driouche Y.
    Kertiou N.
    Messadi D.
    International Journal of Safety and Security Engineering, 2020, 10 (03) : 389 - 396
  • [3] Quantitative structure-property relationship modeling of skin sensitization: A quantitative prediction
    Golla, Sharath
    Madihally, Sundar
    Robinson, Robert L., Jr.
    Gasem, Khaled A. M.
    TOXICOLOGY IN VITRO, 2009, 23 (03) : 454 - 465
  • [4] Modeling of Photooxidative Degradation of Aromatics in Water Matrix: A Quantitative Structure-Property Relationship Approach
    Rasulev B.
    Božić A.L.
    Dionysiou D.D.
    Kušić H.
    ACS Symposium Series, 2019, 1331 : 257 - 292
  • [5] Quantitative Structure-Property Relationship Modeling of Diverse Materials Properties
    Le, Tu
    Epa, V. Chandana
    Burden, Frank R.
    Winkler, David A.
    CHEMICAL REVIEWS, 2012, 112 (05) : 2889 - 2919
  • [6] A quantitative structure-property relationship approach to determine the essential molecular functionalities of potent odorants
    Pal, Pallabi
    Mitra, Indrani
    Roy, Kunal
    FLAVOUR AND FRAGRANCE JOURNAL, 2014, 29 (03) : 157 - 165
  • [7] Quantitative Structure-Property Relationship Approach in Formulation Development: an Overview
    Kulkarni, Ajit S.
    Kasabe, Amit J.
    Bhatia, Manish S.
    Bhatia, Neela M.
    Gaikwad, Vinod L.
    AAPS PHARMSCITECH, 2019, 20 (07)
  • [8] Linear and nonlinear quantitative structure-property relationship modelling of skin permeability
    Khajeh, A.
    Modarress, H.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2014, 25 (01) : 35 - 50
  • [9] Quantitative Structure-Property Relationship Modeling of Gratzel Solar Cell Dyes
    Venkatraman, Vishwesh
    Astrand, Per-Olof
    Alsberg, Bjorn Kare
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2014, 35 (03) : 214 - 226
  • [10] Quantitative structure-property relationship modeling of phosphoric polyester char formation
    Crisan, Luminita
    Iliescu, Smaranda
    Ilia, Gheorghe
    Funar-Timofei, Simona
    FIRE AND MATERIALS, 2019, 43 (01) : 101 - 109