Treat Molecular Linear Notations as Sentences: Accurate Quantitative Structure-Property Relationship Modeling via a Natural Language Processing Approach

被引:7
|
作者
Zhou, Zhengtao [1 ]
Eden, Mario [2 ]
Shen, Weifeng [1 ]
机构
[1] Chongqing Univ, Sch Chem & Chem Engn, Chongqing 400044, Peoples R China
[2] Auburn Univ, Dept Chem Engn, Auburn, AL 36849 USA
基金
中国国家自然科学基金;
关键词
WATER PARTITION-COEFFICIENTS; ORGANIC-COMPOUNDS; DRUG DISCOVERY; PREDICTION; SMILES; QSPRS;
D O I
10.1021/acs.iecr.2c04070
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Quantitative structure-property relationship (QSPR) modeling is an implementation for estimating molecular properties based on structural information, which is widely applied in exploring new solvents, pharmaceuticals, and materials with desired properties. In QSPR modeling, "simplified molecular input line-entry system " (SMILES) is a popular molecular representation with specific vocabulary and syntax. Herein, SMILES is considered a chemical language, and each SMILES notation is treated as a sentence. A deep pyramid convolutional neural network architecture is constructed for extracting the information from SMILES "sentences ", and the feed-forward neural network is used for the property correlation. A case study of predicting the logarithm values of the octanol-water partition coefficient is conducted to prove the effectiveness of the proposed philosophy. Compared with a precedent reference model, the outperformance of the developed QSPR models provides fascinating insights for applying natural language processing technologies for molecular information mining and exploration of chemical property space.
引用
收藏
页码:5336 / 5346
页数:11
相关论文
共 50 条
  • [31] Quantitative structure-property relationship modeling of small organic molecules for solar cells applications
    Tortorella, Sara
    De Angelis, Filippo
    Cruciani, Gabriele
    JOURNAL OF CHEMOMETRICS, 2018, 32 (02)
  • [32] Systematic diesel molecular performance evaluation based on quantitative structure-property relationship model
    Guang-Qing Cai
    Lin-Zhou Zhang
    Petroleum Science, 2022, 19 (02) : 809 - 818
  • [33] Role of molecular descriptors in quantitative structure-property relationship analysis of kidney cancer therapeutics
    Mahboob, Abid
    Rasheed, Muhammad Waheed
    Hanif, Iqra
    Amin, Laiba
    Alameri, Abdu
    INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 2024, 124 (01)
  • [34] Estimation of molecular diffusivity of pure chemicals in water: a quantitative structure-property relationship study
    Gharagheizi, F.
    Sattari, M.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2009, 20 (3-4) : 267 - 285
  • [35] Systematic diesel molecular performance evaluation based on quantitative structure-property relationship model
    Cai, Guang-Qing
    Zhang, Lin-Zhou
    PETROLEUM SCIENCE, 2022, 19 (02) : 809 - 818
  • [36] Systematic diesel molecular performance evaluation based on quantitative structure-property relationship model
    Guang-Qing Cai
    Lin-Zhou Zhang
    Petroleum Science, 2022, (02) : 809 - 818
  • [37] Quantitative Structure-Property Relationship (QSPR) Modeling of Drug-Loaded Polymeric Micelles via Genetic Function Approximation
    Wu, Wensheng
    Zhang, Canyang
    Lin, Wenjing
    Chen, Quan
    Guo, Xindong
    Qian, Yu
    Zhang, Lijuan
    PLOS ONE, 2015, 10 (03):
  • [38] Estimation of Aniline Point Temperature of Pure Hydrocarbons: A Quantitative Structure-Property Relationship Approach
    Gharagheizi, Farhad
    Tirandazi, Behnarn
    Barzin, Reza
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2009, 48 (03) : 1678 - 1682
  • [39] Quantitative structure-property relationship approach to predicting xylene separation with diverse exchanged faujasites
    Khabzina, Y.
    Laroche, C.
    Perez-Pellitero, J.
    Farrusseng, D.
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2018, 20 (36) : 23773 - 23782
  • [40] A structure-property relationship and quantitative approach to the development of universal transdermal drug delivery system
    Feldstein, MM
    Platé, NA
    NBC RISKS: CURRENT CAPABILITIES AND FUTURE PERSPECTIVES FOR PROTECTION, 1999, 25 : 441 - 458