Treat Molecular Linear Notations as Sentences: Accurate Quantitative Structure-Property Relationship Modeling via a Natural Language Processing Approach

被引:7
|
作者
Zhou, Zhengtao [1 ]
Eden, Mario [2 ]
Shen, Weifeng [1 ]
机构
[1] Chongqing Univ, Sch Chem & Chem Engn, Chongqing 400044, Peoples R China
[2] Auburn Univ, Dept Chem Engn, Auburn, AL 36849 USA
基金
中国国家自然科学基金;
关键词
WATER PARTITION-COEFFICIENTS; ORGANIC-COMPOUNDS; DRUG DISCOVERY; PREDICTION; SMILES; QSPRS;
D O I
10.1021/acs.iecr.2c04070
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Quantitative structure-property relationship (QSPR) modeling is an implementation for estimating molecular properties based on structural information, which is widely applied in exploring new solvents, pharmaceuticals, and materials with desired properties. In QSPR modeling, "simplified molecular input line-entry system " (SMILES) is a popular molecular representation with specific vocabulary and syntax. Herein, SMILES is considered a chemical language, and each SMILES notation is treated as a sentence. A deep pyramid convolutional neural network architecture is constructed for extracting the information from SMILES "sentences ", and the feed-forward neural network is used for the property correlation. A case study of predicting the logarithm values of the octanol-water partition coefficient is conducted to prove the effectiveness of the proposed philosophy. Compared with a precedent reference model, the outperformance of the developed QSPR models provides fascinating insights for applying natural language processing technologies for molecular information mining and exploration of chemical property space.
引用
收藏
页码:5336 / 5346
页数:11
相关论文
共 50 条
  • [41] Effectiveness of surface tension reduction by nonionic surfactants with quantitative structure-property relationship approach
    Wang, ZW
    Feng, JL
    Wang, HJ
    Cui, ZG
    Li, GZ
    JOURNAL OF DISPERSION SCIENCE AND TECHNOLOGY, 2005, 26 (04) : 441 - 447
  • [42] Multi-objective Modeling and Assessment of Partition Properties: A GA-Based Quantitative Structure-Property Relationship Approach
    印春生
    刘新会
    郭卫民
    刘树深
    韩朔睽
    王连生
    Chinese Journal of Chemistry, 2003, (09) : 1150 - 1158
  • [43] Accurate quantitative structure-property relationship analysis for prediction of nematic transition temperatures in thermotropic liquid crystals
    Xu, Jie
    Wang, Luoxin
    Zhang, Hui
    Yi, Changhai
    Xu, Weilin
    MOLECULAR SIMULATION, 2010, 36 (01) : 26 - 34
  • [44] Benchmarking of linear and nonlinear approaches for quantitative structure-property relationship studies of metal complexation with ionophores
    Tetko, IV
    Solov'ev, VP
    Antonov, AV
    Yao, XJ
    Doucet, JP
    Fan, BT
    Hoonakker, F
    Fourches, D
    Jost, P
    Lachiche, N
    Varnek, A
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) : 808 - 819
  • [45] A "non-linear" quantitative structure-property relationship for the prediction of electrical conductivity of ionic liquids
    Gharagheizi, Farhad
    Sattari, Mehdi
    Ilani-Kashkouli, Poorandokht
    Mohammadi, Amir H.
    Ramjugernath, Deresh
    Richon, Dominique
    CHEMICAL ENGINEERING SCIENCE, 2013, 101 : 478 - 485
  • [46] A natural language processing approach based on embedding deep learning from heterogeneous compounds for quantitative structure-activity relationship modeling
    Bouhedjar, Khalid
    Boukelia, Abdelbasset
    Khorief Nacereddine, Abdelmalek
    Boucheham, Anouar
    Belaidi, Amine
    Djerourou, Abdelhafid
    CHEMICAL BIOLOGY & DRUG DESIGN, 2020, 96 (03) : 961 - 972
  • [47] A Quantitative Structure-Property Relationship (QSPR) Study of Aliphatic Alcohols by the Method of Dividing the Molecular Structure into Substructure
    Liu, Fengping
    Cao, Chenzhong
    Cheng, Bin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2011, 12 (04): : 2448 - 2462
  • [48] Modeling of the physicochemical properties of aliphatic alcohols using topological indices and quantitative structure-property relationship
    Arjmand, F.
    Shafiei, F.
    BULGARIAN CHEMICAL COMMUNICATIONS, 2017, 49 (04): : 852 - 858
  • [49] Quantitative structure-property relationship of extraction equilibria of lanthanoid series using molecular mechanics calculations
    Yoshizuka, K
    Inoue, K
    Comba, P
    KAGAKU KOGAKU RONBUNSHU, 2000, 26 (04) : 517 - 522
  • [50] A study of novel molecular descriptors and quantitative structure-property relationship analysis of blood cancer drugs
    Mahboob, Abid
    Rasheed, Muhammad Waheed
    Amin, Laiba
    Hanif, Iqra
    EUROPEAN PHYSICAL JOURNAL PLUS, 2023, 138 (09):