Linear and nonlinear methods in modeling the aqueous solubility of organic compounds

被引:39
|
作者
Catana, C
Gao, H
Orrenius, C
Stouten, PFW
机构
[1] CADD, Pfizer Global Res & Dev, Ann Arbor Labs, Ann Arbor, MI 48105 USA
[2] CADD, Pfizer Global Res & Dev, Groton Labs, Groton, CT 06340 USA
[3] Nerviano Med Sci, I-20014 Nerviano, MI, Italy
关键词
D O I
10.1021/ci049797u
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Solubility data for 930 diverse compounds have been analyzed using linear Partial Least Square (PLS) and nonlinear PLS methods, Continuum Regression (CR), and Neural Networks (NN). 1D and 2D descriptors from MOE package in combination with E-state or ISIS keys have been used. The best model was obtained using linear PLS for a combination between 22 MOE descriptors and 65 ISIS keys. It has a correlation coefficient (r(2)) of 0.935 and a root-mean-square error (RMSE) of 0.468 log molar solubility (log S,). The model validated on a test set of 177 compounds not included in the training set has r(2) 0.911 and RMSE 0.475 log S-w. The descriptors were ranked according to their importance, and at the top of the list have been found the 22 MOE descriptors. The CR model produced results as good as PLS, and because of the way in which cross-validation has been done it is expected to be a valuable tool in prediction besides PLS model. The statistics obtained using nonlinear methods did not surpass those got with linear ones. The good statistic obtained for linear PLS and CR recommends these models to be used in prediction when it is difficult or impossible to make experimental measurements, for virtual screening, combinatorial library design, and efficient leads optimization.
引用
收藏
页码:170 / 176
页数:7
相关论文
共 50 条
  • [1] Linear and nonlinear functions on modeling of aqueous solubility of organic compounds by two structure representation methods
    Aixia Yan
    Johann Gasteiger
    Michael Krug
    Soheila Anzali
    Journal of Computer-Aided Molecular Design, 2004, 18 : 75 - 87
  • [2] Linear and nonlinear functions on modeling of aqueous solubility of organic compounds by two structure representation methods
    Yan, AX
    Gasteiger, J
    Krug, M
    Anzali, S
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2004, 18 (02) : 75 - 87
  • [3] Modeling of Gibbs energy of formation of organic compounds by linear and nonlinear methods
    Yan, Aixia
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (06) : 2299 - 2304
  • [4] Versatile in silico modeling of partition coefficients of organic compounds in polydimethylsiloxane using linear and nonlinear methods
    Zhu, Tengyi
    Chen, Wenxuan
    Singh, Rajendra Prasad
    Cui, Yanran
    JOURNAL OF HAZARDOUS MATERIALS, 2020, 399
  • [5] PREDICTION OF AQUEOUS SOLUBILITY OF ORGANIC-COMPOUNDS
    NELSON, TM
    JURS, PC
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (03): : 601 - 609
  • [6] CALCULATION OF AQUEOUS SOLUBILITY OF ORGANIC-COMPOUNDS
    TEWARI, YB
    MILLER, MM
    WASIK, SP
    JOURNAL OF RESEARCH OF THE NATIONAL BUREAU OF STANDARDS, 1982, 87 (02): : 155 - 158
  • [7] Modeling of linear and nonlinear quantitative structure property relationships of the aqueous solubility of phenol derivatives
    Kherouf, Soumaya
    Bouarra, Nabil
    Bouakkadia, Amel
    Messadi, Djelloul
    JOURNAL OF THE SERBIAN CHEMICAL SOCIETY, 2019, 84 (06) : 575 - 590
  • [8] Prediction of aqueous solubility of organic compounds by the general solubility equation (GSE)
    Ran, YQ
    Jain, N
    Yalkowsky, SH
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (05): : 1208 - 1217
  • [9] Estimation of aqueous solubility of organic compounds by using the general solubility equation
    Ran, YQ
    He, Y
    Yang, G
    Johnson, JLH
    Yalkowsky, SH
    CHEMOSPHERE, 2002, 48 (05) : 487 - 509
  • [10] Estimation of aqueous solubility of organic compounds with QSPR approach
    Gao, H
    Shanmugasundaram, V
    Lee, P
    PHARMACEUTICAL RESEARCH, 2002, 19 (04) : 497 - 503