Kernel-Based Partial Least Squares: Application to Fingerprint-Based QSAR with Model Visualization

被引:50
|
作者
An, Yuling [1 ]
Sherman, Woody [1 ]
Dixon, Steven L. [1 ]
机构
[1] Schrodinger Inc, New York, NY 10036 USA
关键词
METHIONINE AMINOPEPTIDASE-2; APPLICABILITY DOMAIN; INHIBITORS; POTENT; DESIGN; OPTIMIZATION; DESCRIPTORS; SIMILARITY; PREDICTION; REDUCTION;
D O I
10.1021/ci400250c
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Numerous regression-based and machine learning techniques are available for the development of linear and nonlinear QSAR models that can accurately predict biological endpoints. Such tools can be quite powerful in the hands of an experienced modeler, but too frequently a disconnect remains between the modeler and project chemist because the resulting QSAR models are effectively black boxes. As a result, learning methods that yield models that can be visualized in the context of chemical structures are in high demand. In this work, we combine direct kernel-based PLS with Canvas 2D fingerprints to arrive at predictive QSAR models that can be projected onto the atoms of a chemical structure, allowing immediate identification of favorable and unfavorable characteristics. The method is validated using binding affinities for ligands from 10 different protein targets covering 7 distinct protein families. Models with significant predictive ability (test set Q(2) > 0.5) are obtained for 6 of 10 data sets, and fingerprints are shown to consistently outperform large collections of classical physicochemical and topological descriptors. In addition, we demonstrate how a simple bootstrapping technique may be employed to obtain uncertainties that provide meaningful estimates of prediction accuracy.
引用
收藏
页码:2312 / 2321
页数:10
相关论文
共 50 条
  • [41] Nonlinear multivariate quality estimation and prediction based on kernel partial least squares
    Zhang, Xi
    Yan, Weiwu
    Shao, Huihe
    [J]. INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2008, 47 (04) : 1120 - 1131
  • [42] Nonlinear Analysis for Motor Imagery EEG based Kernel Partial Least Squares
    Bao, Xuecai
    Mu, Zhendong
    Hu, Jianfeng
    [J]. ICIEA: 2009 4TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-6, 2009, : 2097 - 2100
  • [43] Self-Learning Cruise Control Using Kernel-Based Least Squares Policy Iteration
    Wang, Jian
    Xu, Xin
    Liu, Daxue
    Sun, Zhenping
    Chen, Qingyang
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2014, 22 (03) : 1078 - 1087
  • [44] When cannot regularization improve the least squares estimate in the kernel-based regularized system identification
    Mu, Biqiang
    Ljung, Lennart
    Chen, Tianshi
    [J]. AUTOMATICA, 2024, 160
  • [45] Kernel-based regression via a novel robust loss function and iteratively reweighted least squares
    Dong, Hongwei
    Yang, Liming
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (05) : 1149 - 1172
  • [46] Kernel-based regression via a novel robust loss function and iteratively reweighted least squares
    Hongwei Dong
    Liming Yang
    [J]. Knowledge and Information Systems, 2021, 63 : 1149 - 1172
  • [47] DeepRSSI: Generative Model for Fingerprint-Based Localization
    Yoon, Namkyung
    Jung, Wooyong
    Kim, Hwangnam
    [J]. IEEE ACCESS, 2024, 12 : 66196 - 66213
  • [48] A robust least squares fuzzy regression model based on kernel function
    Khammar, A. H.
    Arefi, M.
    Akbari, M. G.
    [J]. IRANIAN JOURNAL OF FUZZY SYSTEMS, 2020, 17 (04): : 105 - 119
  • [49] Model selection for partial least squares based dimension reduction
    Li, Guo-Zheng
    Zhao, Rui-Wei
    Qu, Hai-Ni
    You, Mingyu
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (05) : 524 - 529
  • [50] Compressed Multivariate Kernel Density Estimation for WiFi Fingerprint-based Localization
    Xu, Zhendong
    Huang, Baoqi
    Jia, Bing
    Li, Wuyungerile
    [J]. 2020 16TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2020), 2020, : 106 - 112