Building Quantitative Structure-Activity Relationship Models Using Bayesian Additive Regression Trees

被引:6
|
作者
Feng, Dai [1 ]
Syetnik, Vladimir [1 ]
Liaw, Andy [1 ]
Pratola, Matthew [2 ]
Sheridan, Robert P. [3 ]
机构
[1] Merck & Co Inc, Biomet Res, Kenilworth, NJ 07033 USA
[2] Ohio State Univ, Dept Stat, Cockins Hall,1958 Neil Ave, Columbus, OH 43210 USA
[3] Merck & Co Inc, Modeling & Informat, Kenilworth, NJ 07033 USA
基金
美国国家科学基金会;
关键词
COMPOUND CLASSIFICATION; CART; TOOL;
D O I
10.1021/acs.jcim.9b00094
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Quantitative structure-activity relationship (QSAR) is a very commonly used technique for predicting the biological activity of a molecule using information contained in the molecular descriptors. The large number of compounds and descriptors and the sparseness of descriptors pose important challenges to traditional statistical methods and machine learning (ML) algorithms (such as random forest (RF)) used in this field. Recently, Bayesian Additive Regression Trees (BART), a flexible Bayesian nonparametric regression approach, has been demonstrated to be competitive with widely used ML approaches. Instead of only focusing on accurate point estimation, BART is formulated entirely in a hierarchical Bayesian modeling framework, allowing one to also quantify uncertainties and hence to provide both point and interval estimation for a variety of quantities of interest. We studied BART as a model builder for QSAR and demonstrated that the approach tends to have predictive performance comparable to RF. More importantly, we investigated BARTs natural capability to analyze truncated (or qualified) data, generate interval estimates for molecular activities as well as descriptor importance, and conduct model diagnosis, which could not be easily handled through other approaches.
引用
收藏
页码:2642 / 2655
页数:14
相关论文
共 50 条
  • [41] Quantum chemical descriptors in quantitative structure-activity relationship models and their applications
    Liangliang, Wang
    Junjie, Ding
    Li, Pan
    Dongsheng, Cao
    Hui, Jiang
    Xiaoqin, Ding
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 217
  • [42] Recent advances in quantitative structure-activity relationship models of antimalarial drugs
    Ojha, Probir Kumar
    Kumar, Vinay
    Roy, Joyita
    Roy, Kunal
    EXPERT OPINION ON DRUG DISCOVERY, 2021, 16 (06) : 659 - 695
  • [43] In silico human and rat Vss quantitative structure-activity relationship models
    Gleeson, MP
    Waters, NJ
    Paine, SW
    Davis, AM
    JOURNAL OF MEDICINAL CHEMISTRY, 2006, 49 (06) : 1953 - 1963
  • [44] Generalizability Improvement of Interpretable Symbolic Regression Models for Quantitative Structure-Activity Relationships
    Shirasawa, Raku
    Takaki, Katsushi
    Miyao, Tomoyuki
    ACS OMEGA, 2024, 9 (08): : 9463 - 9474
  • [45] Quantitative Structure-Activity Relationship Analysis of Antimalarial Compound of Mangostin Derivatives Using Regression Linear Approach
    Hadanu, Ruslin
    Syamsudin
    ASIAN JOURNAL OF CHEMISTRY, 2013, 25 (11) : 6136 - 6140
  • [46] Carbon fiber adsorption using quantitative structure-activity relationship
    Qi, SY
    Hay, KJ
    Rood, MJ
    Cal, MP
    JOURNAL OF ENVIRONMENTAL ENGINEERING-ASCE, 2000, 126 (09): : 865 - 868
  • [47] Investigation and prediction of protein precipitation by polyethylene glycol using quantitative structure-activity relationship models
    Haemmerling, Frank
    Effio, Christopher Ladd
    Andris, Sebastian
    Kittelmann, Joerg
    Hubbuch, Juergen
    JOURNAL OF BIOTECHNOLOGY, 2017, 241 : 87 - 97
  • [48] Quantitative weight of evidence method for combining predictions of quantitative structure-activity relationship models
    Tinto-Moliner, A.
    Martin, M.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2020, 31 (04) : 261 - 279
  • [49] Improving quantitative structure-activity relationship models using Artificial Neural Networks trained with dropout
    Mendenhall, Jeffrey
    Meiler, Jens
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2016, 30 (02) : 177 - 189
  • [50] Quantitative structure-activity relationship (QSAR) models for predicting the estrogenic activity of xenoestrogens.
    Yu, SJ
    Welsh, WJ
    Chen, Y
    Tong, W
    Perkins, R
    Sheehan, D
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1999, 217 : U675 - U676