Building Quantitative Structure-Activity Relationship Models Using Bayesian Additive Regression Trees

被引:6
|
作者
Feng, Dai [1 ]
Syetnik, Vladimir [1 ]
Liaw, Andy [1 ]
Pratola, Matthew [2 ]
Sheridan, Robert P. [3 ]
机构
[1] Merck & Co Inc, Biomet Res, Kenilworth, NJ 07033 USA
[2] Ohio State Univ, Dept Stat, Cockins Hall,1958 Neil Ave, Columbus, OH 43210 USA
[3] Merck & Co Inc, Modeling & Informat, Kenilworth, NJ 07033 USA
基金
美国国家科学基金会;
关键词
COMPOUND CLASSIFICATION; CART; TOOL;
D O I
10.1021/acs.jcim.9b00094
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Quantitative structure-activity relationship (QSAR) is a very commonly used technique for predicting the biological activity of a molecule using information contained in the molecular descriptors. The large number of compounds and descriptors and the sparseness of descriptors pose important challenges to traditional statistical methods and machine learning (ML) algorithms (such as random forest (RF)) used in this field. Recently, Bayesian Additive Regression Trees (BART), a flexible Bayesian nonparametric regression approach, has been demonstrated to be competitive with widely used ML approaches. Instead of only focusing on accurate point estimation, BART is formulated entirely in a hierarchical Bayesian modeling framework, allowing one to also quantify uncertainties and hence to provide both point and interval estimation for a variety of quantities of interest. We studied BART as a model builder for QSAR and demonstrated that the approach tends to have predictive performance comparable to RF. More importantly, we investigated BARTs natural capability to analyze truncated (or qualified) data, generate interval estimates for molecular activities as well as descriptor importance, and conduct model diagnosis, which could not be easily handled through other approaches.
引用
收藏
页码:2642 / 2655
页数:14
相关论文
共 50 条
  • [1] Quantitative structure-activity relationship models for compounds with anticonvulsant activity
    Bellera, Carolina L.
    Talevi, Alan
    EXPERT OPINION ON DRUG DISCOVERY, 2019, 14 (07) : 653 - 665
  • [2] Quantitative Structure-activity Relationship Models of Monomer Reactivity
    禹新良
    易翔
    杨辉琼
    ChineseJournalofStructuralChemistry, 2019, 38 (11) : 1867 - 1873
  • [3] Quantitative structure-activity relationship models for bee toxicity
    Toropov, Andrey A.
    Toropova, Alla P.
    Como, Francesca
    Benfenati, Emilio
    TOXICOLOGICAL AND ENVIRONMENTAL CHEMISTRY, 2017, 99 (7-8): : 1117 - 1128
  • [4] Advances in quantitative structure-activity relationship models of antioxidants
    Roy, Kunal
    Mitra, Indrani
    EXPERT OPINION ON DRUG DISCOVERY, 2009, 4 (11) : 1157 - 1175
  • [5] On the interpretation and interpretability of quantitative structure-activity relationship models
    Guha, Rajarshi
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2008, 22 (12) : 857 - 871
  • [6] Quantitative Structure-activity Relationship Models of Monomer Reactivity
    Yu Xin-Liang
    Yi Xiang
    Yang Hui-Qiong
    CHINESE JOURNAL OF STRUCTURAL CHEMISTRY, 2019, 38 (11) : 1867 - 1873
  • [7] Advances in quantitative structure-activity relationship models of antimalarials
    Roy, Kunal
    Ojha, Probir Kumar
    EXPERT OPINION ON DRUG DISCOVERY, 2010, 5 (08) : 751 - 778
  • [8] Bayesian Additive Regression Trees using Bayesian model averaging
    Belinda Hernández
    Adrian E. Raftery
    Stephen R Pennington
    Andrew C. Parnell
    Statistics and Computing, 2018, 28 : 869 - 890
  • [9] Bayesian Additive Regression Trees using Bayesian model averaging
    Hernandez, Belinda
    Raftery, Adrian E.
    Pennington, Stephen R.
    Parnell, Andrew C.
    STATISTICS AND COMPUTING, 2018, 28 (04) : 869 - 890
  • [10] Quantitative Structure-activity Relationship (QSAR) Modelling of Indomethacin Derivatives using Regression Analysis
    Shukla, Neerja
    Sharma, Bechan
    CURRENT MEDICINAL CHEMISTRY, 2024, 31 (40) : 6722 - 6732