Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery

Cited by: 104
Authors
Bosc, Nicolas [1]
Atkinson, Francis [1]
Felix, Eloy [1]
Gaulton, Anna [1]
Hersey, Anne [1]
Leach, Andrew R. [1]
Affiliations
[1] European Bioinformat Inst EMBL EBI, Chemogen Team, Wellcome Genome Campus, Cambridge CB10 1SD, England
Funding
Wellcome Trust (UK); EU Horizon 2020;
Keywords
QSAR; Mondrian conformal prediction; ChEMBL; Classification models; Cheminformatics; APPLICABILITY DOMAIN; CLASSIFICATION; DATABASE; CHEMICALS; DESIGN;
DOI
10.1186/s13321-018-0325-4
Chinese Library Classification (CLC) number
O6 [Chemistry];
Subject classification code
0703;
Abstract
Structure-activity relationship modelling is frequently used in the early stages of drug discovery to assess the activity of a compound against one or several targets, and can also be used to assess the interaction of compounds with liability targets. QSAR models have been used successfully for these and related applications for many years. Conformal prediction is a relatively new QSAR approach that provides information on the certainty of a prediction, and so helps in decision-making. However, it is not always clear how best to make use of this additional information. In this article, we describe a case study that directly compares conformal prediction with traditional QSAR methods for large-scale predictions of target-ligand binding. The ChEMBL database was used to extract a data set comprising data from 550 human protein targets with different bioactivity profiles. For each target, a QSAR model and a conformal predictor were trained and their results compared. To simulate a real-world application, the models were then evaluated on new data published after the original models were built. The comparative study highlights the similarities between the two techniques, but also some differences that are important to bear in mind when the methods are used in practical drug discovery applications.
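As an illustration of how a conformal predictor attaches certainty information to a prediction, the following is a minimal sketch of the class-conditional (Mondrian) p-value calculation named in the keywords. It is not the authors' implementation. The function names, the toy calibration scores, and the choice of nonconformity score (for example, one minus the model's predicted probability for the class) are all assumptions made for this example.

```python
from collections import defaultdict


def mondrian_p_values(cal_scores, cal_labels, test_scores):
    """Class-conditional (Mondrian) conformal p-values for one test compound.

    cal_scores  : nonconformity score of each calibration compound,
                  computed for its true class (hypothetical values here)
    cal_labels  : true class of each calibration compound
    test_scores : dict mapping each candidate class to the test compound's
                  nonconformity score under that class
    """
    # Group calibration scores by class so each class is calibrated separately.
    by_class = defaultdict(list)
    for score, label in zip(cal_scores, cal_labels):
        by_class[label].append(score)

    p_values = {}
    for label, alpha in test_scores.items():
        scores = by_class[label]
        # Count calibration compounds of this class at least as nonconforming.
        n_geq = sum(1 for s in scores if s >= alpha)
        p_values[label] = (n_geq + 1) / (len(scores) + 1)
    return p_values


def prediction_set(p_values, significance):
    """Keep every class whose p-value exceeds the chosen significance level."""
    return {label for label, p in p_values.items() if p > significance}


# Toy calibration set: four "active" and four "inactive" compounds.
cal_scores = [0.1, 0.2, 0.3, 0.4, 0.2, 0.5, 0.6, 0.9]
cal_labels = ["active"] * 4 + ["inactive"] * 4

# Nonconformity of one new compound under each candidate label.
test_scores = {"active": 0.25, "inactive": 0.95}

p = mondrian_p_values(cal_scores, cal_labels, test_scores)
print(p)
print(prediction_set(p, significance=0.25))
```

Unlike a traditional QSAR classifier, which always returns a single label, the prediction set here can contain one class, both classes, or no class, depending on the significance level chosen by the user.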
Pages: 16