Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery

Cited by: 104
Authors
Bosc, Nicolas [1]
Atkinson, Francis [1]
Felix, Eloy [1]
Gaulton, Anna [1]
Hersey, Anne [1]
Leach, Andrew R. [1]
Affiliations
[1] European Bioinformatics Institute (EMBL-EBI), Chemogenomics Team, Wellcome Genome Campus, Cambridge CB10 1SD, England
Funding
Wellcome Trust (UK); European Union Horizon 2020
Keywords
QSAR; Mondrian conformal prediction; ChEMBL; Classification models; Cheminformatics; APPLICABILITY DOMAIN; CLASSIFICATION; DATABASE; CHEMICALS; DESIGN;
DOI
10.1186/s13321-018-0325-4
Chinese Library Classification (CLC) number
O6 [Chemistry]
Subject classification code
0703
Abstract
Structure-activity relationship modelling is frequently used in the early stage of drug discovery to assess the activity of a compound on one or several targets, and can also be used to assess the interaction of compounds with liability targets. QSAR models have been used for these and related applications over many years, with good success. Conformal prediction is a relatively new QSAR approach that provides information on the certainty of a prediction, and so helps in decision-making. However, it is not always clear how best to make use of this additional information. In this article, we describe a case study that directly compares conformal prediction with traditional QSAR methods for large-scale predictions of target-ligand binding. The ChEMBL database was used to extract a data set comprising data from 550 human protein targets with different bioactivity profiles. For each target, a QSAR model and a conformal predictor were trained and their results compared. The models were then evaluated on new data published since the original models were built to simulate a real-world application. The comparative study highlights the similarities between the two techniques but also some differences that it is important to bear in mind when the methods are used in practical drug discovery applications.
Pages: 16
Related papers
50 results in total
  • [1] Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery
    Nicolas Bosc
    Francis Atkinson
    Eloy Felix
    Anna Gaulton
    Anne Hersey
    Andrew R. Leach
    Journal of Cheminformatics, 11
  • [2] Missed opportunities in large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery
    Damjan Krstajic
    Journal of Cheminformatics, 11
  • [3] Missed opportunities in large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery
    Krstajic, Damjan
    JOURNAL OF CHEMINFORMATICS, 2019, 11 (01)
  • [4] Reply to “Missed opportunities in large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery”
    Nicolas Bosc
    Francis Atkinson
    Eloy Félix
    Anna Gaulton
    Anne Hersey
    Andrew R. Leach
    Journal of Cheminformatics, 11
  • [5] Reply to "Missed opportunities in large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery"
    Bosc, Nicolas
    Atkinson, Francis
    Felix, Eloy
    Gaulton, Anna
    Hersey, Anne
    Leach, Andrew R.
    JOURNAL OF CHEMINFORMATICS, 2019, 11 (01)
  • [6] Comprehensive ensemble in QSAR prediction for drug discovery
    Kwon, Sunyoung
    Bae, Ho
    Jo, Jeonghee
    Yoon, Sungroh
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [7] Comprehensive ensemble in QSAR prediction for drug discovery
    Sunyoung Kwon
    Ho Bae
    Jeonghee Jo
    Sungroh Yoon
    BMC Bioinformatics, 20
  • [8] The impact of QSAR and CADD methods in drug discovery
    Wermuth, CG
    RATIONAL APPROACHES TO DRUG DESIGN, 2001, : 3 - 20
  • [9] Large-scale comparison of machine learning methods for drug target prediction on ChEMBL
    Mayr, Andreas
    Klambauer, Guenter
    Unterthiner, Thomas
    Steijaert, Marvin
    Wegner, Jorg K.
    Ceulemans, Hugo
    Clevert, Djork-Arne
    Hochreiter, Sepp
    CHEMICAL SCIENCE, 2018, 9 (24) : 5441 - 5451
  • [10] The application of conformal prediction to the drug discovery process
    Eklund, Martin
    Norinder, Ulf
    Boyer, Scott
    Carlsson, Lars
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2015, 74 (1-2) : 117 - 132