CPSign: conformal prediction for cheminformatics modeling

Cited by: 0
Authors
McShane, Staffan Arvidsson [1]
Norinder, Ulf [1,2,3]
Alvarsson, Jonathan [1]
Ahlberg, Ernst [1,4]
Carlsson, Lars [4,5]
Spjuth, Ola [1]
Affiliations
[1] Uppsala Univ, Dept Pharmaceut Biosci & Sci Life Lab, S-75124 Uppsala, Sweden
[2] Stockholm Univ, Dept Comp & Syst Sci, S-10587 Stockholm, Sweden
[3] Orebro Univ, MTM Res Ctr, Sch Sci & Technol, S-70182 Orebro, Sweden
[4] Royal Holloway Univ London, Dept Comp Sci, Egham TW20 0EX, England
[5] Jonkoping Univ, Dept Comp, S-55111 Jonkoping, Sweden
Source
JOURNAL OF CHEMINFORMATICS | 2024 / Vol. 16 / Issue 01
Funding
Swedish Research Council
Keywords
SIGNATURE MOLECULAR DESCRIPTOR; APPLICABILITY DOMAIN; FINGERPRINTS; CHEMISTRY; LIBRARY; QSAR;
DOI
10.1186/s13321-024-00870-9
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Conformal prediction has seen many applications in pharmaceutical science, where it is used to calibrate the outputs of machine learning models and to produce valid prediction intervals. Here we present the open source software CPSign, a complete implementation of conformal prediction for cheminformatics modeling. CPSign implements inductive and transductive conformal prediction for classification and regression, and probabilistic prediction with the Venn-ABERS methodology. The main chemical representation is signatures, but other types of descriptors are also supported. The main modeling methodology is support vector machines (SVMs), but additional modeling methods are supported via an extension mechanism, e.g. DeepLearning4J models. We also describe features for visualizing results from conformal models, including calibration and efficiency plots, as well as features for publishing predictive models as REST services. We compare CPSign against other common cheminformatics modeling approaches, including random forest and a directed message-passing neural network. The results show that CPSign produces robust predictive performance with comparable predictive efficiency, with superior runtime and lower hardware requirements compared to neural network based models. CPSign has been used in several studies and is in production use in multiple organizations. The ability to work directly with chemical input files and to perform descriptor calculation and SVM modeling in the conformal prediction framework, all within a single software package with a low footprint and fast execution time, makes CPSign a convenient yet flexible package for training, deploying, and predicting on chemical data. CPSign can be downloaded from GitHub at https://github.com/arosbio/cpsign.
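To illustrate what "valid prediction intervals" from inductive conformal prediction can look like, here is a minimal Python sketch of the generic textbook procedure for regression, using absolute residuals on a held-out calibration set as nonconformity scores. It is not CPSign's API or implementation (CPSign is a separate software package). The names `conformal_quantile`, `icp_interval`, `predict`, and the calibration data are all hypothetical and chosen only for this example.

```python
import math


def conformal_quantile(scores, confidence):
    """Pick the calibration score used as interval half-width.

    Uses the finite-sample rank ceil((n + 1) * confidence); if that rank
    exceeds n, the calibration set is too small and the bound is infinite.
    """
    n = len(scores)
    rank = math.ceil((n + 1) * confidence)
    if rank > n:
        return math.inf
    return sorted(scores)[rank - 1]


def icp_interval(predict, calibration, x_new, confidence=0.8):
    """Inductive conformal interval centred on predict(x_new).

    `predict` is an already trained model (assumed here), and `calibration`
    is a list of (x, y) pairs that were NOT used to train it.
    """
    # Nonconformity score: absolute residual on each calibration example.
    scores = [abs(y - predict(x)) for x, y in calibration]
    half_width = conformal_quantile(scores, confidence)
    centre = predict(x_new)
    return centre - half_width, centre + half_width


# Hypothetical stand-in for a trained regression model.
def predict(x):
    return 2.0 * x


# Hypothetical held-out calibration data.
calibration = [(1.0, 2.1), (2.0, 3.8), (3.0, 6.4), (4.0, 7.9), (5.0, 10.3),
               (6.0, 11.5), (7.0, 14.2), (8.0, 16.6), (9.0, 17.7)]

low, high = icp_interval(predict, calibration, x_new=10.0, confidence=0.8)
print(f"80% prediction interval: [{low:.2f}, {high:.2f}]")
```

Under the usual exchangeability assumption, intervals built this way contain the true value with at least the requested confidence on average. The width of the interval is what the abstract refers to as efficiency: narrower intervals at the same confidence level are more informative.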
Pages: 17