chemmodlab: a cheminformatics modeling laboratoryR package for fitting and assessing machine learning models

被引:0
|
作者
Ash, Jeremy R. [1 ]
Hughes-Oliver, Jacqueline M. [2 ]
机构
[1] North Carolina State Univ, Bioinformat Res Ctr, Dept Stat, 335 Ricks Hall,Campus Box 7566, Raleigh, NC 27695 USA
[2] North Carolina State Univ, Dept Stat, 2311 Stinson Dr,Campus Box 8203, Raleigh, NC 27695 USA
来源
关键词
Machine learning; QSAR; R package; Initial enhancement; Enrichment factor; Accumulation curve; Hit enrichment curve; Repeated cross-validation; CROSS-VALIDATION; SELECTION BIAS; ERROR RATE; PREDICTION; PROPERTY;
D O I
10.1186/s13321-018-0309-4
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The goal of chemmodlab is to streamline the fitting and assessment pipeline for many machine learning models in R, making it easy for researchers to compare the utility of these models. While focused on implementing methods for model fitting and assessment that have been accepted by experts in the cheminformatics field, all of the methods in chemmodlab have broad utility for the machine learning community. chemmodlab contains several assessment utilities, including a plotting function that constructs accumulation curves and a function that computes many performance measures. The most novel feature of chemmodlab is the ease with which statistically significant performance differences for many machine learning models is presented by means of the multiple comparisons similarity plot. Differences are assessed using repeated k-fold cross validation, where blocking increases precision and multiplicity adjustments are applied. chemmodlab is freely available on CRAN at https://cran.r-project.org/web/packages/chemmodlab/index.html.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] A hybrid modeling approach for assessing mechanistic models of small molecule partitioning in vivo using a machine learning-integrated modeling platform
    Antontsev, Victor
    Jagarapu, Aditya
    Bundey, Yogesh
    Hou, Hypatia
    Khotimchenko, Maksim
    Walsh, Jason
    Varshney, Jyotika
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [22] A hybrid modeling approach for assessing mechanistic models of small molecule partitioning in vivo using a machine learning-integrated modeling platform
    Victor Antontsev
    Aditya Jagarapu
    Yogesh Bundey
    Hypatia Hou
    Maksim Khotimchenko
    Jason Walsh
    Jyotika Varshney
    Scientific Reports, 11
  • [23] Optimization and supervised machine learning methods for fitting numerical physics models without derivatives*
    Bollapragada, Raghu
    Menickelly, Matt
    Nazarewicz, Witold
    O'Neal, Jared
    Reinhard, Paul-Gerhard
    Wild, Stefan M.
    JOURNAL OF PHYSICS G-NUCLEAR AND PARTICLE PHYSICS, 2021, 48 (02)
  • [24] Assessing English language sentences readability using machine learning models
    Maqsood, Shazia
    Shahid, Abdul
    Afzal, Muhammad Tanvir
    Roman, Muhammad
    Khan, Zahid
    Nawaz, Zubair
    Aziz, Muhammad Haris
    PEERJ COMPUTER SCIENCE, 2022, 7
  • [25] Assessing the reliability of complex networks: Empirical models based on machine learning
    Rocco, Claudio M.
    Muselli, Marco
    APPLIED ARTIFICIAL INTELLIGENCE, 2006, : 267 - +
  • [26] Assessing the Sentiment of Book Characteristics Using Machine Learning NLP Models
    Drozda, Pawel
    Sopyla, Krzysztof
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT II, 2023, 13589 : 218 - 231
  • [27] Assessing the Applicability of Machine Learning Models for Robotic Emotion Monitoring: A Survey
    Khan, Md Ayshik Rahman
    Rostov, Marat
    Rahman, Jessica Sharmin
    Ahmed, Khandaker Asif
    Hossain, Md Zakir
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [28] Assessing the net benefit of machine learning models in the presence of resource constraints
    Singh, Karandeep
    Shah, Nigam H.
    Vickers, Andrew J.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (04) : 668 - 673
  • [29] Assessing predictability of environmental time series with statistical and machine learning models
    Bonas, Matthew
    Datta, Abhirup
    Wikle, Christopher K.
    Boone, Edward L.
    Alamri, Faten S.
    Hari, Bhava Vyasa
    Kavila, Indulekha
    Simmons, Susan J.
    Jarvis, Shannon M.
    Burr, Wesley S.
    Pagendam, Daniel E.
    Chang, Won
    Castruccio, Stefano
    ENVIRONMETRICS, 2025, 36 (01)
  • [30] Assessing Machine Learning and Deep Learning Models for Suggested Dosing of Anesthetic Induction Medications
    Kendale, Samir
    ANESTHESIA AND ANALGESIA, 2020, 130