Novel machine learning models to predict endocrine disruption activity for high-throughput chemical screening

被引:4
|
作者
Collins, Sean P. [1 ]
Barton-Maclaren, Tara S. [1 ]
机构
[1] Hlth Canada Ottawa, Existing Subst Risk Assessment Bur, Hlth Environm & Consumer Safety Branch, Ottawa, ON, Canada
来源
FRONTIERS IN TOXICOLOGY | 2022年 / 4卷
关键词
estrogen; androgen; (Q)SAR; random forest; in silico; machine learning; RECEPTOR; VALIDATION;
D O I
10.3389/ftox.2022.981928
中图分类号
R99 [毒物学(毒理学)];
学科分类号
100405 ;
摘要
An area of ongoing concern in toxicology and chemical risk assessment is endocrine disrupting chemicals (EDCs). However, thousands of legacy chemicals lack the toxicity testing required to assess their respective EDC potential, and this is where computational toxicology can play a crucial role. The US (United States) Environmental Protection Agency (EPA) has run two programs, the Collaborative Estrogen Receptor Activity Project (CERAPP) and the Collaborative Modeling Project for Receptor Activity (CoMPARA) which aim to predict estrogen and androgen activity, respectively. The US EPA solicited research groups from around the world to provide endocrine receptor activity Qualitative (or Quantitative) Structure Activity Relationship ([Q]SAR) models and then combined them to create consensus models for different toxicity endpoints. Random Forest (RF) models were developed to cover a broader range of substances with high predictive capabilities using large datasets from CERAPP and CoMPARA for estrogen and androgen activity, respectively. By utilizing simple descriptors from open-source software and large training datasets, RF models were created to expand the domain of applicability for predicting endocrine disrupting activity and help in the screening and prioritization of extensive chemical inventories. In addition, RFs were trained to conservatively predict the activity, meaning models are more likely to make false-positive predictions to minimize the number of False Negatives. This work presents twelve binary and multi-class RF models to predict binding, agonism, and antagonism for estrogen and androgen receptors. The RF models were found to have high predictive capabilities compared to other in silico modes, with some models reaching balanced accuracies of 93% while having coverage of 89%. These models are intended to be incorporated into evolving priority-setting workflows and integrated strategies to support the screening and selection of chemicals for further testing and assessment by identifying potential endocrine-disrupting substances.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] High-Throughput Chemical Screening and Structure-Based Models to Predict hERG Inhibition
    Krishna, Shagun
    Borrel, Alexandre
    Huang, Ruili
    Zhao, Jinghua
    Xia, Menghang
    Kleinstreuer, Nicole
    [J]. BIOLOGY-BASEL, 2022, 11 (02):
  • [2] High-Throughput Screening to Predict Chemical-Assay Interference
    Alexandre Borrel
    Ruili Huang
    Srilatha Sakamuru
    Menghang Xia
    Anton Simeonov
    Kamel Mansouri
    Keith A. Houck
    Richard S. Judson
    Nicole C. Kleinstreuer
    [J]. Scientific Reports, 10
  • [3] High-Throughput Screening to Predict Chemical-Assay Interference
    Borrel, Alexandre
    Huang, Ruili
    Sakamuru, Srilatha
    Xia, Menghang
    Simeonov, Anton
    Mansouri, Kamel
    Houck, Keith A.
    Judson, Richard S.
    Kleinstreuer, Nicole C.
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [4] EDITORIAL: Chemical Compound Space Exploration by Multiscale High-Throughput Screening and Machine Learning
    Gryn'ova, Ganna
    Bereau, Tristan
    Muller, Carolin
    Friederich, Pascal
    Wade, Rebecca C.
    Nunes-Alves, Ariane
    Soares, Thereza A.
    Merz Jr, Kenneth
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (15) : 5737 - 5738
  • [5] High-throughput screening of bimetallic catalysts enabled by machine learning
    Li, Zheng
    Wang, Siwen
    Chin, Wei Shan
    Achenie, Luke E.
    Xin, Hongliang
    [J]. JOURNAL OF MATERIALS CHEMISTRY A, 2017, 5 (46) : 24131 - 24138
  • [6] Application of machine learning for high-throughput tumor marker screening
    Fu, Xingxing
    Ma, Wanting
    Zuo, Qi
    Qi, Yanfei
    Zhang, Shubiao
    Zhao, Yinan
    [J]. LIFE SCIENCES, 2024, 348
  • [7] Application of Life Cycle Assessment and Machine Learning for High-Throughput Screening of Green Chemical Substitutes
    Zhu, Xinzhe
    Ho, Chi-Hung
    Wang, Xiaonan
    [J]. ACS SUSTAINABLE CHEMISTRY & ENGINEERING, 2020, 8 (30) : 11141 - 11151
  • [8] Machine-learning models for high-throughput materials discovery
    Landrum, GA
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2003, 225 : U560 - U560
  • [9] A NOVEL STRATEGY FOR HIGH-THROUGHPUT SCREENING
    Gombosuren, N.
    Bacsa, B.
    Furka, A.
    Dibo, G.
    [J]. JOURNAL OF PEPTIDE SCIENCE, 2004, 10 : 155 - 155
  • [10] Novel trends in high-throughput screening
    Mayr, Lorenz M.
    Bojanic, Dejan
    [J]. CURRENT OPINION IN PHARMACOLOGY, 2009, 9 (05) : 580 - 588