In silico prediction of toxicity of non-congeneric industrial chemicals using ensemble learning based modeling approaches

被引:23
|
作者
Singh, Kunwar P. [1 ]
Gupta, Shikha
机构
[1] Acad Sci & Innovat Res, New Delhi 110001, India
关键词
Toxicity; Diverse chemicals; Ensemble learning models; Interspecies model; Molecular descriptors; Regulatory toxicology; SUPPORT VECTOR MACHINES; ACUTE AQUATIC TOXICITY; TETRAHYMENA-PYRIFORMIS; NEURAL-NETWORK; QSAR MODELS; MOLECULAR DESCRIPTORS; APPLICABILITY DOMAINS; AROMATIC-COMPOUNDS; PHENOLS; VALIDATION;
D O I
10.1016/j.taap.2014.01.006
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Ensemble learning approach based decision treeboost (DTB) and decision tree forest (DTF) models are introduced in order to establish quantitative structure-toxicity relationship (QSTR) for the prediction of toxicity of 1450 diverse chemicals. Eight non-quantum mechanical molecular descriptors were derived. Structural diversity of the chemicals was evaluated using Tanimoto similarity index. Stochastic gradient boosting and bagging algorithms supplemented DTB and DTF models were constructed for classification and function optimization problems using the toxicity end-point in T. pyriformis. Special attention was drawn to prediction ability and robustness of the models, investigated both in external and 10-fold cross validation processes. In complete data, optimal DTB and DTF models rendered accuracies of 98.90%, 98.83% in two-category and 98.14%, 98.14% in four-category toxicity classifications. Both the models further yielded classification accuracies of 100% in external toxicity data of T. pyriformis. The constructed regression models (DTB and DTF) using five descriptors yielded correlation coefficients (R-2) of 0.945, 0.944 between the measured and predicted toxicities with mean squared errors (MSEs) of 0.059, and 0.064 in complete T. pyriformis data. The T. pyriformis regression models (DTB and DTF) applied to the external toxicity data sets yielded R-2 and MSE values of 0.637, 0.655; 0.534, 0.507 (marine bacteria) and 0.741, 0.691; 0.155, 0.173 (algae). The results suggest for wide applicability of the inter-species models in predicting toxicity of new chemicals for regulatory purposes. These approaches provide useful strategy and robust tools in the screening of ecotoxicological risk or environmental hazard potential of chemicals. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:198 / 212
页数:15
相关论文
共 50 条
  • [1] Quantitative and qualitative models for carcinogenicity prediction for non-congeneric chemicals using CP ANN method for regulatory uses
    Natalja Fjodorova
    Marjan Vračko
    Marjan Tušar
    Aneta Jezierska
    Marjana Novič
    Ralph Kühne
    Gerrit Schüürmann
    Molecular Diversity, 2010, 14 : 581 - 594
  • [2] Quantitative and qualitative models for carcinogenicity prediction for non-congeneric chemicals using CP ANN method for regulatory uses
    Fjodorova, Natalja
    Vracko, Marjan
    Tusar, Marjan
    Jezierska, Aneta
    Novic, Marjana
    Kuehne, Ralph
    Schueuermann, Gerrit
    MOLECULAR DIVERSITY, 2010, 14 (03) : 581 - 594
  • [3] Counter propagation artificial neural network categorical models for prediction of carcinogenicity for non-congeneric chemicals
    Fjodorova, N.
    Vracko, M.
    Jezierska, A.
    Novic, M.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2010, 21 (1-2) : 57 - 75
  • [4] In silico prediction of mitochondrial toxicity of chemicals using machine learning methods
    Zhao, Piaopiao
    Peng, Yayuan
    Xu, Xuan
    Wang, Zhiyuan
    Wu, Zengrui
    Li, Weihua
    Tang, Yun
    Liu, Guixia
    JOURNAL OF APPLIED TOXICOLOGY, 2021, 41 (10) : 1518 - 1526
  • [5] Predicting human intestinal absorption of diverse chemicals using ensemble learning based QSAR modeling approaches
    Basant, Nikita
    Gupta, Shikha
    Singh, Kunwar P.
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2016, 61 : 178 - 196
  • [6] In silico prediction of ocular toxicity of compounds using explainable machine learning and deep learning approaches
    Zhou, Yiqing
    Wang, Ze
    Huang, Zejun
    Li, Weihua
    Chen, Yuanting
    Yu, Xinxin
    Tang, Yun
    Liu, Guixia
    JOURNAL OF APPLIED TOXICOLOGY, 2024, 44 (06) : 892 - 907
  • [7] In silico prediction of cellular permeability of diverse chemicals using qualitative and quantitative SAR modeling approaches
    Singh, Kunwar P.
    Gupta, Shikha
    Basant, Nikita
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 140 : 61 - 72
  • [8] In silico prediction of Tetrahymena pyriformis toxicity for diverse industrial chemicals with substructure pattern recognition and machine learning methods
    Cheng, Feixiong
    Shen, Jie
    Yu, Yue
    Li, Weihua
    Liu, Guixia
    Lee, Philip W.
    Tang, Yun
    CHEMOSPHERE, 2011, 82 (11) : 1636 - 1643
  • [9] In silico prediction of drug-induced developmental toxicity by using machine learning approaches
    Hui Zhang
    Jun Mao
    Hua-Zhao Qi
    Lan Ding
    Molecular Diversity, 2020, 24 : 1281 - 1290
  • [10] In silico prediction of drug-induced developmental toxicity by using machine learning approaches
    Zhang, Hui
    Mao, Jun
    Qi, Hua-Zhao
    Ding, Lan
    MOLECULAR DIVERSITY, 2020, 24 (04) : 1281 - 1290