Accurate Prediction of Rat Acute Oral Toxicity and Reference Dose for Thousands of Polycyclic Aromatic Hydrocarbon Derivatives Based on Chemometric QSAR and Machine Learning

被引:5
|
作者
Wu, Shuang [1 ]
Li, Shi-Xin [1 ]
Qiu, Jing [1 ]
Zhao, Hai-Ming [1 ]
Li, Yan-Wen [1 ]
Feng, Nai-Xian [1 ]
Liu, Bai-Lin [1 ]
Cai, Quan-Ying [1 ]
Xiang, Lei [1 ]
Mo, Ce-Hui [1 ]
Li, Qing X. [2 ]
机构
[1] Jinan Univ, Coll Life Sci & Technol, Guangdong Prov Res Ctr Environm Pollut Control & R, Guangzhou 510632, Peoples R China
[2] Univ Hawaii Manoa, Dept Mol Biosci & Bioengn, Honolulu, HI 96822 USA
基金
中国国家自然科学基金;
关键词
acute oral; PAH; toxicity; quantitativestructure-activity relationship; machine learning; interaction effects; CHEMICALS; MODELS; PAHS;
D O I
10.1021/acs.est.4c03966
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Acute oral toxicity is currently not available for most polycyclic aromatic hydrocarbons (PAHs), especially their derivatives, because it is cost-prohibitive to experimentally determine all of them. Here, quantitative structure-activity relationship (QSAR) models using machine learning (ML) for predicting the toxicity of PAH derivatives were developed, based on oral toxicity data points of 788 individual substances of rats. Both the individual ML algorithm gradient boosting regression trees (GBRT) and the stacking ML algorithm (extreme gradient boosting + GBRT + random forest regression) provided the best prediction results with satisfactory determination coefficients for both cross-validation and the test set. It was found that those PAH derivatives with fewer polar hydrogens, more large-sized atoms, more branches, and lower polarizability have higher toxicity. Software based on the optimal ML-QSAR model was successfully developed to expand the application potential of the developed model, obtaining reliable prediction of pLD(50) values and reference doses for 6893 external PAH derivatives. Among these chemicals, 472 were identified as moderately or highly toxic; 10 out of them had clear environment detection or use records. The findings provide valuable insights into the toxicity of PAHs and their derivatives, offering a standard platform for effectively evaluating chemical toxicity using ML-QSAR models.
引用
收藏
页码:15100 / 15110
页数:11
相关论文
共 7 条
  • [1] Chemometric QSAR modeling of acute oral toxicity of Polycyclic Aromatic Hydrocarbons (PAHs) to rat using simple 2D descriptors and interspecies toxicity modeling with mouse
    Sun, Guohui
    Zhang, Yifan
    Pei, Luyu
    Lou, Yuqing
    Mu, Yao
    Yun, Jiayi
    Li, Feifan
    Wang, Yachen
    Hao, Zhaoqi
    Xi, Sha
    Li, Chen
    Chen, Chuhan
    Zhao, Lijiao
    Zhang, Na
    Zhong, Rugang
    Peng, Yongzhen
    ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY, 2021, 222
  • [2] Evaluation of Assay Central Machine Learning Models for Rat Acute Oral Toxicity Prediction
    Minerali, Eni
    Foil, Daniel H.
    Zorn, Kimberley M.
    Ekins, Sean
    ACS SUSTAINABLE CHEMISTRY & ENGINEERING, 2020, 8 (42) : 16020 - 16027
  • [3] Random forest algorithm-based accurate prediction of rat acute oral toxicity
    Xiao, Linrong
    Deng, Jiyong
    Yang, Liping
    Huang, Xianwei
    Yu, Xinliang
    MOLECULAR PHYSICS, 2022, 120 (24)
  • [4] Machine learning-assisted data filtering and QSAR models for prediction of chemical acute toxicity on rat and mouse
    Bo, Tao
    Lin, Yaohui
    Han, Jinglong
    Hao, Zhineng
    Liu, Jingfu
    JOURNAL OF HAZARDOUS MATERIALS, 2023, 452
  • [5] ADMET evaluation in drug discovery: 15. Accurate prediction of rat oral acute toxicity using relevance vector machine and consensus modeling
    Lei, Tailong
    Li, Youyong
    Song, Yunlong
    Li, Dan
    Sun, Huiyong
    Hou, Tingjun
    JOURNAL OF CHEMINFORMATICS, 2016, 8
  • [6] ADMET evaluation in drug discovery: 15. Accurate prediction of rat oral acute toxicity using relevance vector machine and consensus modeling
    Tailong Lei
    Youyong Li
    Yunlong Song
    Dan Li
    Huiyong Sun
    Tingjun Hou
    Journal of Cheminformatics, 8
  • [7] High-throughput prediction of oral acute toxicity in Rat and Mouse of over 100,000 polychlorinated persistent organic pollutants (PC-POPs) by interpretable data fusion-driven machine learning global models
    Chen, Shuo
    Fan, Tengjiao
    Ren, Ting
    Zhang, Na
    Zhao, Lijiao
    Zhong, Rugang
    Sun, Guohui
    JOURNAL OF HAZARDOUS MATERIALS, 2024, 480