In silico prediction of toxicity of phenols to Tetrahymena pyriformis by using genetic algorithm and decision tree-based modeling approach

被引:23
|
作者
Abbasitabar, Fatemeh [1 ]
Zare-Shahabadi, Vahid [2 ]
机构
[1] Islamic Azad Univ, Dept Chem, Marvdasht Branch, Marvdasht, Iran
[2] Islamic Azad Univ, Dept Chem, Mahshahr Branch, Mahshahr, Iran
关键词
Toxicity; Phenol; Decision tree; Genetic algorithm; Tetrahymena pyriformis; MINNOW PIMEPHALES-PROMELAS; STRUCTURE-PROPERTY RELATIONSHIP; MULTIPLE LINEAR REGRESSIONS; ACUTE AQUATIC TOXICITY; QUANTITATIVE STRUCTURE; FATHEAD MINNOW; QSAR MODELS; MOLECULAR-STRUCTURE; ORGANIC-COMPOUNDS; SAR MODELS;
D O I
10.1016/j.chemosphere.2016.12.095
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Risk assessment of chemicals is an important issue in environmental protection; however, there is a huge lack of experimental data for a large number of end-points. The experimental determination of toxicity of chemicals involves high costs and time-consuming process. In silica tools such as quantitative structure toxicity relationship (QSTR) models, which are constructed on the basis of computational molecular descriptors, can predict missing data for toxic end-points for existing or even not yet synthesized chemicals. Phenol derivatives are known to be aquatic pollutants. With this background, we aimed to develop an accurate and reliable QSTR model for the prediction of toxicity of 206 phenols to Tetrahymena pyriformis. A multiple linear regression (MLR)-based QSTR was obtained using a powerful descriptor selection tool named Memorized_ACO algorithm. Statistical parameters of the model were 0.72 and 0.68 for R-training(2) and R-test(2), respectively. To develop a high-quality QSTR model, classification and regression raining tree (CART) was employed. Two approaches were considered; (1) phenols were classified into different modes of action using CART and (2) the phenols in the training set were partitioned to several subsets by a tree in such a manner that in each subset, a high-quality MLR could be developed. For the first approach, the statistical parameters of the resultant QSTR model were improved to 0.83 and 0.75 for R-training(2) and R-test(2), respectively. Genetic algorithm was employed in the second approach to obtain an optimal tree, and it was shown that the final QSTR model provided excellent prediction accuracy for the training and test sets (R-training(2) and R-test(2) were 0.91 and 0.93, respectively). The mean absolute error for the test set was computed as 0.1615. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:249 / 259
页数:11
相关论文
共 50 条
  • [41] Prediction of axillary lymph node metastasis in primary breast cancer patients using a decision tree-based model
    Masahiro Takada
    Masahiro Sugimoto
    Yasuhiro Naito
    Hyeong-Gon Moon
    Wonshik Han
    Dong-Young Noh
    Masahide Kondo
    Katsumasa Kuroi
    Hironobu Sasano
    Takashi Inamoto
    Masaru Tomita
    Masakazu Toi
    [J]. BMC Medical Informatics and Decision Making, 12
  • [42] Prediction of gross calorific value from coal analysis using decision tree-based bagging and boosting techniques
    Munshi, Tanveer Alam
    Jahan, Labiba Nusrat
    Howladar, M. Farhad
    Hashan, Mahamudul
    [J]. HELIYON, 2024, 10 (01)
  • [43] Prediction of axillary lymph node metastasis in primary breast cancer patients using a decision tree-based model
    Takada, Masahiro
    Sugimoto, Masahiro
    Naito, Yasuhiro
    Moon, Hyeong-Gon
    Han, Wonshik
    Noh, Dong-Young
    Kondo, Masahide
    Kuroi, Katsumasa
    Sasano, Hironobu
    Inamoto, Takashi
    Tomita, Masaru
    Toi, Masakazu
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2012, 12
  • [44] Series AC Arc Fault Detection Using Decision Tree-Based Machine Learning Algorithm and Raw Current
    Paul, Kamal Chandra
    Schweizer, Linus
    Zhao, Tiefu
    Chen, Chen
    Wang, Yao
    [J]. 2022 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2022,
  • [45] Search tree-based approach for the p-median problem using the ant colony optimization algorithm
    Bodnariuc, Gabriel
    Cataranciuc, Sergiu
    [J]. COMPUTER SCIENCE JOURNAL OF MOLDOVA, 2014, 22 (01) : 62 - 76
  • [46] Breast Cancer Prediction Using Genetic Algorithm Based Ensemble Approach
    Chauhan, Pragya
    Swami, Amit
    [J]. 2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [47] Identifying Predictors of Inpatient Verbal Aggression in a Forensic Psychiatric Setting Using a Tree-based Modeling Approach
    Neumann, Merten
    Klatt, Thimna
    [J]. JOURNAL OF INTERPERSONAL VIOLENCE, 2022, 37 (17-18) : NP16351 - NP16376
  • [48] A novel tree-based algorithm for real-time prediction of rockburst risk using field microseismic monitoring
    Yin, Xin
    Liu, Quansheng
    Pan, Yucong
    Huang, Xing
    [J]. ENVIRONMENTAL EARTH SCIENCES, 2021, 80 (16)
  • [49] A novel tree-based algorithm for real-time prediction of rockburst risk using field microseismic monitoring
    Xin Yin
    Quansheng Liu
    Yucong Pan
    Xing Huang
    [J]. Environmental Earth Sciences, 2021, 80
  • [50] ECG-based prediction algorithm for imminent malignant ventricular arrhythmias using decision tree
    Mandala, Satria
    Di, Tham Cai
    Sunar, Mohd Shahrizal
    Adiwijaya
    [J]. PLOS ONE, 2020, 15 (05):