In silico prediction of toxicity of phenols to Tetrahymena pyriformis by using genetic algorithm and decision tree-based modeling approach

被引:23
|
作者
Abbasitabar, Fatemeh [1 ]
Zare-Shahabadi, Vahid [2 ]
机构
[1] Islamic Azad Univ, Dept Chem, Marvdasht Branch, Marvdasht, Iran
[2] Islamic Azad Univ, Dept Chem, Mahshahr Branch, Mahshahr, Iran
关键词
Toxicity; Phenol; Decision tree; Genetic algorithm; Tetrahymena pyriformis; MINNOW PIMEPHALES-PROMELAS; STRUCTURE-PROPERTY RELATIONSHIP; MULTIPLE LINEAR REGRESSIONS; ACUTE AQUATIC TOXICITY; QUANTITATIVE STRUCTURE; FATHEAD MINNOW; QSAR MODELS; MOLECULAR-STRUCTURE; ORGANIC-COMPOUNDS; SAR MODELS;
D O I
10.1016/j.chemosphere.2016.12.095
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Risk assessment of chemicals is an important issue in environmental protection; however, there is a huge lack of experimental data for a large number of end-points. The experimental determination of toxicity of chemicals involves high costs and time-consuming process. In silica tools such as quantitative structure toxicity relationship (QSTR) models, which are constructed on the basis of computational molecular descriptors, can predict missing data for toxic end-points for existing or even not yet synthesized chemicals. Phenol derivatives are known to be aquatic pollutants. With this background, we aimed to develop an accurate and reliable QSTR model for the prediction of toxicity of 206 phenols to Tetrahymena pyriformis. A multiple linear regression (MLR)-based QSTR was obtained using a powerful descriptor selection tool named Memorized_ACO algorithm. Statistical parameters of the model were 0.72 and 0.68 for R-training(2) and R-test(2), respectively. To develop a high-quality QSTR model, classification and regression raining tree (CART) was employed. Two approaches were considered; (1) phenols were classified into different modes of action using CART and (2) the phenols in the training set were partitioned to several subsets by a tree in such a manner that in each subset, a high-quality MLR could be developed. For the first approach, the statistical parameters of the resultant QSTR model were improved to 0.83 and 0.75 for R-training(2) and R-test(2), respectively. Genetic algorithm was employed in the second approach to obtain an optimal tree, and it was shown that the final QSTR model provided excellent prediction accuracy for the training and test sets (R-training(2) and R-test(2) were 0.91 and 0.93, respectively). The mean absolute error for the test set was computed as 0.1615. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:249 / 259
页数:11
相关论文
共 50 条
  • [21] Classification of multiple and single power quality disturbances using a decision tree-based approach
    Barbosa B.H.G.
    Ferreira D.D.
    [J]. Journal of Control, Automation and Electrical Systems, 2013, 24 (5) : 638 - 648
  • [22] A tree-based approach for event prediction using episode rules over event streams
    Cho, Chung-Wen
    Zheng, Ying
    Wu, Yi-Hung
    Chen, Arbee L. P.
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, 5181 : 225 - +
  • [23] Student academic performance prediction model using decision tree and fuzzy genetic algorithm
    Hamsa, Hashmia
    Indiradevi, Simi
    Kizhakkethottam, Jubilant J.
    [J]. 1ST GLOBAL COLLOQUIUM ON RECENT ADVANCEMENTS AND EFFECTUAL RESEARCHES IN ENGINEERING, SCIENCE AND TECHNOLOGY - RAEREST 2016, 2016, 25 : 326 - 332
  • [24] Software Fault Prediction Using Quad Tree-Based K-Means Clustering Algorithm
    Bishnu, Partha Sarathi
    Bhattacherjee, Vandana
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (06) : 1146 - 1150
  • [25] Structuring different manifestations of misinformation for better policy development using a decision tree-based approach
    Haegle, Olivia
    Escher, Stephan
    Heil, Reinhhard
    Jahnel, Jutta
    [J]. POLICY AND INTERNET, 2024,
  • [26] Estimation of Unit Process Data for Life Cycle Assessment Using a Decision Tree-Based Approach
    Zhao, Bu
    Shuai, Chenyang
    Hou, Ping
    Qu, Shen
    Xu, Ming
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2021, 55 (12) : 8439 - 8446
  • [27] Study on multi-stage logistic chain network: a spanning tree-based genetic algorithm approach
    Syarif, A
    Yun, Y
    Gen, M
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2002, 43 (1-2) : 299 - 314
  • [28] Prediction of Syncope based on Physiological Data Analysis using Decision Tree Algorithm
    Saxena, Utkarsh
    Moulik, Soumen
    Roy, Diptendu Sinha
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [29] Prediction of Material Removal Rate for Chemical Mechanical Planarization Using Decision Tree-Based Ensemble Learning
    Li, Zhixiong
    Wu, Dazhong
    Yu, Tianyu
    [J]. JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2019, 141 (03):
  • [30] A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis
    Maia, Ranniery
    Toda, Tomoki
    Tokuda, Keiichi
    Sakai, Shinsuke
    Nakamura, Satoshi
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1743 - 1746