Predicting persistence in the sediment compartment with a new automatic software based on the k-Nearest Neighbor (k-NN) algorithm

Cited by: 23
Authors
Manganaro, Alberto [1, 2]
Pizzo, Fabiola [1]
Lombardo, Anna [1]
Pogliaghi, Alberto [1]
Benfenati, Emilio [1]
Affiliations
[1] IRCCS Ist Ric Farmacol Mario Negri, Lab Environm Chem & Toxicol, I-20159 Milan, Italy
[2] Kode Srl, I-56124 Pisa, Italy
Keywords
Persistence; Half-life; Sediment; PBT; In silico; Read across; READY BIODEGRADABILITY; ORGANIC-CHEMICALS; PRIORITIZATION; TOXICITY; PBT
DOI
10.1016/j.chemosphere.2015.10.054
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
The ability of a substance to resist degradation and persist in the environment needs to be readily identified in order to protect the environment and human health. Many regulations require the assessment of persistence for substances commonly manufactured and marketed. Besides laboratory-based testing methods, in silico tools may be used to obtain a computational prediction of persistence. We present a new program to develop k-Nearest Neighbor (k-NN) models. The k-NN algorithm is a similarity-based approach that predicts the property of a substance in relation to the experimental data for its most similar compounds. We employed this software to identify persistence in the sediment compartment. Data on half-life (HL) in sediment were obtained from different sources and, after careful data pruning, the final dataset, containing 297 organic compounds, was divided into four experimental classes. We developed several models giving satisfactory performances, considering that both the training and test set accuracy ranged between 0.90 and 0.96. We finally selected one model which will be made available in the near future in the freely available software platform VEGA. This model offers a valuable in silico tool that may be really useful for fast and inexpensive screening. (C) 2015 Elsevier Ltd. All rights reserved.
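The similarity-based idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' software or the VEGA model. The structural feature sets, the two class labels, the Tanimoto similarity measure, and the similarity-weighted vote are all assumptions chosen for illustration; the paper itself uses four experimental classes and does not specify these details here.

```python
from collections import Counter


def tanimoto(a: frozenset, b: frozenset) -> float:
    """Tanimoto (Jaccard) similarity between two feature sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def knn_predict(query: frozenset, training: list, k: int = 3) -> str:
    """Predict a class from the k most similar training compounds.

    Each neighbor votes for its experimental class, weighted by its
    similarity to the query (an assumed weighting scheme).
    """
    ranked = sorted(
        training,
        key=lambda item: tanimoto(query, item[0]),
        reverse=True,
    )
    votes = Counter()
    for features, label in ranked[:k]:
        votes[label] += tanimoto(query, features)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: (structural features, experimental class).
# These are invented for illustration and are NOT taken from the paper.
training = [
    (frozenset({"Cl", "aromatic", "ether"}), "persistent"),
    (frozenset({"Cl", "aromatic"}), "persistent"),
    (frozenset({"Cl", "aromatic", "biphenyl"}), "persistent"),
    (frozenset({"ester", "aliphatic"}), "non-persistent"),
    (frozenset({"alcohol", "aliphatic"}), "non-persistent"),
]

query = frozenset({"Cl", "aromatic", "nitro"})
prediction = knn_predict(query, training, k=3)
print(prediction)
```

In this toy set, the three compounds most similar to the query all share the chlorinated aromatic features, so the query inherits their class. A real model would use computed molecular descriptors or fingerprints and a validated choice of k.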
Pages: 1624-1630
Number of pages: 7