Machine Learning Models for Identification and Prediction of Toxic Organic Compounds Using Daphnia magna Transcriptomic Profiles

被引:7
|
作者
Choi, Tae-June [1 ]
An, Hyung-Eun [1 ]
Kim, Chang-Bae [1 ]
机构
[1] Sangmyung Univ, Dept Biotechnol, Seoul 03016, South Korea
来源
LIFE-BASEL | 2022年 / 12卷 / 09期
关键词
environmental monitoring; aquatic ecosystem; toxic organic compounds; Daphnia magna; transcriptomic profiles; machine learning; random forest; CLASSIFICATION;
D O I
10.3390/life12091443
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A wide range of environmental factors heavily impact aquatic ecosystems, in turn, affecting human health. Toxic organic compounds resulting from anthropogenic activity are a source of pollution in aquatic ecosystems. To evaluate these contaminants, current approaches mainly rely on acute and chronic toxicity tests, but cannot provide explicit insights into the causes of toxicity. As an alternative, genome-wide gene expression systems allow the identification of contaminants causing toxicity by monitoring the organisms' response to toxic substances. In this study, we selected 22 toxic organic compounds, classified as pesticides, herbicides, or industrial chemicals, that induce environmental problems in aquatic ecosystems and affect human-health. To identify toxic organic compounds using gene expression data from Daphnia magna, we evaluated the performance of three machine learning based feature-ranking algorithms (Learning Vector Quantization, Random Forest, and Support Vector Machines with a Linear kernel), and nine classifiers (Linear Discriminant Analysis, Classification And Regression Trees, K-nearest neighbors, Support Vector Machines with a Linear kernel, Random Forest, Boosted C5.0, Gradient Boosting Machine, eXtreme Gradient Boosting with tree, and eXtreme Gradient Boosting with DART booster). Our analysis revealed that a combination of feature selection based on feature-ranking and a random forest classification algorithm had the best model performance, with an accuracy of 95.7%. This is a preliminary study to establish a model for the monitoring of aquatic toxic substances by machine learning. This model could be an effective tool to manage contaminants and toxic organic compounds in aquatic systems.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Prediction of phytoplankton biomass and identification of key influencing factors using interpretable machine learning models
    Xu, Yi
    Zhang, Di
    Lin, Junqiang
    Peng, Qidong
    Lei, Xiaohui
    Jin, Tiantian
    Wang, Jia
    Yuan, Ruifang
    ECOLOGICAL INDICATORS, 2024, 158
  • [32] 2018 Kerala Flood Damage: Survey, Identification, and Damage Prediction Models Using Machine Learning
    Anisha, A.
    Malavika, K.
    Erfana, K.
    Dheeraj, Kanram
    Subahan, Abdul
    Shinsha Raj, A.K.
    Mangalathu, Sujith
    Davis, Robin
    ASCE Open: Multidisciplinary Journal of Civil Engineering, 2023, 1
  • [33] Utilizing Machine Learning Models for Predicting Diamagnetic Susceptibility of Organic Compounds
    Zhang, Yining
    Xing, Sijie
    Wei, Lai
    Shi, Tongfei
    ACS OMEGA, 2024, 9 (12): : 14368 - 14374
  • [34] Identification of Phishing URLs Using Machine Learning Models
    Vivek, Meghashyam
    Premjith, Nithin
    Johnson, Aaron Antonio
    Maurya, Ashutosh Kumar
    Jingle, I. Diana Jeba
    FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 3, CIS 2023, 2024, 865 : 209 - 219
  • [35] Machine Learning Uses Chemo-Transcriptomic Profiles to Stratify Antimalarial Compounds With Similar Mode of Action
    van Heerden, Ashleigh
    van Wyk, Roelof
    Birkholtz, Lyn-Marie
    FRONTIERS IN CELLULAR AND INFECTION MICROBIOLOGY, 2021, 11
  • [36] Prediction of acute toxicity of emerging contaminants on the water flea Daphnia magna by Ant Colony Optimization - Support Vector Machine QSTR models
    Aalizadeh, Reza
    von der Ohe, Peter C.
    Thomaidis, Nikolaos S.
    ENVIRONMENTAL SCIENCE-PROCESSES & IMPACTS, 2017, 19 (03) : 438 - 448
  • [37] Prediction of Preeclampsia Using Machine Learning and Deep Learning Models: A Review
    Aljameel, Sumayh S.
    Alzahrani, Manar
    Almusharraf, Reem
    Altukhais, Majd
    Alshaia, Sadeem
    Sahlouli, Hanan
    Aslam, Nida
    Khan, Irfan Ullah
    Alabbad, Dina A.
    Alsumayt, Albandari
    BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (01)
  • [38] Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data
    Armeli, Gianluca
    Peters, Jan-Hendrik
    Koop, Thomas
    ACS OMEGA, 2023, 8 (13): : 12298 - 12309
  • [39] Volatile Organic Compounds for the Prediction of Lung Cancer by Using Ensembled Machine Learning Model and Feature Selection
    Khanna, Divya
    Kumar, Arun
    Bhat, Shahid Ahmad
    IEEE ACCESS, 2025, 13 : 9809 - 9820
  • [40] Prediction of the lattice constants of pyrochlore compounds using machine learning
    Ibrahim Olanrewaju Alade
    Mojeed Opeyemi Oyedeji
    Mohd Amiruddin Abd Rahman
    Tawfik A. Saleh
    Soft Computing, 2022, 26 : 8307 - 8315