Machine Learning Models for Identification and Prediction of Toxic Organic Compounds Using Daphnia magna Transcriptomic Profiles

Cited by: 7
Authors
Choi, Tae-June [1]
An, Hyung-Eun [1]
Kim, Chang-Bae [1]
Affiliations
[1] Sangmyung Univ, Dept Biotechnol, Seoul 03016, South Korea
Source
LIFE-BASEL | 2022 / Volume 12 / Issue 09
Keywords
environmental monitoring; aquatic ecosystem; toxic organic compounds; Daphnia magna; transcriptomic profiles; machine learning; random forest; classification
DOI
10.3390/life12091443
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
A wide range of environmental factors heavily impact aquatic ecosystems, in turn affecting human health. Toxic organic compounds resulting from anthropogenic activity are a source of pollution in aquatic ecosystems. Current approaches to evaluating these contaminants rely mainly on acute and chronic toxicity tests, which cannot provide explicit insights into the causes of toxicity. As an alternative, genome-wide gene expression systems allow the identification of contaminants causing toxicity by monitoring the organisms' response to toxic substances. In this study, we selected 22 toxic organic compounds, classified as pesticides, herbicides, or industrial chemicals, that induce environmental problems in aquatic ecosystems and affect human health. To identify toxic organic compounds using gene expression data from Daphnia magna, we evaluated the performance of three machine-learning-based feature-ranking algorithms (Learning Vector Quantization, Random Forest, and Support Vector Machines with a Linear kernel) and nine classifiers (Linear Discriminant Analysis, Classification And Regression Trees, K-nearest neighbors, Support Vector Machines with a Linear kernel, Random Forest, Boosted C5.0, Gradient Boosting Machine, eXtreme Gradient Boosting with tree, and eXtreme Gradient Boosting with DART booster). Our analysis revealed that a combination of feature selection based on feature ranking and a random forest classification algorithm had the best model performance, with an accuracy of 95.7%. This is a preliminary study to establish a model for the monitoring of aquatic toxic substances by machine learning. This model could be an effective tool to manage contaminants and toxic organic compounds in aquatic systems.
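The workflow the abstract describes (rank features, keep the top-ranked ones, then classify) can be illustrated with a minimal, standard-library-only sketch. This is not the authors' pipeline: the data are synthetic, the ranking score is a simple between-class/within-class variance ratio swapped in for the three rankers named above, and the classifier is a plain K-nearest-neighbors vote (one of the nine classifier families listed) rather than a random forest. All function names, class labels, and parameter values are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, pvariance


def make_toy_data(rng, n_per_class=20, n_genes=30, n_informative=3):
    """Synthetic 'expression' matrix: only the first n_informative genes differ by class."""
    X, y = [], []
    for class_index, label in enumerate(["compound_A", "compound_B", "compound_C"]):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
            for j in range(n_informative):
                row[j] += 4.0 * class_index  # class-dependent shift (assumed effect size)
            X.append(row)
            y.append(label)
    return X, y


def variance_ratio(column, labels):
    """Variance of the class means divided by the mean within-class variance."""
    groups = {}
    for value, label in zip(column, labels):
        groups.setdefault(label, []).append(value)
    between = pvariance([mean(g) for g in groups.values()])
    within = mean([pvariance(g) for g in groups.values()])
    return between / (within + 1e-12)


def rank_features(X, y):
    """Return feature indices ordered from highest to lowest variance ratio."""
    scores = [variance_ratio([row[j] for row in X], y) for j in range(len(X[0]))]
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)


def knn_predict(X_train, y_train, x, k=3):
    """Majority vote among the k nearest training samples (squared Euclidean distance)."""
    nearest = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), label)
        for row, label in zip(X_train, y_train)
    )[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def run_demo(seed=0, n_keep=3):
    """Rank features on the training split only, then score KNN on the held-out split."""
    rng = random.Random(seed)
    X, y = make_toy_data(rng)
    order = list(range(len(X)))
    rng.shuffle(order)
    cut = int(0.7 * len(order))
    train, test = order[:cut], order[cut:]

    X_train = [X[i] for i in train]
    y_train = [y[i] for i in train]
    top = rank_features(X_train, y_train)[:n_keep]  # ranking never sees test samples

    X_train_reduced = [[row[j] for j in top] for row in X_train]
    correct = sum(
        knn_predict(X_train_reduced, y_train, [X[i][j] for j in top]) == y[i]
        for i in test
    )
    return top, correct / len(test)


if __name__ == "__main__":
    top, accuracy = run_demo()
    print("selected feature indices:", top)
    print("held-out accuracy:", accuracy)
```

The ranking is computed on the training split alone so that the held-out accuracy is not inflated by feature selection having seen the test samples; the accuracy on this toy data says nothing about the 95.7% reported in the abstract.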
Pages: 10