Predictive Modeling of Pesticides Reproductive Toxicity in Earthworms Using Interpretable Machine-Learning Techniques on Imbalanced Data

Authors
Kotli, Mihkel [1]
Piir, Geven [1]
Maran, Uko [1]
Affiliations
[1] Univ Tartu, Inst Chem, EE-50411 Tartu, Estonia
Source
ACS OMEGA | 2025 / Vol. 10 / Issue 05
Funding
EU Horizon 2020;
Keywords
QSAR MODELS; CHEMICALS; SORPTION;
DOI
10.1021/acsomega.4c09719
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The earthworm is a key indicator species in soil ecosystems. This makes the reproductive toxicity of chemical compounds to earthworms an important property to determine and makes computational models necessary for descriptive and predictive purposes. Thus, the aim was to develop an advanced Quantitative Structure-Activity Relationship (QSAR) modeling approach for this complex property with imbalanced data. The approach integrated gradient-boosted decision trees as classifiers with a genetic algorithm for feature selection and Bayesian optimization for hyperparameter tuning. An additional goal was to analyze and interpret, using SHAP values, the structural features encoded by the molecular descriptors that contribute to pesticide toxicity and nontoxicity, the most notable of which are solvation entropy and the number of hydrolyzable bonds. The final model was constructed as a stacked ensemble that combines the strengths of the individual models. Evaluation of this model with an external test set of 147 compounds demonstrated a well-defined applicability domain and sufficient predictive capabilities, with a Balanced Accuracy of 77%. The model representation follows FAIR principles and is available on QsarDB.org.
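The abstract reports Balanced Accuracy rather than plain accuracy, which is the usual choice for imbalanced classes. The following is a minimal sketch of why the two differ, not the authors' evaluation code. The function names and the ten labels are invented for illustration (1 = toxic, 0 = nontoxic) and are not taken from the paper's test set.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (1 = toxic, 0 = nontoxic)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    sensitivity = tp / (tp + fn)  # recall on the toxic class
    specificity = tn / (tn + fp)  # recall on the nontoxic class
    return (sensitivity + specificity) / 2


def plain_accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


# Hypothetical imbalanced data: 2 toxic and 8 nontoxic compounds.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]  # one toxic missed, one false alarm
always_nontoxic = [0] * len(y_true)      # trivial majority-class predictor

print(plain_accuracy(y_true, y_pred), balanced_accuracy(y_true, y_pred))
print(plain_accuracy(y_true, always_nontoxic), balanced_accuracy(y_true, always_nontoxic))
```

On this made-up data the majority-class predictor matches the real predictor on plain accuracy but drops to chance level on balanced accuracy, because each class contributes equally to the score regardless of its size.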
Pages: 4732-4744
Number of pages: 13