Machine learning for predicting halogen radical reactivity toward aqueous organic chemicals

被引:4
|
作者
Liang, Youheng [1 ]
Huangfu, Xiaoliu [1 ]
Huang, Ruixing [1 ]
Han, Zhenpeng [1 ]
Wu, Sisi [1 ]
Wang, Jingrui [1 ]
Long, Xinlong [1 ]
Ma, Jun [2 ]
He, Qiang [1 ]
机构
[1] Chongqing Univ, Coll Environm, Key Lab Ecoenvironm Three Gorges Reservoir Reg, Minist Educ, Chongqing 400044, Peoples R China
[2] Harbin Inst Technol, Sch Municipal & Environm Engn, State Key Lab Urban Water Resources & Environm, Harbin 150090, Peoples R China
基金
中国国家自然科学基金;
关键词
Halogen radical rate constants; Morgan fingerprint; Mordred descriptor; Machine learning; Web application; DEGRADATION; UV/CHLORINE; OXIDATION; QSAR; CONTAMINANTS; KINETICS;
D O I
10.1016/j.jhazmat.2024.134501
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Rapid advances in machine learning (ML) provide fast, accurate, and widely applicable methods for predicting free radical-mediated organic pollutant reactivity. In this study, the rate constants (logk) of four halogen radicals were predicted using Morgan fingerprint (MF) and Mordred descriptor (MD) in combination with a series of ML models. The findings highlighted that making accurate predictions for various datasets depended on an effective combination of descriptors and algorithms. To further alleviate the challenge of limited sample size, we introduced a data combination strategy that improved prediction accuracy and mitigated overfitting by combining different datasets. The Light Gradient Boosting Machine (LightGBM) with MF and Random Forest (RF) with MD models based on the unified dataset were finally selected as the optimal models. The SHapley Additive exPlanations revealed insights: the MF-LightGBM model successfully captured the influence of electron-withdrawing/ donating groups, while autocorrelation, walk count and information content descriptors in the MD-RF model were identified as key features. Furthermore, the important contribution of pH was emphasized. The results of the applicability domain analysis further supported that the developed model can make reliable predictions for query compounds across a broader range. Finally, a practical web application for logk calculations was built.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Predicting reactivity dynamics of halogen species and trace organic contaminants using machine learning models
    Zhu J.
    Huang Y.
    Yi Q.
    Bu L.
    Zhou S.
    Shi Z.
    Chemosphere, 2024, 346
  • [2] Predicting aqueous sorption of organic pollutants on microplastics with machine learning
    Qiu, Ye
    Li, Zhejun
    Zhang, Tong
    Zhang, Ping
    WATER RESEARCH, 2023, 244
  • [3] Shedding light on "Black Box" machine learning models for predicting the reactivity of HO• radicals toward organic compounds
    Zhong, Shifa
    Zhang, Kai
    Wang, Dong
    Zhang, Huichun
    CHEMICAL ENGINEERING JOURNAL, 2021, 405
  • [4] Predicting the chemical reactivity of organic materials using a machine-learning approach
    Lee, Byungju
    Yoo, Jaekyun
    Kang, Kisuk
    CHEMICAL SCIENCE, 2020, 11 (30) : 7813 - 7822
  • [5] Machine Learning-Assisted Exploration and Identification of Aqueous Dispersants in the Vast Diversity of Organic Chemicals
    Jintoku, Hirokuni
    Futaba, Don N.
    ACS APPLIED MATERIALS & INTERFACES, 2024, 16 (09) : 11800 - 11808
  • [6] Hydroxyl free radical reactivity toward aqueous chlorinated phenols
    Zimbron, JA
    Reardon, KF
    WATER RESEARCH, 2005, 39 (05) : 865 - 869
  • [7] Toward Predicting Efficiency of Organic Solar Cells via Machine Learning and Improved Descriptors
    Sahu, Harikrishna
    Rao, Weining
    Troisi, Alessandro
    Ma, Haibo
    ADVANCED ENERGY MATERIALS, 2018, 8 (24)
  • [8] Modeling the reactivity of ozone and sulphate radicals towards organic chemicals in water using machine learning approaches
    Gupta, Shikha
    Basant, Nikita
    RSC ADVANCES, 2016, 6 (110) : 108448 - 108457
  • [9] Classification and Regression Machine Learning Models for Predicting Aerobic Ready and Inherent Biodegradation of Organic Chemicals in Water
    Huang, Kuan
    Zhang, Huichun
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2022, 56 (17) : 12755 - 12764
  • [10] Organic reactivity from mechanism to machine learning
    Kjell Jorner
    Anna Tomberg
    Christoph Bauer
    Christian Sköld
    Per-Ola Norrby
    Nature Reviews Chemistry, 2021, 5 : 240 - 255