Machine learning for predicting halogen radical reactivity toward aqueous organic chemicals

被引:4
|
作者
Liang, Youheng [1 ]
Huangfu, Xiaoliu [1 ]
Huang, Ruixing [1 ]
Han, Zhenpeng [1 ]
Wu, Sisi [1 ]
Wang, Jingrui [1 ]
Long, Xinlong [1 ]
Ma, Jun [2 ]
He, Qiang [1 ]
机构
[1] Chongqing Univ, Coll Environm, Key Lab Ecoenvironm Three Gorges Reservoir Reg, Minist Educ, Chongqing 400044, Peoples R China
[2] Harbin Inst Technol, Sch Municipal & Environm Engn, State Key Lab Urban Water Resources & Environm, Harbin 150090, Peoples R China
基金
中国国家自然科学基金;
关键词
Halogen radical rate constants; Morgan fingerprint; Mordred descriptor; Machine learning; Web application; DEGRADATION; UV/CHLORINE; OXIDATION; QSAR; CONTAMINANTS; KINETICS;
D O I
10.1016/j.jhazmat.2024.134501
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Rapid advances in machine learning (ML) provide fast, accurate, and widely applicable methods for predicting free radical-mediated organic pollutant reactivity. In this study, the rate constants (logk) of four halogen radicals were predicted using Morgan fingerprint (MF) and Mordred descriptor (MD) in combination with a series of ML models. The findings highlighted that making accurate predictions for various datasets depended on an effective combination of descriptors and algorithms. To further alleviate the challenge of limited sample size, we introduced a data combination strategy that improved prediction accuracy and mitigated overfitting by combining different datasets. The Light Gradient Boosting Machine (LightGBM) with MF and Random Forest (RF) with MD models based on the unified dataset were finally selected as the optimal models. The SHapley Additive exPlanations revealed insights: the MF-LightGBM model successfully captured the influence of electron-withdrawing/ donating groups, while autocorrelation, walk count and information content descriptors in the MD-RF model were identified as key features. Furthermore, the important contribution of pH was emphasized. The results of the applicability domain analysis further supported that the developed model can make reliable predictions for query compounds across a broader range. Finally, a practical web application for logk calculations was built.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Predicting Rate Constants of Reactive Chlorine Species toward Organic Compounds by Combining Machine Learning and Quantum Chemical Calculation
    Zheng, Shanshan
    Qin, Wenlei
    Ji, He
    Guo, Wanqian
    Fang, Jingyun
    ENVIRONMENTAL SCIENCE & TECHNOLOGY LETTERS, 2023, 10 (09) : 804 - 809
  • [32] Controlling an organic synthesis robot with machine learning to search for new reactivity
    Granda, Jaroslaw M.
    Donina, Liva
    Dragone, Vincenza
    Long, De-Liang
    Cronin, Leroy
    NATURE, 2018, 559 (7714) : 377 - +
  • [33] Predicting Aqueous Adsorption of Organic Compounds onto Biochars, Carbon Nanotubes, Granular Activated Carbons, and Resins with Machine Learning
    Zhang, Kai
    Zhong, Shifa
    Zhang, Huichun
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2020, 54 (11) : 7008 - 7018
  • [34] PREDICTING VALENCE AND AROUSAL FROM PUPIL REACTIVITY IN VR: A MACHINE LEARNING APPROACH
    Pacula, Beata
    Jemiolo, Pawel
    Szymanska, Agata
    Kuniecki, Michal
    PSYCHOPHYSIOLOGY, 2024, 61 : S213 - S213
  • [35] Controlling an organic synthesis robot with machine learning to search for new reactivity
    Jarosław M. Granda
    Liva Donina
    Vincenza Dragone
    De-Liang Long
    Leroy Cronin
    Nature, 2018, 559 : 377 - 381
  • [36] Development of a QSAR model for predicting aqueous reaction rate constants of organic chemicals with hydroxyl radicals
    Luo, Xiang
    Yang, Xianhai
    Qiao, Xianliang
    Wang, Ya
    Chen, Jingwen
    Wei, Xiaoxuan
    Peijnenburg, Willie J. G. M.
    ENVIRONMENTAL SCIENCE-PROCESSES & IMPACTS, 2017, 19 (03) : 350 - 356
  • [37] INFRARED SPECTROSCOPY-BASED PROPERTY REACTIVITY CORRELATIONS FOR PREDICTING ENVIRONMENTAL FATE OF ORGANIC-CHEMICALS
    COLLETTE, TW
    ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY, 1992, 11 (07) : 981 - 991
  • [38] On the reactivity of anodically generated trifluoromethyl radicals toward aryl alkynes in organic/aqueous media
    Jud, Wolfgang
    Kappe, C. Oliver
    Cantillo, David
    ORGANIC & BIOMOLECULAR CHEMISTRY, 2019, 17 (14) : 3529 - 3537
  • [39] On the reactivity of anodically generated trifluoromethyl radicals toward aryl alkynes in organic/aqueous media
    Jud W.
    Kappe C.O.
    Cantillo D.
    Organic and Biomolecular Chemistry, 2019, 17 (14): : 3529 - 3537
  • [40] Construction of interpretable ensemble learning models for predicting bioaccumulation parameters of organic chemicals in fish
    Zhu, Minghua
    Xiao, Zijun
    Zhang, Tao
    Lu, Guanghua
    JOURNAL OF HAZARDOUS MATERIALS, 2025, 482