A data-driven framework to identify influencing factors for soil heavy metal contaminations using random forest and bivariate local Moran's I: A case study

被引:0
|
作者
Zhou, Rui [1 ]
Chen, Jian [2 ]
Cui, Shiwen [1 ,2 ]
Li, Lu [2 ]
Qian, Jiangbo [3 ]
Zhao, Hang [2 ]
Huang, Guoxin [2 ]
机构
[1] Jilin Univ, Coll New Energy & Environm, Changchun 130012, Peoples R China
[2] Chinese Acad Environm Planning, 15 Shixing St, Beijing 100041, Peoples R China
[3] Zhejiang Kehuan Environm Engn Technol Corp Ltd, Hangzhou 311200, Peoples R China
关键词
Influencing factors; Soil; Heavy metals; Random forest; Bivariate local Moran's I; AGRICULTURAL SOILS; SPATIAL-DISTRIBUTION; CITY; POLLUTION; NORTH; CHINA;
D O I
10.1016/j.jenvman.2025.124172
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The efficacy of traceability analysis is often limited by a lack of information on influencing factors for heavy metal (HM) contaminations in soil, such as spatial correlations between HM concentrations and influencing factors. To overcome this limitation, a novel data-driven framework was established to identify influencing factors for soil HM concentrations in an industrialised study area, in Guangdong Province, China, mainly using random forest (RF) and bivariate local Moran's I (BLMI) on the basis of the 577 soil samples and the 18 environmental covariates. The quantitative contributions of the 18 influencing factors for the Cd, As, Pb, and Cr concentrations were determined by the optimised RF. The main influencing factors of Cd were petrol stations (10.97%) and railways (9.99%), the main ones of As were groundwater depth (8.45%) and elevation (8.24%), the main ones of Pb were soil pH (8.82%) and hazardous waste disposal sites (8.02%), and the main ones of Cr were mine tailings (13.65%) and rainfall (11.88%). The eight spatial clustering maps between the four HM concentrations and the two key influencing factors were generated by BLMI. The middle part of the study area has shown the higher concentrations of Cd, As, Pb, and Cr, the more complex human activities and the more high- high clusters. Priority attention should be paid to the middle part when taking the specific prevention and control measures for their contaminations. This data-driven framework provided rich information on influencing factors, including HM concentrations, HM contaminations, quantitative contributions, and qualitative spatial clusters.
引用
收藏
页数:10
相关论文
共 4 条
  • [1] A hybrid data-driven framework for diagnosing contributing factors for soil heavy metal contaminations using machine learning and spatial clustering analysis
    Huang, Guoxin
    Wang, Xiahui
    Chen, Di
    Wang, Yipeng
    Zhu, Shouxin
    Zhang, Tao
    Liao, Lei
    Tian, Zi
    Wei, Nan
    JOURNAL OF HAZARDOUS MATERIALS, 2022, 437
  • [2] A hybrid data-driven framework for diagnosing contributing factors for soil heavy metal contaminations using machine learning and spatial clustering analysis
    Huang, Guoxin
    Wang, Xiahui
    Chen, Di
    Wang, Yipeng
    Zhu, Shouxin
    Zhang, Tao
    Liao, Lei
    Tian, Zi
    Wei, Nan
    JOURNAL OF HAZARDOUS MATERIALS, 2022, 437
  • [3] Risk hotspots and influencing factors identification of heavy metal(loid)s in agricultural soils using spatial bivariate analysis and random forest
    Xu, Xiaohang
    Xu, Zhidong
    Liang, Longchao
    Han, Jialiang
    Wu, Gaoen
    Lu, Qinhui
    Liu, Lin
    Li, Pan
    Han, Qiao
    Wang, Le
    Zhang, Sensen
    Hu, Yanhai
    Jiang, Yuping
    Qiu, Guangle
    Wu, Pan
    SCIENCE OF THE TOTAL ENVIRONMENT, 2024, 954
  • [4] Mapping potential wetlands by a new framework method using random forest algorithm and big Earth data: A case study in China's Yangtze River Basin
    Xiang, Hengxing
    Xi, Yanbiao
    Mao, Dehua
    Mahdianpari, Masoud
    Zhang, Jian
    Wang, Ming
    Jia, Mingming
    Yu, Fudong
    Wang, Zongming
    GLOBAL ECOLOGY AND CONSERVATION, 2023, 42