Discriminative algorithm approach to forecast Cd threshold exceedance probability for rice grain based on soil characteristics

被引:10
|
作者
Yang, Jun [1 ,2 ]
Zhao, Chen [1 ,2 ]
Yang, Junxing [1 ,2 ]
Wang, Jingyun [1 ,2 ]
Li, Zhitao [3 ]
Wan, Xiaoming [1 ,2 ]
Guo, Guanghui [1 ,2 ]
Lei, Mei [1 ,2 ]
Chen, Tongbin [1 ,2 ]
机构
[1] Chinese Acad Sci, Ctr Environm Remediat, Inst Geog Sci & Nat Resources Res, 11A Datun Rd, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Minist Ecol & Environm, Tech Ctr Soil Agr & Rural Ecol & Environm, Beijing 100012, Peoples R China
关键词
Probabilistic forecasting model; Discriminative algorithm; Cadmium; Rice grain; Soil; HEAVY-METAL CONCENTRATIONS; LOGISTIC-REGRESSION; ORGANIC-MATTER; CADMIUM UPTAKE; PADDY FIELDS; MICROBIAL BIOMASS; TRACE-ELEMENTS; EDIBLE PARTS; MINING AREA; HAN RIVER;
D O I
10.1016/j.envpol.2020.114211
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The relationship between cadmium (Cd) concentration in rice grains and the soil that they are cultivated in is highly uncertain due to the influence of soil properties, rice varieties, and other undetermined factors. In this study, we introduce the probability of exceeding the threshold to characterize this uncertainty and then, build a probabilistic forewarning model. Additionally, a number of associated factors have been used as parameters to improve model performance. Considering that the physicochemical properties and Cd concentration in the soil (Cd-soil) do not follow a normal distribution, and are not independent of each other, a discriminative algorithm, represented by a logistic regression (LR), performed better than generative algorithms, such as the naive Bayes and quadratic discriminant analysis models. The performance of the LR based model was found to be 0.5% better in the case of the univariate model (Cd-soil) and 4.1% better with a multivariate model (soil properties used as additional factors) (p < 0.01). The output of the LR based model predicted probabilities that were positively correlated to the true exceedance rate (R-2 = 0.949,p < 0.01), within an exceedance threshold range of 0.1-0.4 mg kg(-1) and a mean deviation of 5.75%. A sensitivity analysis showed that the effect of soil properties on the exceedance probability weakens with an increase in Cd concentration in rice grains. When the threshold is below 0.15 mg kg(-1), soil pH strongly influences the exceedance probability. As the threshold increases, the influence of pH on the exceedance probability is gradually superseded. By quantifying the uncertainty regarding the relationship between Cd concentration in rice grains and soil, the discriminative algorithm-based probabilistic forecasting model offers a new way to assess Cd pollution in rice grown in contaminated paddy fields. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 3 条
  • [1] Classification and regression tree (CART) for predicting cadmium (Cd) uptake by rice ( Oryza sativa L.) and its application to derive soil Cd threshold based on field data
    Tian, Haoting
    Zhang, Yan
    Yang, Xiaohui
    Zhang, Huan
    Wang, Dengfeng
    Wu, Pengbao
    Yin, Aijing
    Gao, Chao
    ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY, 2024, 285
  • [2] A novel method for predicting cadmium concentration in rice grain using genetic algorithm and back-propagation neural network based on soil properties
    Yi Xuan Hou
    Hua Fu Zhao
    Zhuo Zhang
    Ke Ning Wu
    Environmental Science and Pollution Research, 2018, 25 : 35682 - 35692
  • [3] A novel method for predicting cadmium concentration in rice grain using genetic algorithm and back-propagation neural network based on soil properties
    Hou, Yi Xuan
    Zhao, Hua Fu
    Zhang, Zhuo
    Wu, Ke Ning
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2018, 25 (35) : 35682 - 35692