More Data or a Better Model? Figuring Out What Matters Most for the Spatial Prediction of Soil Carbon

被引:75
|
作者
Somarathna, P. D. S. N. [1 ]
Minasny, Budiman [1 ]
Malone, Brendan P. [1 ]
机构
[1] Univ Sydney, Sydney Inst Agr, Sch Life & Environm Sci, Sydney, NSW, Australia
关键词
GEOGRAPHICALLY WEIGHTED REGRESSION; ORGANIC-CARBON; SAMPLE-SIZE; COEFFICIENT; INFERENCE; ACCURACY; INDEX;
D O I
10.2136/sssaj2016.11.0376
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Modeling techniques used in digital soil carbon mapping encompass a variety of algorithms to address spatial prediction problems such as spatial non-stationarity, nonlinearity and multi-colinearity. A given study site can inherit one or more such spatial prediction problems, necessitating the use of a combination of statistical learning algorithms to improve the accuracy of predictions. In addition, the training sample size may affect the accuracy of the model predictions. The effect of varying sample size on model accuracy has not been widely studied in pedometrics. To help fill this gap, we examined the behavior of multiple linear regression (MLR), geographically weighted regression (GWR), linear mixed models (LMMs), Cubist regression trees, quantile regression forests (QRFs), and extreme learning machine regression (ELMR) under varying sample sizes. The results showed that for the study site in the Hunter Valley, Australia, the accuracy of spatial prediction of soil carbon is more sensitive to training sample size compared to the model type used. The prediction accuracy initially increases exponentially with increasing sample size, eventually reaching a plateau. Different models reach their maximum predictive potential at different sample sizes. Furthermore, the uncertainty of model predictions decreases with increasing training sample sizes.
引用
收藏
页码:1413 / 1426
页数:14
相关论文
共 25 条
  • [21] Prediction of spatial soil property information from ancillary sensor data using ordinary linear regression: Model derivations, residual assumptions and model validation tests
    Lesch, S. M.
    Corwin, D. L.
    GEODERMA, 2008, 148 (02) : 130 - 140
  • [22] Spatial distribution prediction of soil As in a large-scale arsenic slag contaminated site based on an integrated model and multi-source environmental data
    Liu, Geng
    Zhou, Xin
    Li, Qiang
    Shi, Ying
    Guo, Guanlin
    Zhao, Long
    Wang, Jie
    Su, Yingqing
    Zhang, Chao
    ENVIRONMENTAL POLLUTION, 2020, 267
  • [23] Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing Data
    Zhan, Dexi
    Mu, Yongqi
    Duan, Wenxu
    Ye, Mingzhu
    Song, Yingqiang
    Song, Zhenqi
    Yao, Kaizhong
    Sun, Dengkuo
    Ding, Ziqi
    AGRICULTURE-BASEL, 2023, 13 (05):
  • [24] An advanced soil organic carbon content prediction model via fused temporal-spatial-spectral (TSS) information based on machine learning and deep learning algorithms
    Meng, Xiangtian
    Bao, Yilin
    Wang, Yiang
    Zhang, Xinle
    Liu, Huanjun
    REMOTE SENSING OF ENVIRONMENT, 2022, 280
  • [25] Quantitative assessment of different straw management practices on soil organic carbon and crop yield in the Chinese upland soils: A data-driven approach based on simulation and prediction model
    Ul Islam, Mahbub
    Jiang, Fahui
    Halder, Milton
    Barman, Alak
    Liu, Shuai
    Peng, Xinhua
    EUROPEAN JOURNAL OF AGRONOMY, 2024, 154