Significant Improvement in Soil Organic Carbon Estimation Using Data-Driven Machine Learning Based on Habitat Patches

被引:2
|
作者
Yu, Wenping [1 ,2 ]
Zhou, Wei [2 ,3 ]
Wang, Ting [2 ]
Xiao, Jieyun [2 ]
Peng, Yao [2 ]
Li, Haoran [4 ]
Li, Yuechen [2 ]
机构
[1] Chinese Acad Agr Sci, Inst Agr Resources & Reg Planning, State Key Lab Efficient Utilizat Arid & Semiarid A, Beijing 100081, Peoples R China
[2] Southwest Univ, Chongqing Engn Res Ctr Remote Sensing Big Data App, Sch Geog Sci, Chongqing Jinfo Mt Karst Ecosyst Natl Observat & R, Chongqing 400715, Peoples R China
[3] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing 100101, Peoples R China
[4] Minist Nat Resources, Topog Survey Team 6, Chengdu 610500, Peoples R China
关键词
soil organic carbon; clustering algorithm; machine learning; digital soil mapping; SUPPORT VECTOR MACHINE; CLIMATE-CHANGE; RANDOM FOREST; STOCKS; CLASSIFICATION; MODELS; SEQUESTRATION; REGRESSION; VEGETATION; PREDICTION;
D O I
10.3390/rs16040688
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Soil organic carbon (SOC) is generally thought to act as a carbon sink; however, in areas with high spatial heterogeneity, using a single model to estimate the SOC of the whole study area will greatly reduce the simulation accuracy. The earth surface unit division is important to consider in building different models. Here, we divided the research area into different habitat patches using partitioning around a medoids clustering (PAM) algorithm; then, we built an SOC simulation model using machine learning algorithms. The results showed that three habitat patches were created. The simulation accuracy for Habitat Patch 1 (R2 = 0.55; RMSE = 2.89) and Habitat Patch 3 (R2 = 0.47; RMSE = 3.94) using the XGBoost model was higher than that for the whole study area (R2 = 0.44; RMSE = 4.35); although the R2 increased by 25% and 6.8%, the RMSE decreased by 33.6% and 9.4%, and the field sample points significantly declined by 70% and 74%. The R2 of Habitat Patch 2 using the RF model increased by 17.1%, and the RMSE also decreased by 10.5%; however, the sample points significantly declined by 58%. Therefore, using different models for corresponding patches will significantly increase the SOC simulation accuracy over using one model for the whole study area. This will provide scientific guidance for SOC or soil property monitoring with low field survey costs and high simulation accuracy.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] A Data-Driven Predictive Approach for Drug Delivery Using Machine Learning Techniques
    Li, YuanYuan
    Lenaghan, Scott C.
    Zhang, Mingjun
    [J]. PLOS ONE, 2012, 7 (02):
  • [42] Data-Driven Trend Forecasting in Stock Market Using Machine Learning Techniques
    Misra, Puneet
    Chaurasia, Siddharth
    [J]. JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2020, 13 (01) : 130 - 149
  • [43] A paradigm for data-driven predictive modeling using field inversion and machine learning
    Parish, Eric J.
    Duraisamy, Karthik
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 305 : 758 - 774
  • [44] A Review of Data-Driven Machinery Fault Diagnosis Using Machine Learning Algorithms
    Jian Cen
    Zhuohong Yang
    Xi Liu
    Jianbin Xiong
    Honghua Chen
    [J]. Journal of Vibration Engineering & Technologies, 2022, 10 : 2481 - 2507
  • [45] Data-driven surrogate modeling of multiphase flows using machine learning techniques
    Ganti, Himakar
    Khare, Prashant
    [J]. COMPUTERS & FLUIDS, 2020, 211
  • [46] A Review of Data-Driven Machinery Fault Diagnosis Using Machine Learning Algorithms
    Cen, Jian
    Yang, Zhuohong
    Liu, Xi
    Xiong, Jianbin
    Chen, Honghua
    [J]. JOURNAL OF VIBRATION ENGINEERING & TECHNOLOGIES, 2022, 10 (07) : 2481 - 2507
  • [47] Data-driven monitoring of the gearbox using multifractal analysis and machine learning methods
    Puchalski, Andrzej
    Komorska, Iwona
    [J]. III INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN ENGINEERING SCIENCE (CMES 18), 2019, 252
  • [48] Predicting the state of charge and health of batteries using data-driven machine learning
    Man-Fai Ng
    Jin Zhao
    Qingyu Yan
    Gareth J. Conduit
    Zhi Wei Seh
    [J]. Nature Machine Intelligence, 2020, 2 : 161 - 170
  • [49] Data-driven track geometry fault localisation using unsupervised machine learning
    Popov, K.
    De Bold, R.
    Chai, H. -K.
    Forde, M. C.
    Ho, C. L.
    Hyslip, J. P.
    Kashani, H. F.
    Kelly, R.
    Hsu, S. S.
    Rippin, M.
    [J]. CONSTRUCTION AND BUILDING MATERIALS, 2023, 377
  • [50] Generation of Automatic Data-Driven Feedback to Students Using Explainable Machine Learning
    Afzaal, Muhammad
    Nouri, Jalal
    Zia, Aayesha
    Papapetrou, Panagiotis
    Fors, Uno
    Wu, Yongchao
    Li, Xiu
    Weegar, Rebecka
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2021), PT II, 2021, 12749 : 37 - 42