Significant Improvement in Soil Organic Carbon Estimation Using Data-Driven Machine Learning Based on Habitat Patches

被引:2
|
作者
Yu, Wenping [1 ,2 ]
Zhou, Wei [2 ,3 ]
Wang, Ting [2 ]
Xiao, Jieyun [2 ]
Peng, Yao [2 ]
Li, Haoran [4 ]
Li, Yuechen [2 ]
机构
[1] Chinese Acad Agr Sci, Inst Agr Resources & Reg Planning, State Key Lab Efficient Utilizat Arid & Semiarid A, Beijing 100081, Peoples R China
[2] Southwest Univ, Chongqing Engn Res Ctr Remote Sensing Big Data App, Sch Geog Sci, Chongqing Jinfo Mt Karst Ecosyst Natl Observat & R, Chongqing 400715, Peoples R China
[3] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing 100101, Peoples R China
[4] Minist Nat Resources, Topog Survey Team 6, Chengdu 610500, Peoples R China
关键词
soil organic carbon; clustering algorithm; machine learning; digital soil mapping; SUPPORT VECTOR MACHINE; CLIMATE-CHANGE; RANDOM FOREST; STOCKS; CLASSIFICATION; MODELS; SEQUESTRATION; REGRESSION; VEGETATION; PREDICTION;
D O I
10.3390/rs16040688
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Soil organic carbon (SOC) is generally thought to act as a carbon sink; however, in areas with high spatial heterogeneity, using a single model to estimate the SOC of the whole study area will greatly reduce the simulation accuracy. The earth surface unit division is important to consider in building different models. Here, we divided the research area into different habitat patches using partitioning around a medoids clustering (PAM) algorithm; then, we built an SOC simulation model using machine learning algorithms. The results showed that three habitat patches were created. The simulation accuracy for Habitat Patch 1 (R2 = 0.55; RMSE = 2.89) and Habitat Patch 3 (R2 = 0.47; RMSE = 3.94) using the XGBoost model was higher than that for the whole study area (R2 = 0.44; RMSE = 4.35); although the R2 increased by 25% and 6.8%, the RMSE decreased by 33.6% and 9.4%, and the field sample points significantly declined by 70% and 74%. The R2 of Habitat Patch 2 using the RF model increased by 17.1%, and the RMSE also decreased by 10.5%; however, the sample points significantly declined by 58%. Therefore, using different models for corresponding patches will significantly increase the SOC simulation accuracy over using one model for the whole study area. This will provide scientific guidance for SOC or soil property monitoring with low field survey costs and high simulation accuracy.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Soil organic carbon estimation using remote sensing data-driven machine learning
    Chen, Qi
    Wang, Yiting
    Zhu, Xicun
    [J]. PEERJ, 2024, 12
  • [2] Estimation of data-driven streamflow predicting models using machine learning methods
    Siddiqi T.A.
    Ashraf S.
    Khan S.A.
    Iqbal M.J.
    [J]. Arabian Journal of Geosciences, 2021, 14 (11)
  • [3] A data-driven QSPR model for screening organic corrosion inhibitors for carbon steel using machine learning techniques
    Pham, Thanh Hai
    Le, Phung K.
    Son, Do Ngoc
    [J]. RSC ADVANCES, 2024, 14 (16) : 11157 - 11168
  • [4] Data-Driven Soil Analysis and Evaluation for Smart Farming Using Machine Learning Approaches
    Huang, Yixin
    Srivastava, Rishi
    Ngo, Chloe
    Gao, Jerry
    Wu, Jane
    Chiao, Sen
    [J]. AGRICULTURE-BASEL, 2023, 13 (09):
  • [5] Data-Driven Estimation of a Driving Safety Tolerance Zone Using Imbalanced Machine Learning
    Garefalakis, Thodoris
    Katrakazas, Christos
    Yannis, George
    [J]. SENSORS, 2022, 22 (14)
  • [6] Dynamic Data-Driven Carbon-Based Electric Vehicle Charging Pricing Strategy Using Machine Learning
    Garrido, Jacqueline
    Barth, Matthew J.
    Enriquez-Contreras, Luis
    Hasan, Asm Jahid
    Todd, Michael
    Ula, Sadrul
    Yusuf, Jubair
    [J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 1670 - 1676
  • [7] Data-Driven Load Forecasting Using Machine Learning and Meteorological Data
    Alrashidi A.
    Qamar A.M.
    [J]. Computer Systems Science and Engineering, 2023, 44 (03): : 1973 - 1988
  • [8] Measurement of Soil Organic Matter and Total Nitrogen Based on Visible/Near Infrared Spectroscopy and Data-Driven Machine Learning Method
    Zhang Hai-liang
    Xie Chao-yong
    Tian Peng
    Zhan Bai-shao
    Chen Zai-liang
    Luo Wei
    Liu Xue-mei
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43 (07) : 2226 - 2231
  • [9] Fracture Permeability Estimation Under Complex Physics: A Data-Driven Model Using Machine Learning
    He, Xupeng
    AlSinan, Marwah M.
    Kwak, Hyung T.
    Hoteit, Hussein
    [J]. Saudi Aramco Journal of Technology, 2022, 2022 : 2 - 11
  • [10] Data-driven recipe completion using machine learning methods
    De Clercq, Marlies
    Stock, Michiel
    De Baets, Bernard
    Waegeman, Willem
    [J]. TRENDS IN FOOD SCIENCE & TECHNOLOGY, 2016, 49 : 1 - 13