Comparison of boosted regression tree and random forest models for mapping topsoil organic carbon concentration in an alpine ecosystem

被引:276
|
作者
Yang, Ren-Min [1 ,2 ]
Zhang, Gan-Lin [1 ,2 ]
Liu, Feng [1 ]
Lu, Yuan-Yuan [1 ,2 ]
Yang, Fan [1 ,2 ]
Yang, Fei [1 ,2 ]
Yang, Min [1 ,2 ]
Zhao, Yu-Guo [1 ]
Li, De-Cheng [1 ]
机构
[1] Chinese Acad Sci, Inst Soil Sci, State Key Lab Soil & Sustainable Agr, Nanjing 210008, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Digital soil mapping; Soil organic carbon; Boosted regression tree; Random forest; Environmental variables; Tibetan Plateau; SOIL CARBON; SPATIAL-DISTRIBUTION; STOCKS; VEGETATION; STORAGE; IMAGERY; QUANTIFICATION; CLASSIFICATION; VARIABILITY; PREDICTION;
D O I
10.1016/j.ecolind.2015.08.036
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
Soil organic carbon (SOC) plays an important role in soil fertility and carbon sequestration, and a better understanding of the spatial patterns of SOC is essential for soil resource management. In this study, we used boosted regression tree (BRT) and random forest (RF) models to map the distribution of topsoil organic carbon content at the northeastern edge of the Tibetan Plateau in China. A set 01 105 soil samples and 12 environmental variables (including topography, climate and vegetation) were analyzed. The performance of the models was evaluated using a 10-fold cross-validation procedure. Maps of the mean values and standard deviations of SOC were generated to illustrate model variability and uncertainty. The results indicate that the BRT and RF models exhibited very similar performance and yielded similar predicted distributions of SOC. The two models explained approximately 70% of the total SOC variability. The BRT and RF models robustly predicted the SOC at low observed SOC values, whereas they underestimated high observed SOC values. This underestimation may have been caused by biased distributions of soil samples in the SOC space. Vegetation-related variables were assigned the highest importance in both models, followed by climate and topography. Both models produced spatial distribution maps of SOC that were closely related to vegetation cover. The SOC content predicted by the BRT model was clearly higher than that of the RF model in areas with greater vegetation cover because the contributions of vegetation-related variables in the two models (65% and 43%, respectively) differed significantly. The predicted SOC content increased from the northwestern to the southeastern part of the study area, average values produced by the BRT and RF models were 27.3 g kg(-1) and 26.6 g kg(-1), respectively. We conclude that the BRT and RF methods should be calibrated and compared to obtain the best prediction of SOC spatial distribution in similar regions. In addition, vegetation variables, including those obtained from remote sensing imagery, should be taken as the main environmental indicators and explicitly included when generating SOC maps in Alpine environments. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:870 / 878
页数:9
相关论文
共 50 条
  • [1] Landslide Susceptibility Mapping Based on Random Forest and Boosted Regression Tree Models, and a Comparison of Their Performance
    Park, Soyoung
    Kim, Jinsoo
    APPLIED SCIENCES-BASEL, 2019, 9 (05):
  • [2] GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran
    Naghibi, Seyed Amir
    Pourghasemi, Hamid Reza
    Dixon, Barnali
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2016, 188 (01) : 1 - 27
  • [3] GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran
    Seyed Amir Naghibi
    Hamid Reza Pourghasemi
    Barnali Dixon
    Environmental Monitoring and Assessment, 2016, 188
  • [4] Stream water quality prediction using boosted regression tree and random forest models
    Ali O. Alnahit
    Ashok K. Mishra
    Abdul A. Khan
    Stochastic Environmental Research and Risk Assessment, 2022, 36 : 2661 - 2680
  • [5] Stream water quality prediction using boosted regression tree and random forest models
    Alnahit, Ali O.
    Mishra, Ashok K.
    Khan, Abdul A.
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2022, 36 (09) : 2661 - 2680
  • [6] Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia
    Youssef, Ahmed Mohamed
    Pourghasemi, Hamid Reza
    Pourtaghi, Zohre Sadat
    Al-Katheeri, Mohamed M.
    LANDSLIDES, 2016, 13 (05) : 839 - 856
  • [7] Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia
    Ahmed Mohamed Youssef
    Hamid Reza Pourghasemi
    Zohre Sadat Pourtaghi
    Mohamed M. Al-Katheeri
    Landslides, 2016, 13 : 839 - 856
  • [8] Mapping the estuarine ecosystem service of pollutant removal using empirically validated boosted regression tree models
    Lohrer, Andrew M.
    Stephenson, Fabrice
    Douglas, Emily J.
    Townsend, Michael
    ECOLOGICAL APPLICATIONS, 2020, 30 (05)
  • [9] Comparison of Random Forest and Kriging Models for Soil Organic Carbon Mapping in the Himalayan Region of Kashmir
    Farooq, Iqra
    Bangroo, Shabir Ahmed
    Bashir, Owais
    Shah, Tajamul Islam
    Malik, Ajaz A.
    Iqbal, Asif M.
    Mahdi, Syed Sheraz
    Wani, Owais Ali
    Nazir, Nageena
    Biswas, Asim
    LAND, 2022, 11 (12)
  • [10] Forest stand susceptibility mapping during harvesting using logistic regression and boosted regression tree machine learning models
    Shabani, Saeid
    Pourghasemi, Hamid Reza
    Blaschke, Thomas
    GLOBAL ECOLOGY AND CONSERVATION, 2020, 22