Estimation of Forest Aboveground Biomass in Northwest Hunan Province Based on Machine Learning and Multi-Source Data

被引:5
|
作者
Ding J. [1 ,3 ]
Huang W. [1 ,2 ]
Liu Y. [4 ]
Hu Y. [5 ]
机构
[1] School of Resource and Environmental Sciences, Wuhan University, Wuhan
[2] Key Laboratory of Urban Land Resource Monitoring and Simulation of Ministry of Natural Resources, Shenzhen
[3] College of Urban and Environmental Sciences, Peking University, Beijing
[4] Academy of Forest Inventory and Planning, National Forestry and Grassland Administration, Beijing
[5] Key Laboratory for Restoration and Reconstruction of Degraded Ecosystem in Northwest China of Ministry of Education, Yinchuan
来源
Linye Kexue/Scientia Silvae Sinicae | 2021年 / 57卷 / 10期
关键词
Forest aboveground biomass(AGB); Multiple linear regression(MLR); Random forest(RF); Support vector regression(SVR);
D O I
10.11707/j.1001-7488.20211004
中图分类号
学科分类号
摘要
【Objective】Aiming at the problems of high cost, low timeliness and poor uniformity of the results of traditional forest inventory method, based on multi-source remote sensing data, machine learning method was used to select characteristic variables and establish an estimation model to make the map products of aboveground biomass(AGB) in the study area in order to provide technical means for forest resource information survey. 【Method】Taking the northwest of Hunan Province as the study area, the AGB reference values of 393 sample plots were selected by using allometric growth equations to convert the survey data into AGB. The Landsat-8 data were used as the optical remote sensing data source to extract spectral information, vegetation index, texture feature and the components of tasseled cap transformation. ALOS PALSAR-2 and Sentinel-1 data were used as radar remote sensing data sources to extract backscatter intensity and normalized polarization difference index for each polarization mode. A total of 122 candidate feature variables were obtained including topographical variables(elevation, slope and aspect). Multivariate linear regression(MLR), random forest(RF) and support vector regression(SVR) models were established after selecting the modeling variables by stepwise regression and random forest method. Using the coefficient of determination(R2) and root mean square error(RMSE) as model evaluation index, the models were evaluated by ten-fold cross-validation method, the best model was selected to complete biomass mapping, and five biomass map products at China or global scale were selected for comparative analysis. 【Result】For the training set, the random forest model performed the best with RMSE=12.8 mg•hm-2, rRMSE=21.1%, and R2=0.93, which fitted the data well, followed by the support vector regression model (RMSE=26.1 mg•hm-2, rRMSE=43.3%, R2=0.55) and the multivariate linear regression model(RMSE=30.9 mg•hm-2, rRMSE=50.5%, R2=0.39). On the test set, the model performance achieved by RF method (RMSE=30.1 mg•hm-2, rRMSE=51.3%, R2=0.42) was also better than that of MLR(RMSE=32.6 mg•hm-2, rRMSE=54.1%, R2=0.30) and SVR(RMSE=32.8 mg•hm-2, rRMSE=55.3%, R2=0.25). At the same time, all three models showed a certain degree of underestimation over small AGB and overestimation over large AGB. The RF model selected 13 modeling variables, including PALSAR-2 backscatter information, elevation and Landsat-8 spectral information, vegetation index, and the difference between tasseled cap transformation humidity and greenness components. Regional biomass mapping was completed using the RF model. Compared with other products, this model could basically reflect the distribution of biomass in the study area and showed abundant detailed information on biomass distribution. The range of biomass in the study area was 0-119 mg•hm-2, the average biomass was 37.5 mg•hm-2 and the standard deviation was 35.9 mg•hm-2. 【Conclusion】Combined with multi-source remote sensing data and machine learning algorithm, large-scale biomass could be accurately and quickly calculated, which would be great application potentials. Compared with SVR model and MLR model, RF model performed better in AGB estimation for the study area. The RF algorithm could effectively select variables for AGB machine learning modeling from multi-source variables. © 2021, Editorial Department of Scientia Silvae Sinicae. All right reserved.
引用
收藏
页码:36 / 48
页数:12
相关论文
共 33 条
  • [11] Wu Y R., Concept of multidimensional space joint-observation SAR, Journal of Radars, 2, 2, pp. 135-142, (2013)
  • [12] Xu W Y, Jin X B, Yang X H, Et al., The estimation of forest vegetation biomass in China in spatial grid, Journal of Natural Resources, 33, 10, pp. 1725-1741, (2018)
  • [13] Asner G P., Cloud cover in Landsat observations of the Brazilian Amazon, International Journal of Remote Sensing, 22, 18, pp. 3855-3862, (2001)
  • [14] Astola H, Hame T, Sirro L, Et al., Comparison of Sentinel-2 and Landsat 8 imagery for forest variable prediction in boreal region, Remote Sensing of Environment, 223, pp. 257-273, (2019)
  • [15] Chen G, Hay G J, Zhou Y L., Estimation of forest height, biomass and volume using support vector regression and segmentation from LiDAR transects and Quickbird imagery, 2010 18th International Conference on Geoinformatics, pp. 1-4, (2010)
  • [16] Chi H, Sun G Q, Huang J L, Et al., National forest aboveground biomass mapping from ICESat/GLAS Data and MODIS imagery in China, Remote Sensing, 7, pp. 5534-5564, (2015)
  • [17] Foody G M, Boyd D S, Cutler M E J., Predictive relations of tropical forest biomass from Landsat TM data and their transferability between regions, Remote Sensing of Environment, 85, 4, pp. 463-474, (2003)
  • [18] Guner S T, Comez A, Ozkan K., Predicting soil and forest floor carbon stocks in Western Anatolian Scots pine stands, Turkey, African Journal of Agricultural Research, 7, 28, pp. 4075-4083, (2012)
  • [19] Hu T Y, Su Y J, Xue B L, Et al., Mapping global forest aboveground biomass with spaceborne LiDAR, optical imagery, and forest inventory data, Remote Sensing, 8, 7, (2016)
  • [20] Hu Y, Xu X L, Wu F Y, Et al., Estimating forest stock volume in Hunan Province, China, by integrating in situ plot data, Sentinel-2 images, and linear and machine learning regression models, Remote Sensing, 12, 1, (2020)